- Academic Search

Artikelen

Scholar

Ongeveer 21 resultaten (0,02 sec)

Mijn profiel Mijn bibliotheek

Grounding DINO 1.5: Advance the" Edge" of Open-Set Object Detection

Zoeken in citerende artikelen

[Free GPT-4]

[PDF] arxiv.org

Dino-x: A unified vision model for open-world object detection and understanding

T Ren, Y Chen, Q Jiang, Z Zeng, Y ** (SLAM) approach that leverages a sparse and lightweight object-based …

Opslaan Citeren Geciteerd door 5 Verwante artikelen Alle 2 versies HTML-versie

[Free GPT-4]

[PDF] arxiv.org

Instruction-guided scene text recognition

Y Du, Z Chen, Y Su, C Jia… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org

Multi-modal models have shown appealing performance in visual recognition tasks, as free-
form text-guided training evokes the ability to understand fine-grained visual content …

Opslaan Citeren Geciteerd door 4 Verwante artikelen Alle 2 versies

[Free GPT-4]

[PDF] arxiv.org

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation

H Zhang, D Hong, T Gao, Y Wang, J Shao… - ar** via Semantic and Geometric Guided Segmentation

H Li, W Mao, W Deng, C Meng, R Zhang, F Jia… - ar**, which involves gras** specific parts of objects based on their
functions, is crucial for develo** advanced robotic systems capable of performing complex …

Opslaan Citeren Verwante artikelen Alle 2 versies HTML-versie

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Grounding DINO 1.5: Advance the" Edge" of Open-Set Object Detection

Dino-x: A unified vision model for open-world object detection and understanding

Instruction-guided scene text recognition

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation