- Academic Search

A comprehensive survey of scene graphs: Generation and application

X Chang, P Ren, P Xu, Z Li, X Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …

Cited by 324 - All 12 versions

[HTML] sciencedirect.com

[HTML] Scene graph generation: A comprehensive survey

H Li, G Zhu, L Zhang, Y Jiang, Y Dang, H Hou, P Shen… - Neurocomputing, 2024 - Elsevier

Deep learning techniques have led to remarkable breakthroughs in the field of object
detection and have spawned a lot of scene-understanding tasks in recent years. Scene …

Cited by 116 - All 8 versions

[PDF] thecvf.com

Unbiased scene graph generation from biased training

K Tang, Y Niu, J Huang, J Shi… - Proceedings of the …, 2020 - openaccess.thecvf.com

Today's scene graph generation (SGG) task is still far from practical, mainly due to the
severe training bias, e.g., collapsing diverse "human walk on/sit on/lay on beach" into "human …

Cited by 820 - All 10 versions

[PDF] thecvf.com

Teaching structured vision & language concepts to vision & language models

S Doveh, A Arbelle, S Harary… - Proceedings of the …, 2023 - openaccess.thecvf.com

Vision and Language (VL) models have demonstrated remarkable zero-shot performance in
a variety of tasks. However, some aspects of complex language understanding still remain a …

Cited by 81 - All 10 versions

[PDF] arxiv.org

Video graph transformer for video question answering

J ** computer vision models that can reason about compositional …‏

Cited by 126 - All 5 versions

[PDF] thecvf.com

Object-region video transformers

R Herzig, E Ben-Avraham… - Proceedings of the …, 2022 - openaccess.thecvf.com

Recently, video transformers have shown great success in video understanding, exceeding
CNN performance; yet existing video transformer models do not explicitly model objects …

Cited by 97 - All 7 versions

[PDF] thecvf.com

Bottom-up shift and reasoning for referring image segmentation

S Yang, M Xia, G Li, HY Zhou… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Referring image segmentation aims to segment the referent that is the corresponding object
or stuff referred by a natural language expression in an image. Its main challenge lies in how …

Cited by 99 - All 8 versions
