Google Академик

H Li, G Zhu, L Zhang, Y Jiang, Y Dang, H Hou, P Shen… - Neurocomputing, 2024 - Elsevier

Deep learning techniques have led to remarkable breakthroughs in the field of object
detection and have spawned a lot of scene-understanding tasks in recent years. Scene …

Сачувај Цитирај 115 пута наведен Сродни чланци Све верзије (8)

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Panoptic video scene graph generation

J Yang, W Peng, X Li, Z Guo, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com

Towards building comprehensive real-world visual perception systems, we propose and
study a new problem called panoptic scene graph generation (PVSG). PVSG is related to …

Сачувај Цитирај 40 пута наведен Сродни чланци Све верзије (7) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Enriching local and global contexts for temporal action localization

Z Zhu, W Tang, L Wang, N Zheng… - Proceedings of the …, 2021 - openaccess.thecvf.com

Effectively tackling the problem of temporal action localization (TAL) necessitates a visual
representation that jointly pursues two confounding goals, ie, fine-grained discrimination for …

Сачувај Цитирај 145 пута наведен Сродни чланци Све верзије (6) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Sportshhi: A dataset for human-human interaction detection in sports videos

T Wu, R He, G Wu, L Wang - … of the IEEE/CVF conference on …, 2024 - openaccess.thecvf.com

Video-based visual relation detection tasks such as video scene graph generation play
important roles in fine-grained video understanding. However current video visual relation …

Сачувај Цитирај 7 пута наведен Сродни чланци Све верзије (6) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Continuous scene representations for embodied ai

SY Gadre, K Ehsani, S Song… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract We propose Continuous Scene Representations (CSR), a scene representation
constructed by an embodied agent navigating within a space, where objects and their …

Сачувај Цитирај 54 пута наведен Сродни чланци Све верзије (4) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Target adaptive context aggregation for video scene graph generation

Y Teng, L Wang, Z Li, G Wu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

This paper deals with a challenging task of video scene graph generation (VidSGG), which
could serve as a structured video representation for high-level understanding tasks. We …

Сачувај Цитирај 69 пута наведен Сродни чланци Све верзије (6) HTML верзија

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Interventional video relation detection

Y Li, X Yang, X Shang, TS Chua - Proceedings of the 29th ACM …, 2021 - dl.acm.org

Video Visual Relation Detection (VidVRD) aims to semantically describe the dynamic
interactions across visual concepts localized in a video in the form of subject, predicate …

Сачувај Цитирај 65 пута наведен Сродни чланци

Few-shot human–object interaction video recognition with transformers

Q Li, X **e, J Zhang, G Shi - Neural Networks, 2023 - Elsevier

We propose a novel few-shot learning framework that can recognize human–object
interaction (HOI) classes with a few labeled samples. We achieve this by leveraging a meta …

Сачувај Цитирај 22 пута наведен Сродни чланци Све верзије (3)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Beyond mot: Semantic multi-object tracking

Y Li, Q Li, H Wang, X Ma, J Yao, S Dong, H Fan… - … on Computer Vision, 2024 - Springer

Current multi-object tracking (MOT) aims to predict trajectories of targets (ie,“where”) in
videos. Yet, knowing merely “where” is insufficient in many crucial applications. In …

Сачувај Цитирај 5 пута наведен Сродни чланци Све верзије (9)

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Scene graph contrastive learning for embodied navigation

KP Singh, J Salvador, L Weihs… - Proceedings of the …, 2023 - openaccess.thecvf.com

Training effective embodied AI agents often involves expert imitation, specialized
components such as maps, or leveraging additional sensors for depth and localization …

Сачувај Цитирај 11 пута наведен Сродни чланци Све верзије (3) HTML верзија

Направи обавештење

Цитирај

Напредна претрага

Сачувано у мојој библиотеци

Beyond short-term snippet: Video relation detection with spatio-temporal global context

[HTML][HTML] Scene graph generation: A comprehensive survey

Panoptic video scene graph generation

Enriching local and global contexts for temporal action localization

Sportshhi: A dataset for human-human interaction detection in sports videos

Continuous scene representations for embodied ai

Target adaptive context aggregation for video scene graph generation

Interventional video relation detection

Few-shot human–object interaction video recognition with transformers

Beyond mot: Semantic multi-object tracking

Scene graph contrastive learning for embodied navigation