A comprehensive survey of scene graphs: Generation and application

X Chang, P Ren, P Xu, Z Li, X Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …

Expansion-squeeze-excitation fusion network for elderly activity recognition

X Shu, J Yang, R Yan, Y Song - IEEE Transactions on Circuits …, 2022 - ieeexplore.ieee.org
This work focuses on the task of elderly activity recognition, which is a challenging task due
to the existence of individual actions and human-object interactions in elderly activities …

Tcgl: Temporal contrastive graph for self-supervised video representation learning

Y Liu, K Wang, L Liu, H Lan, L Lin - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …

Advancements in perception system with multi-sensor fusion for embodied agents

H Du, L Ren, Y Wang, X Cao, C Sun - Information Fusion, 2024 - Elsevier
The multi-sensor data fusion perception technology, as a pivotal technique for achieving
complex environmental perception and decision-making, has been garnering extensive …

SRAI-LSTM: A social relation attention-based interaction-aware LSTM for human trajectory prediction

Y Peng, G Zhang, J Shi, B Xu, L Zheng - Neurocomputing, 2022 - Elsevier
Pedestrian trajectory prediction is one of the important research topics in the field of
computer vision and a key technology of autonomous driving system. Walking in groups is a …

Multimodal emotion classification with multi-level semantic reasoning network

T Zhu, L Li, J Yang, S Zhao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Nowadays, people are accustomed to posting images and associated text for expressing
their emotions on social networks. Accordingly, multimodal sentiment analysis has drawn …

Graph-based social relation reasoning

W Li, Y Duan, J Lu, J Feng, J Zhou - European conference on computer …, 2020 - Springer
Human beings are fundamentally sociable—that we generally organize our social lives in
terms of relations with other people. Understanding social relations from an image has great …

Bilateral cross-modality graph matching attention for feature fusion in visual question answering

J Cao, X Qin, S Zhao, J Shen - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
Answering semantically complicated questions according to an image is challenging in a
visual question answering (VQA) task. Although the image can be well represented by deep …

[PDF][PDF] Scene graphs: A survey of generations and applications

X Chang, P Ren, P Xu, Z Li, X Chen… - arxiv preprint arxiv …, 2021 - xiaojun.ai
Scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …

SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization

W Li, Z Meng, J Zhou, D Wei, C Gan… - arxiv preprint arxiv …, 2024 - arxiv.org
Social relation reasoning aims to identify relation categories such as friends, spouses, and
colleagues from images. While current methods adopt the paradigm of training a dedicated …