Object detection in 20 years: A survey

Z Zou, K Chen, Z Shi, Y Guo, J Ye - Proceedings of the IEEE, 2023 - ieeexplore.ieee.org
Object detection, as one of the most fundamental and challenging problems in computer
vision, has received great attention in recent years. Over the past two decades, we have …

Vision-based holistic scene understanding towards proactive human–robot collaboration

J Fan, P Zheng, S Li - Robotics and Computer-Integrated Manufacturing, 2022 - Elsevier
Recently, human–robot collaboration (HRC) has emerged as a promising paradigm for mass
personalization in manufacturing owing to the potential to fully exploit the strength of human …

A survey of knowledge graph reasoning on graph types: Static, dynamic, and multi-modal

K Liang, L Meng, M Liu, Y Liu, W Tu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Knowledge graph reasoning (KGR), aiming to deduce new facts from existing facts based on
mined logic rules underlying knowledge graphs (KGs), has become a fast-growing research …

FFTI: Image inpainting algorithm via features fusion and two-steps inpainting

Y Chen, R Xia, K Zou, K Yang - Journal of Visual Communication and …, 2023 - Elsevier
Because existing image inpainting methods fail to make full use of the
complete region to predict the missing region features when the object features are seriously …

Recent advances in small object detection based on deep learning: A review

K Tong, Y Wu, F Zhou - Image and Vision Computing, 2020 - Elsevier
Small object detection is a challenging problem in computer vision. It has been widely
applied in defense military, transportation, industry, etc. To facilitate in-depth understanding …

CNN-RNN based intelligent recommendation for online medical pre-diagnosis support

X Zhou, Y Li, W Liang - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org
The rapidly developed Health 2.0 technology has provided people with more opportunities
to conduct online medical consultation than ever before. Understanding contexts within …

A comprehensive survey of deep learning for image captioning

MDZ Hossain, F Sohel, MF Shiratuddin… - ACM Computing Surveys …, 2019 - dl.acm.org
Generating a description of an image is called image captioning. Image captioning requires
recognizing the important objects, their attributes, and their relationships in an image. It also …

Learning to answer questions in dynamic audio-visual scenarios

G Li, Y Wei, Y Tian, C Xu, JR Wen… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we focus on the Audio-Visual Question Answering (AVQA) task, which aims to
answer questions regarding different visual objects, sounds, and their associations in …

Graph R-CNN for scene graph generation

J Yang, J Lu, S Lee, D Batra… - Proceedings of the …, 2018 - openaccess.thecvf.com
We propose a novel scene graph generation model called Graph R-CNN, that is both
effective and efficient at detecting objects and their relations in images. Our model contains …

MAttNet: Modular attention network for referring expression comprehension

L Yu, Z Lin, X Shen, J Yang, X Lu… - Proceedings of the …, 2018 - openaccess.thecvf.com
In this paper, we address referring expression comprehension: localizing an image region
described by a natural language expression. While most recent work treats expressions as a …