Tools, techniques, datasets and application areas for object detection in an image: a review
J Kaur, W Singh - Multimedia Tools and Applications, 2022 - Springer
Object detection is one of the most fundamental and challenging tasks to locate objects in
images and videos. Over the past, it has gained much attention to do more research on …
images and videos. Over the past, it has gained much attention to do more research on …
[HTML][HTML] A comprehensive survey on the application of deep and reinforcement learning approaches in autonomous driving
Abstract Recent advances in Intelligent Transport Systems (ITS) and Artificial Intelligence
(AI) have stimulated and paved the way toward the widespread introduction of Autonomous …
(AI) have stimulated and paved the way toward the widespread introduction of Autonomous …
Visual recognition with deep nearest centroids
We devise deep nearest centroids (DNC), a conceptually elegant yet surprisingly effective
network for large-scale visual recognition, by revisiting Nearest Centroids, one of the most …
network for large-scale visual recognition, by revisiting Nearest Centroids, one of the most …
Tf-blender: Temporal feature blender for video object detection
Video objection detection is a challenging task because isolated video frames may
encounter appearance deterioration, which introduces great confusion for detection. One of …
encounter appearance deterioration, which introduces great confusion for detection. One of …
Differential feature awareness network within antagonistic learning for infrared-visible object detection
R Zhang, L Li, Q Zhang, J Zhang, L Xu… - … on Circuits and …, 2023 - ieeexplore.ieee.org
The combination of infrared and visible videos aims to gather more comprehensive feature
information from multiple sources and reach superior results on various practical tasks, such …
information from multiple sources and reach superior results on various practical tasks, such …
Deep unsupervised part-whole relational visual saliency
Y Liu, X Dong, D Zhang, S Xu - Neurocomputing, 2024 - Elsevier
Abstract Deep Supervised Salient Object Detection (SSOD) excessively relies on large-
scale annotated pixel-level labels which consume intensive labour acquiring high quality …
scale annotated pixel-level labels which consume intensive labour acquiring high quality …
Densernet: Weakly supervised visual localization using multi-scale feature aggregation
In this work, we introduce a Denser Feature Network (DenserNet) for visual localization. Our
work provides three principal contributions. First, we develop a convolutional neural network …
work provides three principal contributions. First, we develop a convolutional neural network …
Video captioning using global-local representation
Video captioning is a challenging task as it needs to accurately transform visual
understanding into natural language description. To date, state-of-the-art methods …
understanding into natural language description. To date, state-of-the-art methods …
A survey on map-based localization techniques for autonomous vehicles
Autonomous vehicles integrate complex software stacks for realizing the necessary iterative
perception, planning, and action operations. One of the foundational layers of such stacks is …
perception, planning, and action operations. One of the foundational layers of such stacks is …
E^ 2VPT: An Effective and Efficient Approach for Visual Prompt Tuning
As the size of transformer-based models continues to grow, fine-tuning these large-scale
pretrained vision models for new tasks has become increasingly parameter-intensive …
pretrained vision models for new tasks has become increasingly parameter-intensive …