Tf-blender: Temporal feature blender for video object detection
Video objection detection is a challenging task because isolated video frames may
encounter appearance deterioration, which introduces great confusion for detection. One of …
encounter appearance deterioration, which introduces great confusion for detection. One of …
Transvpr: Transformer-based place recognition with multi-level attention aggregation
Visual place recognition is a challenging task for applications such as autonomous driving
navigation and mobile robot localization. Distracting elements presenting in complex scenes …
navigation and mobile robot localization. Distracting elements presenting in complex scenes …
A survey on map-based localization techniques for autonomous vehicles
Autonomous vehicles integrate complex software stacks for realizing the necessary iterative
perception, planning, and action operations. One of the foundational layers of such stacks is …
perception, planning, and action operations. One of the foundational layers of such stacks is …
Label-efficient video object segmentation with motion clues
Video object segmentation (VOS) plays an important role in video analysis and
understanding, which in turn facilitates a number of diverse applications, including video …
understanding, which in turn facilitates a number of diverse applications, including video …
Dynamic feature aggregation for efficient video object detection
Y Cui - Proceedings of the Asian Conference on Computer …, 2022 - openaccess.thecvf.com
Video object detection is a fundamental yet challenging task in computer vision. One
practical solution is to take advantage of temporal information from the video and apply …
practical solution is to take advantage of temporal information from the video and apply …
TransVLAD: Multi-scale attention-based global descriptors for visual geo-localization
Visual geo-localization remains a challenging task due to variations in the appearance and
perspective among captured images. This paper introduces an efficient TransVLAD module …
perspective among captured images. This paper introduces an efficient TransVLAD module …
Radiance Field Learners As UAV First-Person Viewers
Abstract First-Person-View (FPV) holds immense potential for revolutionizing the trajectory of
Unmanned Aerial Vehicles (UAVs), offering an exhilarating avenue for navigating complex …
Unmanned Aerial Vehicles (UAVs), offering an exhilarating avenue for navigating complex …
An efficient end-to-end EKF-SLAM architecture based on LiDAR, GNSS, and IMU data sensor fusion for autonomous ground vehicles
The autonomous ground vehicle's successful navigation with a high level of performance is
dependent on accurate state estimation, which may help in providing excellent decision …
dependent on accurate state estimation, which may help in providing excellent decision …
Image retrieval using compact deep semantic correlation descriptors
BJ Zhang, GH Liu, Z Li, SX Song - Information Processing & Management, 2024 - Elsevier
Significant progress has been made in instance image retrieval based on deep feature
aggregation. However, existing approaches are limited by two issues: 1) The inability of …
aggregation. However, existing approaches are limited by two issues: 1) The inability of …
Cml-mots: Collaborative multi-task learning for multi-object tracking and segmentation
The advancement of computer vision has pushed visual analysis tasks from still images to
the video domain. In recent years, video instance segmentation, which aims to track and …
the video domain. In recent years, video instance segmentation, which aims to track and …