TF-Blender: Temporal feature blender for video object detection

Y Cui, L Yan, Z Cao, D Liu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Video object detection is a challenging task because isolated video frames may
encounter appearance deterioration, which introduces great confusion for detection. One of …

TransVPR: Transformer-based place recognition with multi-level attention aggregation

R Wang, Y Shen, W Zuo, S Zhou… - Proceedings of the …, 2022 - openaccess.thecvf.com
Visual place recognition is a challenging task for applications such as autonomous driving
navigation and mobile robot localization. Distracting elements present in complex scenes …

A survey on map-based localization techniques for autonomous vehicles

A Chalvatzaras, I Pratikakis… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Autonomous vehicles integrate complex software stacks for realizing the necessary iterative
perception, planning, and action operations. One of the foundational layers of such stacks is …

Label-efficient video object segmentation with motion clues

Y Lu, J Zhang, S Sun, Q Guo, Z Cao… - … on Circuits and …, 2023 - ieeexplore.ieee.org
Video object segmentation (VOS) plays an important role in video analysis and
understanding, which in turn facilitates a number of diverse applications, including video …

Dynamic feature aggregation for efficient video object detection

Y Cui - Proceedings of the Asian Conference on Computer …, 2022 - openaccess.thecvf.com
Video object detection is a fundamental yet challenging task in computer vision. One
practical solution is to take advantage of temporal information from the video and apply …

TransVLAD: Multi-scale attention-based global descriptors for visual geo-localization

Y Xu, P Shamsolmoali, E Granger… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual geo-localization remains a challenging task due to variations in the appearance and
perspective among captured images. This paper introduces an efficient TransVLAD module …

Radiance Field Learners As UAV First-Person Viewers

L Yan, Q Wang, J Zhao, Q Guan, Z Tang… - … on Computer Vision, 2024 - Springer
First-Person-View (FPV) holds immense potential for revolutionizing the trajectory of
Unmanned Aerial Vehicles (UAVs), offering an exhilarating avenue for navigating complex …

An efficient end-to-end EKF-SLAM architecture based on LiDAR, GNSS, and IMU data sensor fusion for autonomous ground vehicles

H Mailka, M Abouzahir, M Ramzi - Multimedia Tools and Applications, 2024 - Springer
The autonomous ground vehicle's successful navigation with a high level of performance is
dependent on accurate state estimation, which may help in providing excellent decision …

Image retrieval using compact deep semantic correlation descriptors

BJ Zhang, GH Liu, Z Li, SX Song - Information Processing & Management, 2024 - Elsevier
Significant progress has been made in instance image retrieval based on deep feature
aggregation. However, existing approaches are limited by two issues: 1) The inability of …

CML-MOTS: Collaborative multi-task learning for multi-object tracking and segmentation

Y Cui, C Han, D Liu - arXiv preprint arXiv:2311.00987, 2023 - arxiv.org
The advancement of computer vision has pushed visual analysis tasks from still images to
the video domain. In recent years, video instance segmentation, which aims to track and …