SSTtrack: A unified hyperspectral video tracking framework via modeling spectral-spatial-temporal conditions

Y Chen, Q Yuan, Y Tang, Y **ao, J He, T Han, Z Liu… - Information …, 2025 - Elsevier
Hyperspectral video contains rich spectral, spatial, and temporal conditions that are crucial
for capturing complex object variations and overcoming the inherent limitations (eg, multi …

Revisiting RGBT tracking benchmarks from the perspective of modality validity: A new benchmark, problem, and method

Z Tang, T Xu, Z Feng, X Zhu, H Wang, P Shao… - arxiv preprint arxiv …, 2024 - arxiv.org
RGBT tracking draws increasing attention due to its robustness in multi-modality warranting
(MMW) scenarios, such as nighttime and bad weather, where relying on a single sensing …

PHTrack: Prompting for Hyperspectral Video Tracking

Y Chen, Y Tang, X Su, J Li, Y **ao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Hyperspectral (HS) video captures continuous spectral information of objects, enhancing
material identification in tracking tasks. It is expected to overcome the inherent limitations of …

Multi-Level Fusion for Robust RGBT Tracking via Enhanced Thermal Representation

Z Tang, T Xu, XJ Wu, J Kittler - ACM Transactions on Multimedia …, 2024 - dl.acm.org
Due to the limitations of visible (RGB) sensors in challenging scenarios, such as nighttime
and foggy environments, the thermal infrared (TIR) modality draws increasing attention as …

Cross-modal object tracking via modality-aware fusion network and a large-scale dataset

L Liu, M Zhang, C Li, C Li, J Tang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Visual object tracking often faces challenges such as invalid targets and decreased
performance in low-light conditions when relying solely on RGB image sequences. While …

Awesome multi-modal object tracking

C Zhang, L Liu, H Wen, X Zhou, Y Wang - arxiv preprint arxiv:2405.14200, 2024 - arxiv.org
Multi-modal object tracking (MMOT) is an emerging field that combines data from various
modalities,\eg vision (RGB), depth, thermal infrared, event, language and audio, to estimate …

Adaptive Colour-Depth Aware Attention for RGB-D Object Tracking

XF Zhu, T Xu, XJ Wu - IEEE Signal Processing Letters, 2024 - ieeexplore.ieee.org
Recent advances in RGB-D tracking have been driven by the synergistic combination of
high-performing RGB-only trackers and auxiliary depth information. However, most existing …

Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding

Y Liu, C Li, S Xu, J Han - arxiv preprint arxiv:2410.14944, 2024 - arxiv.org
Multi-modal fusion has played a vital role in multi-modal scene understanding. Most existing
methods focus on cross-modal fusion involving two modalities, often overlooking more …