Foundation model for advancing healthcare: challenges, opportunities and future directions

Y He, F Huang, X Jiang, Y Nie, M Wang… - IEEE Reviews in …, 2024 - ieeexplore.ieee.org
Foundation model, trained on a diverse range of data and adaptable to a myriad of tasks, is
advancing healthcare. It fosters the development of healthcare artificial intelligence (AI) …

Single-model and any-modality for video object tracking

Z Wu, J Zheng, X Ren, FA Vasluianu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In the realm of video object tracking auxiliary modalities such as depth thermal or event data
have emerged as valuable assets to complement the RGB trackers. In practice most existing …

Multimodal co-learning meets remote sensing: Taxonomy, state of the art, and future works

N Kieu, K Nguyen, A Nazib, T Fernando… - IEEE Journal of …, 2024 - ieeexplore.ieee.org
In remote sensing (RS), multiple modalities of data are usually available, eg, RGB,
multispectral, hyperspectral, light detection and ranging (LiDAR), and synthetic aperture …

All in one framework for multimodal re-identification in the wild

H Li, M Ye, M Zhang, B Du - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Abstract In Re-identification (ReID) recent advancements yield noteworthy progress in both
unimodal and cross-modal retrieval tasks. However the challenge persists in develo** a …

Drfuse: Learning disentangled representation for clinical multi-modal fusion with missing modality and modal inconsistency

W Yao, K Yin, WK Cheung, J Liu, J Qin - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The combination of electronic health records (EHR) and medical images is crucial for
clinicians in making diagnoses and forecasting prognoses. Strategically fusing these two …

A novel transformer autoencoder for multi-modal emotion recognition with incomplete data

C Cheng, W Liu, Z Fan, L Feng, Z Jia - Neural Networks, 2024 - Elsevier
Multi-modal signals have become essential data for emotion recognition since they can
represent emotions more comprehensively. However, in real-world environments, it is often …

Towards robust multimodal sentiment analysis under uncertain signal missing

M Li, D Yang, L Zhang - IEEE Signal Processing Letters, 2023 - ieeexplore.ieee.org
Multimodal Sentiment Analysis (MSA) has attracted widespread research attention recently.
Most MSA studies are based on the assumption of signal completeness. However, many …

Husformer: A Multimodal Transformer for Multimodal Human State Recognition

R Wang, W Jo, D Zhao, W Wang… - … on Cognitive and …, 2024 - ieeexplore.ieee.org
Human state recognition is a critical topic with pervasive and important applications in
human–machine systems. Multimodal fusion, which entails integrating metrics from various …

Missing modality prediction for unpaired multimodal learning via joint embedding of unimodal models

D Kim, T Kim - European Conference on Computer Vision, 2024 - Springer
Multimodal learning typically relies on the assumption that all modalities are fully available
during both the training and inference phases. However, in real-world scenarios …

Removal and selection: Improving RGB-infrared object detection via coarse-to-fine fusion

T Zhao, M Yuan, F Jiang, N Wang, X Wei - arxiv preprint arxiv:2401.10731, 2024 - arxiv.org
Object detection in visible (RGB) and infrared (IR) images has been widely applied in recent
years. Leveraging the complementary characteristics of RGB and IR images, the object …