Learning in audio-visual context: A review, analysis, and new perspective

Y Wei, D Hu, Y Tian, X Li - arXiv …

Audio-visual grouping network for sound localization from mixtures

S Mo, Y Tian - Proceedings of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Sound source localization is a typical and challenging task that predicts the location of
sound sources in a video. Previous single-source methods mainly used the audio-visual …

Audio-visual class-incremental learning

W Pian, S Mo, Y Guo, Y Tian - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In this paper, we introduce audio-visual class-incremental learning, a class-incremental
learning scenario for audio-visual video recognition. We demonstrate that joint audio-visual …

Multimodal variational auto-encoder based audio-visual segmentation

Y Mao, J Zhang, M Xiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose an Explicit Conditional Multimodal Variational Auto-Encoder
(ECMVAE) for audio-visual segmentation (AVS), aiming to segment sound sources in the …

Learning audio-visual source localization via false negative aware contrastive learning

W Sun, J Zhang, J Wang, Z Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Self-supervised audio-visual source localization aims to locate sound-source objects in
video frames without extra annotations. Recent methods often approach this goal with the …