Audio-Language Models for Audio-Centric Tasks: A survey

Y Su, J Bai, Q Xu, K Xu, Y Dou - arxiv preprint arxiv:2501.15177, 2025 - arxiv.org
Audio-Language Models (ALMs), which are trained on audio-text data, focus on the
processing, understanding, and reasoning of sounds. Unlike traditional supervised learning …