Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model
Large Audio-Language Models (LALMs) have demonstrated remarkable performance in
tasks involving audio perception and understanding, such as speech recognition and audio …
tasks involving audio perception and understanding, such as speech recognition and audio …
SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval
We introduce Speech Information Retrieval (SIR), a new long-context task for Speech Large
Language Models (Speech LLMs), and present SPIRAL, a 1,012-sample benchmark testing …
Language Models (Speech LLMs), and present SPIRAL, a 1,012-sample benchmark testing …