Can Whisper Perform Speech-Based In-Context Learning? S Wang, CH Yang, J Wu, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 22 | 2024 |
Bayesian Example Selection Improves In-Context Learning for Speech, Text and Visual Modalities S Wang, CHH Yang, J Wu, C Zhang EMNLP 2024, 2024 | 6 | 2024 |
SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation W Yu, S Wang, X Yang, X Chen, X Tian, J Zhang, G Sun, L Lu, Y Wang, ... arXiv preprint arXiv:2411.18138, 2024 | 5 | 2024 |
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization Y Hu, C Chen, S Wang, ES Chng, C Zhang arXiv preprint arXiv:2407.02243, 2024 | 3 | 2024 |
Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation S Wang, W Yu, Y Yang, C Tang, Y Li, J Zhuang, X Chen, X Tian, J Zhang, ... arXiv preprint arXiv:2409.16644, 2024 | 1 | 2024 |
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators C Chen, Y Hu, S Wang, H Wang, Z Chen, C Zhang, CHH Yang, ES Chng arXiv preprint arXiv:2501.17202, 2025 | | 2025 |