Av-superb: A multi-task evaluation benchmark for audio-visual representation models Y Tseng, L Berry, YT Chen, IH Chiu, HH Lin, M Liu, P Peng, YJ Shih, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9 | 2024 |
Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ... arXiv preprint arXiv:2411.05361, 2024 | 4 | 2024 |
DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset J Du, IM Lin, IH Chiu, X Chen, H Wu, W Ren, Y Tsao, H Lee, JSR Jang 2024 IEEE Spoken Language Technology Workshop (SLT), 921-928, 2024 | 1 | 2024 |
Building a Taiwanese Mandarin Spoken Language Model: A First Attempt CK Yang, YK Fu, CA Li, YC Lin, YX Lin, WC Chen, HL Chung, CY Kuan, ... arXiv preprint arXiv:2411.07111, 2024 | 1 | 2024 |
CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset J Du, X Chen, H Wu, L Zhang, I Lin, I Chiu, W Ren, Y Tseng, Y Tsao, ... arXiv preprint arXiv:2501.08238, 2025 | | 2025 |