Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research X Mei, C Meng, H Liu, Q Kong, T Ko, C Zhao, MD Plumbley, Y Zou, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 208* | 2024 |
Gigast: A 10,000-hour pseudo speech translation corpus R Ye, C Zhao, T Ko, C Meng, T Wang, M Wang, J Cao arXiv preprint arXiv:2204.03939, 2022 | 19 | 2022 |