Mm-sap: A comprehensive benchmark for assessing self-awareness of multimodal large language models in perception Y Wang, Y Liao, H Liu, H Liu, Y Wang, Y Wang
arXiv preprint arXiv:2401.07529, 2024
15 2024 M AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset Z Chen, H Liu, W Yu, G Sun, H Liu, J Wu, C Zhang, Y Wang, Y Wang
arXiv preprint arXiv:2403.14168, 2024
4 2024 Librisqa: Pioneering freeform and open-ended spoken question answering with a novel dataset and framework Z Zhao, Y Jiang, H Liu, Y Wang, Y Wang
arXiv preprint arXiv: 2308.10390, 2023
3 2023 Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models S Feng, H Liu, Y Wang, Y Wang
INTERSPEECH 2024, 0
2 * Decoding Linguistic Representations of Human Brain Y Wang, H Liu, Y Wang, C Xuan, Y Hou, S Feng, H Liu, Y Liao, Y Wang
arXiv preprint arXiv:2407.20622, 2024
2024 LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models Z Zhao, Y Jiang, H Liu, Y Wang, Y Wang
IEEE Transactions on Artificial Intelligence, 2024
2024 Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview H Liu, Y Wang, Y Wang
Proceedings of the 2024 Joint International Conference on Computational …, 2024
2024