Attention based fully convolutional network for speech emotion recognition Y Zhang, J Du, Z Wang, J Zhang, Y Tu 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 177 | 2018 |
Exploring emotion features and fusion strategies for audio-video emotion recognition H Zhou, D Meng, Y Zhang, X Peng, J Du, K Wang, Y Qiao 2019 International conference on multimodal interaction, 562-566, 2019 | 83 | 2019 |
Information fusion in attention networks using adaptive and multi-level factorized bilinear pooling for audio-visual emotion recognition H Zhou, J Du, Y Zhang, Q Wang, QF Liu, CH Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2617-2629, 2021 | 60 | 2021 |
Deep fusion: An attention guided factorized bilinear pooling for audio-video emotion recognition Y Zhang, ZR Wang, J Du 2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019 | 49 | 2019 |
Acoustic model fusion for end-to-end speech recognition Z Lei, M Xu, S Han, L Liu, Z Huang, T Ng, Y Zhang, E Pusateri, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 10 | 2023 |
Frame-level specaugment for deep convolutional neural networks in hybrid ASR systems X Li, Y Zhang, X Zhuang, D Liu 2021 IEEE Spoken Language Technology Workshop (SLT), 209-214, 2021 | 8 | 2021 |
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers J Silovsky, L Deng, A Argueta, T Arvizo, R Hsiao, S Kuznietsov, YC Lin, ... arXiv preprint arXiv:2305.13652, 2023 | 2 | 2023 |
Contextualization of ASR with LLM using phonetic retrieval-based augmentation Z Lei, X Na, M Xu, E Pusateri, C Van Gysel, Y Zhang, S Han, Z Huang arXiv preprint arXiv:2409.15353, 2024 | 1 | 2024 |