Obserwuj
Yuanyuan Zhang
Yuanyuan Zhang
Zweryfikowany adres z apple.com
Tytuł
Cytowane przez
Cytowane przez
Rok
Attention based fully convolutional network for speech emotion recognition
Y Zhang, J Du, Z Wang, J Zhang, Y Tu
2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018
1772018
Exploring emotion features and fusion strategies for audio-video emotion recognition
H Zhou, D Meng, Y Zhang, X Peng, J Du, K Wang, Y Qiao
2019 International conference on multimodal interaction, 562-566, 2019
832019
Information fusion in attention networks using adaptive and multi-level factorized bilinear pooling for audio-visual emotion recognition
H Zhou, J Du, Y Zhang, Q Wang, QF Liu, CH Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2617-2629, 2021
602021
Deep fusion: An attention guided factorized bilinear pooling for audio-video emotion recognition
Y Zhang, ZR Wang, J Du
2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019
492019
Acoustic model fusion for end-to-end speech recognition
Z Lei, M Xu, S Han, L Liu, Z Huang, T Ng, Y Zhang, E Pusateri, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
102023
Frame-level specaugment for deep convolutional neural networks in hybrid ASR systems
X Li, Y Zhang, X Zhuang, D Liu
2021 IEEE Spoken Language Technology Workshop (SLT), 209-214, 2021
82021
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
J Silovsky, L Deng, A Argueta, T Arvizo, R Hsiao, S Kuznietsov, YC Lin, ...
arXiv preprint arXiv:2305.13652, 2023
22023
Contextualization of ASR with LLM using phonetic retrieval-based augmentation
Z Lei, X Na, M Xu, E Pusateri, C Van Gysel, Y Zhang, S Han, Z Huang
arXiv preprint arXiv:2409.15353, 2024
12024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–8