A study on robust detection of pronunciation erroneous tendency based on deep neural network. Y Gao, Y Xie, W Cao, J Zhang Interspeech, 693-696, 2015 | 44 | 2015 |
Improving mandarin tone recognition based on dnn by combining acoustic and articulatory features using extended recognition networks J Lin, W Li, Y Gao, Y Xie, NF Chen, SM Siniscalchi, J Zhang, CH Lee Journal of Signal Processing Systems 90, 1077-1087, 2018 | 30 | 2018 |
Auffusion: Leveraging the power of diffusion and large language models for text-to-audio generation J Xue, Y Deng, Y Gao, Y Li arXiv preprint arXiv:2401.01044, 2024 | 28 | 2024 |
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis J Xue, Y Deng, F Wang, Y Li, Y Gao, J Tao, J Sun, J Liang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 22 | 2023 |
Articulatory Copy Synthesis Based on a Genetic Algorithm. Y Gao, S Stone, P Birkholz INTERSPEECH, 3770-3774, 2019 | 18 | 2019 |
Formant tracking using dilated convolutional networks through dense connection with gating mechanism W Dai, J Zhang, Y Gao, W Wei, D Ke, B Lin, Y Xie arXiv preprint arXiv:2005.10803, 2020 | 11 | 2020 |
Articulatory copy synthesis using long-short term memory networks Y Gao, P Steiner, P Birkholz Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung …, 2020 | 11 | 2020 |
Computational Modelling of Tone Perception Based on Direct Processing of f0 Contours Y Chen, Y Gao, Y Xu Brain Sciences 12 (3), 337, 2022 | 10 | 2022 |
Resynthesizing the geco speech corpus with vocaltractlab K Sering, N Stehwien, Y Gao, MV Butz, H Baayen Konferenz Elektronische Sprachsignalverarbeitung, 95-102, 2019 | 10 | 2019 |
End-to-End Mispronunciation Detection and Diagnosis Using Transfer Learning L Peng, Y Gao, R Bao, Y Li, J Zhang Applied Sciences 13 (11), 6793, 2023 | 8 | 2023 |
Improving pronunciation erroneous tendency detection with multi-model soft targets J Lin, Y Gao, W Zhang, L Wei, Y Xie, J Zhang Journal of Signal Processing Systems 92, 793-803, 2020 | 8 | 2020 |
An acoustic comparison of German tense and lax vowels produced by German native speakers and Mandarin Chinese learners Y Gao, H Ding, P Birkholz The Journal of the Acoustical Society of America 148 (1), EL112-EL118, 2020 | 8 | 2020 |
Improving Mandarin tone recognition based on DNN by combining acoustic and articulatory features J Lin, Y Xie, Y Gao, J Zhang 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016 | 8 | 2016 |
Concss: Contrastive-based context comprehension for dialogue-appropriate prosody in conversational speech synthesis Y Deng, J Xue, Y Jia, Q Li, Y Han, F Wang, Y Gao, D Ke, Y Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 7 | 2024 |
Cmcu-css: Enhancing naturalness via commonsense-based multi-modal context understanding in conversational speech synthesis Y Deng, J Xue, F Wang, Y Gao, Y Li Proceedings of the 31st ACM International Conference on Multimedia, 6081-6089, 2023 | 7 | 2023 |
An Investigation of Applying Large Language Models to Spoken Language Learning Y Gao, B Nuchged, Y Li, L Peng Applied Sciences 14 (1), 224, 2023 | 6 | 2023 |
A practical way to improve automatic phonetic segmentation performance W Peng, Y Gao, B Lin, J Zhang 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 6 | 2021 |
Improving pronunciation erroneous tendency detection with convolutional long short-term memory L Yang, Y Xie, Y Gao, J Zhang 2017 International Conference on Asian Language Processing (IALP), 52-56, 2017 | 6 | 2017 |
FTA-net: A Frequency and Time Attention Network for Speech Depression Detection Q Li, D Wang, Y Ren, Y Gao, Y Li INTERSPEECH 2023, 1723-1727, 2023 | 5 | 2023 |
Text-aware end-to-end mispronunciation detection and diagnosis L Peng, Y Gao, B Lin, D Ke, Y Xie, J Zhang arXiv preprint arXiv:2206.07289, 2022 | 5 | 2022 |