| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech | X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... | Computer Speech & Language 64, 101114, 2020 | 429 | 2020 |
| Sequence-to-sequence acoustic modeling for voice conversion | JX Zhang, ZH Ling, LJ Liu, Y Jiang, LR Dai | IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (3), 631-644, 2019 | 171 | 2019 |
| Non-parallel sequence-to-sequence voice conversion with disentangled linguistic and speaker representations | JX Zhang, ZH Ling, LR Dai | IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 540-552, 2019 | 133 | 2019 |
| Forward attention in sequence-to-sequence acoustic modeling for speech synthesis | JX Zhang, ZH Ling, LR Dai | 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 110 | 2018 |
| Improving sequence-to-sequence voice conversion by adding text-supervision | JX Zhang, ZH Ling, Y Jiang, LJ Liu, C Liang, LR Dai | ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 42 | 2019 |
| TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos | MS Ribeiro, J Sanger, JX Zhang, A Eshky, A Wrench, K Richmond, ... | 2021 IEEE Spoken Language Technology Workshop (SLT), 1109-1116, 2021 | 37 | 2021 |
| Voice conversion by cascading automatic speech recognition and text-to-speech synthesis with prosody transfer | JX Zhang, LJ Liu, YN Chen, YJ Hu, Y Jiang, ZH Ling, LR Dai | arXiv preprint arXiv:2009.01475, 2020 | 23 | 2020 |
| Non-parallel voice conversion with autoregressive conversion model and duration adjustment | LJ Liu, YN Chen, JX Zhang, Y Jiang, YJ Hu, ZH Ling, LR Dai | Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | 20 | 2020 |
| TaLNet: Voice reconstruction from tongue and lip articulation with transfer learning from text-to-speech synthesis | JX Zhang, K Richmond, ZH Ling, L Dai | Proceedings of the AAAI Conference on Artificial Intelligence 35 (16), 14402 …, 2021 | 17 | 2021 |
| DNN-based spectral enhancement for neural waveform generators with low-bit quantization | Y Ai, JX Zhang, L Chen, ZH Ling | ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 13 | 2019 |
| Self-supervised audio-visual speech representations learning by multimodal self-distillation | JX Zhang, G Wan, ZH Ling, J Pan, J Gao, C Liu | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
| Recognition-synthesis based non-parallel voice conversion with adversarial learning | JX Zhang, ZH Ling, LR Dai | arXiv preprint arXiv:2008.02371, 2020 | 8 | 2020 |
| Is lip region-of-interest sufficient for lipreading? | JX Zhang, G Wan, J Pan | Proceedings of the 2022 International Conference on Multimodal Interaction …, 2022 | 7 | 2022 |
| Adversarial post-processing of voice conversion against spoofing detection | YY Ding, JX Zhang, LJ Liu, Y Jiang, Y Hu, ZH Ling | 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 5 | 2020 |
| Grammar-supervised end-to-end speech recognition with part-of-speech tagging and dependency parsing | G Wan, T Mao, J Zhang, H Chen, J Gao, Z Ye | Applied Sciences 13 (7), 4243, 2023 | 3 | 2023 |