VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019 A Tjandra, B Sisman, M Zhang, S Sakti, H Li, S Nakamura arXiv preprint arXiv:1905.11449, 2019 | 90 | 2019 |
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet M Zhang, X Wang, F Fang, H Li, J Yamagishi Interspeech, 1298-1302, 2019 | 81 | 2019 |
Transfer learning from speech synthesis to voice conversion with non-parallel training data M Zhang, Y Zhou, L Zhao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1290-1302, 2021 | 68 | 2021 |
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion K Zhou, B Sisman, M Zhang, H Li arXiv preprint arXiv:2005.07025, 2020 | 67 | 2020 |
A Voice Conversion Framework with Tandem Feature Sparse Representation and Speaker-Adapted WaveNet Vocoder. B Sisman, M Zhang, H Li Interspeech, 1978-1982, 2018 | 62 | 2018 |
Group Sparse Representation With WaveNet Vocoder Adaptation for Spectrum and Prosody Conversion B Sisman, M Zhang, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (6), 1085 …, 2019 | 54 | 2019 |
Adaptive wavenet vocoder for residual compensation in gan-based voice conversion B Sisman, M Zhang, S Sakti, H Li, S Nakamura 2018 IEEE Spoken Language Technology Workshop (SLT), 282-289, 2018 | 46 | 2018 |
On the study of generative adversarial networks for cross-lingual voice conversion B Sisman, M Zhang, M Dong, H Li 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 40 | 2019 |
Deepconversion: Voice conversion with limited parallel training data M Zhang, B Sisman, L Zhao, H Li Speech Communication 122, 31-43, 2020 | 23 | 2020 |
VisualTTS: TTS with accurate lip-speech synchronization for automatic voice over J Lu, B Sisman, R Liu, M Zhang, H Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 19 | 2022 |
Error reduction network for dblstm-based voice conversion M Zhang, B Sisman, SS Rallabandi, H Li, L Zhao 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 16 | 2018 |
TTS-Guided Training for Accent Conversion Without Parallel Data Y Zhou, Z Wu, M Zhang, X Tian, H Li IEEE Signal Processing Letters, 2023 | 11 | 2023 |
The NUS & NWPU System for Voice Conversion Challenge 2020 X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ... Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | 11 | 2020 |
Accented Text-to-Speech Synthesis with Limited Data X Zhou, M Zhang, Y Zhou, Z Wu, H Li arXiv preprint arXiv:2305.04816, 2023 | 10 | 2023 |
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units J Lu, B Sisman, M Zhang, H Li arXiv preprint arXiv:2306.17005, 2023 | 5 | 2023 |
Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis M Zhang, X Zhou, Z Wu, H Li IEEE Signal Processing Letters, 2023 | 4 | 2023 |
RefXVC: Cross-Lingual Voice Conversion with Enhanced Reference Leveraging M Zhang, Y Zhou, Y Ren, C Zhang, X Yin, H Li arXiv preprint arXiv:2406.16326, 2024 | 3 | 2024 |
Multi-Scale Accent Modeling with Disentangling for Multi-Speaker Multi-Accent TTS Synthesis X Zhou, M Zhang, Y Zhou, Z Wu, H Li arXiv preprint arXiv:2406.10844, 2024 | 3 | 2024 |
Transfer the linguistic representations from TTS to accent conversion with non-parallel data X Chen, J Pei, L Xue, M Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Zero-shot multi-speaker accent TTS with limited accent data M Zhang, Y Zhou, Z Wu, H Li 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | 2 | 2023 |