Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset K Zhou, B Sisman, R Liu, H Li ICASSP 2021, 2021 | 238 | 2021 |
Emotional Voice Conversion: Theory, Databases and ESD K Zhou, B Sisman, R Liu, H Li Speech Communication, 2022 | 185 | 2022 |
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data K Zhou, B Sisman, H Li Speaker Odyssey 2020, 2020 | 88 | 2020 |
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion K Zhou, B Sisman, M Zhang, H Li INTERSPEECH 2020, 2020 | 67 | 2020 |
Emotion Intensity and its Control for Emotional Voice Conversion K Zhou, B Sisman, R Rana, BW Schuller, H Li IEEE Transactions on Affective Computing 14 (1), 31-48, 2022 | 63 | 2022 |
Speech Synthesis with Mixed Emotions K Zhou, B Sisman, R Rana, BW Schuller, H Li IEEE Transactions on Affective Computing 14 (4), 3120-3134, 2023 | 60 | 2023 |
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech K Zhou, B Sisman, H Li IEEE Spoken Language Technology Workshop (SLT), 2021, 2021 | 49 | 2021 |
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training K Zhou, B Sisman, H Li INTERSPEECH 2021, 2021 | 42 | 2021 |
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion Z Du, B Sisman, K Zhou, H Li INTERSPEECH 2022, 2022 | 34* | 2022 |
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Z Du, B Sisman, K Zhou, H Li IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021, 2021 | 24 | 2021 |
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data J Lu, K Zhou, B Sisman, H Li Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2020 | 22 | 2020 |
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation S Zhao, Y Ma, C Ni, C Zhang, H Wang, TH Nguyen, K Zhou, J Yip, D Ng, ... ICASSP 2024, 2023 | 21 | 2023 |
The NUS & NWPU system for Voice Conversion Challenge 2020 X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ... Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | 11 | 2020 |
Spectrum and Prosody Conversion for Cross-Lingual Voice Conversion with CycleGAN Z Du, K Zhou, B Sisman, H Li Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2020 | 7 | 2020 |
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis S Inoue, K Zhou, S Wang, H Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
SPGM: Prioritizing Local Features for enhanced speech separation performance JQ Yip, S Zhao, Y Ma, C Ni, C Zhang, H Wang, TH Nguyen, K Zhou, D Ng, ... ICASSP 2024, 2023 | 6 | 2023 |
Mixed emotion modelling for emotional voice conversion K Zhou, B Sisman, C Busso, H Li Speaker Odyssey 6, 7, 2024 | 5 | 2024 |
Emotion Modelling for Speech Generation K Zhou PhD thesis, National University of Singapore, 2023 | 5* | 2023 |
Emotional dimension control in language model-based text-to-speech: Spanning a broad spectrum of human emotions K Zhou, Y Zhang, S Zhao, H Wang, Z Pan, D Ng, C Zhang, C Ni, Y Ma, ... arXiv preprint arXiv:2409.16681, 2024 | 3 | 2024 |
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis K Zhou, S Zhao, Y Ma, C Zhang, H Wang, D Ng, C Ni, NT Hieu, JQ Yip, ... INTERSPEECH 2024, 2024 | 3 | 2024 |