Kun Zhou

Cited by

	All	Since 2020
Citations	947	946
h-index	12	12
i10-index	13	13

420

210

105

315

20202021202220232024202517 87 156 238 414 31

Public access

View all

9 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Haizhou LiThe Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, SingaporeVerified email at u.nus.edu
Berrak SismanAssistant Professor, Johns Hopkins UniversityVerified email at jhu.edu
Björn SchullerProfessor, Technische Universität München (TUM) / Imperial College London & CSO, audEERINGVerified email at tum.de
Bin MaAlibabaVerified email at alibaba-inc.com
Rajib RanaProfessor of Computer Science, UniSQVerified email at usq.edu.au
Sho InoueThe Chinese University of Hong Kong, Shenzhen, PhD Candidate in Computer ScienceVerified email at link.cuhk.edu.cn
Carlos BussoProfessor of Language Technologies Institute, Carnegie Mellon UniversityVerified email at cmu.edu

Kun Zhou

Alibaba Group; National University of Singapore

Verified email at u.nus.edu - Homepage

Speech Synthesis Affective Computing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset K Zhou, B Sisman, R Liu, H Li ICASSP 2021, 2021	238	2021
Emotional Voice Conversion: Theory, Databases and ESD K Zhou, B Sisman, R Liu, H Li Speech Communication, 2022	185	2022
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data K Zhou, B Sisman, H Li Speaker Odyssey 2020, 2020	88	2020
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion K Zhou, B Sisman, M Zhang, H Li INTERSPEECH 2020, 2020	67	2020
Emotion Intensity and its Control for Emotional Voice Conversion K Zhou, B Sisman, R Rana, BW Schuller, H Li IEEE Transactions on Affective Computing 14 (1), 31-48, 2022	63	2022
Speech Synthesis with Mixed Emotions K Zhou, B Sisman, R Rana, BW Schuller, H Li IEEE Transactions on Affective Computing 14 (4), 3120-3134, 2023	60	2023
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech K Zhou, B Sisman, H Li IEEE Spoken Language Technology Workshop (SLT), 2021, 2021	49	2021
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training K Zhou, B Sisman, H Li INTERSPEECH 2021, 2021	42	2021
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion Z Du, B Sisman, K Zhou, H Li INTERSPEECH 2022, 2022	34*	2022
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Z Du, B Sisman, K Zhou, H Li IEEE Automatic Speech Recognition and Undertanding Workshop (ASRU), 2021, 2021	24	2021
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data J Lu, K Zhou, B Sisman, H Li Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2020	22	2020
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation S Zhao, Y Ma, C Ni, C Zhang, H Wang, TH Nguyen, K Zhou, J Yip, D Ng, ... ICASSP 2024, 2023	21	2023
The NUS & NWPU system for Voice Conversion Challenge 2020 X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ... Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020	11	2020
Spectrum and Prosody Conversion for Cross-Lingual Voice Conversion with CycleGAN Z Du, K Zhou, B Sisman, H Li Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2020	7	2020
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis S Inoue, K Zhou, S Wang, H Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	6	2024
SPGM: Prioritizing Local Features for enhanced speech separation performance JQ Yip, S Zhao, Y Ma, C Ni, C Zhang, H Wang, TH Nguyen, K Zhou, D Ng, ... ICASSP 2024, 2023	6	2023
Mixed emotion modelling for emotional voice conversion K Zhou, B Sisman, C Busso, H Li Speaker Odyssey 6, 7, 2024	5	2024
Emotion Modelling for Speech Generation K Zhou Phd thesis, National University of Singapore, 2023	5*	2023
Emotional dimension control in language model-based text-to-speech: Spanning a broad spectrum of human emotions K Zhou, Y Zhang, S Zhao, H Wang, Z Pan, D Ng, C Zhang, C Ni, Y Ma, ... arXiv preprint arXiv:2409.16681, 2024	3	2024
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis K Zhou, S Zhao, Y Ma, C Zhang, H Wang, D Ng, C Ni, NT Hieu, JQ Yip, ... INTERSPEECH 2024, 2024	3	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors