Yixuan Zhou (周逸轩)
Other names: Yixuan Zhou
PhD student, Tsinghua University
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Y Zhou, C Song, X Li, L Zhang, Z Wu, Y Bian, D Su, H Meng
Proc. Interspeech 2022, 2573-2577, 2022
Cited by 28, 2022
Towards expressive speaking style modelling with hierarchical context information for mandarin speech synthesis
S Lei, Y Zhou, L Chen, Z Wu, S Kang, H Meng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 21, 2022
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis
Y Zhou, C Song, J Li, Z Wu, Y Bian, D Su, H Meng
Interspeech 2022, 2021
Cited by 16*, 2021
MSStyleTTS: Multi-scale style modeling with hierarchical context information for expressive speech synthesis
S Lei, Y Zhou, L Chen, Z Wu, X Wu, S Kang, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
Cited by 10, 2023
A character-level span-based model for mandarin prosodic structure prediction
X Chen, C Song, Y Zhou, Z Wu, C Chen, Z Wu, H Meng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 10, 2022
Context-aware coherent speaking style prediction with hierarchical transformers for audiobook speech synthesis
S Lei, Y Zhou, L Chen, Z Wu, S Kang, H Meng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 6, 2023
Towards spontaneous style modeling with semi-supervised pre-training for conversational text-to-speech synthesis
W Li, S Lei, Q Huang, Y Zhou, Z Wu, S Kang, H Meng
Interspeech 2023, 2023
Cited by 5, 2023
Syntactic representation learning for neural network based tts with syntactic parse tree traversal
C Song, J Li, Y Zhou, Z Wu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 5, 2021
Spontaneous style text-to-speech synthesis with controllable spontaneous behaviors based on language models
W Li, P Yang, Y Zhong, Y Zhou, Z Wang, Z Wu, X Wu, H Meng
Interspeech 2024, 2024
Cited by 4, 2024
SongCreator: Lyrics-based Universal Song Generation
S Lei, Y Zhou, B Tang, MWY Lam, F Liu, H Liu, J Wu, S Kang, Z Wu, ...
NeurIPS 2024, 2024
Cited by 3, 2024
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
Y Zhou, X Qin, Z Jin, S Zhou, S Lei, S Zhou, Z Wu, J Jia
ACM MultiMedia 2024, 2024
Cited by 3, 2024
Multimodal Emotion Captioning Using Large Language Model with Prompt Engineering
Y Xu, Y Zhou, Y Cai, J Xie, R Ye, Z Wu
Proceedings of the 2nd International Workshop on Multimodal and Responsible …, 2024
Cited by 1, 2024
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts
S Lei, Y Zhou, L Chen, D Luo, Z Wu, X Wu, S Kang, T Jiang, Y Zhou, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 1, 2024
The Codec Language Model-based Zero-Shot Spontaneous Style TTS System for CoVoC Challenge 2024
S Zhou, Y Zhou, W Li, J Chen, R Ye, W Wu, Z Lin, S Lei, Z Wu
2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024
2024
Robust Representation Learning for Multimodal Emotion Recognition with Contrastive Learning and Mixup
Y Cai, R Ye, J Xie, Y Zhou, Y Xu, Z Wu
Proceedings of the 2nd International Workshop on Multimodal and Responsible …, 2024
2024
The THU-HCSI Multi-Speaker Multi-Lingual Few-Shot Voice Cloning System for LIMMITS'24 Challenge
Y Zhou, S Zhou, S Lei, Z Wu, M Wu
ICASSP 2024, 2024
2024
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
S Lei, Y Zhou, L Chen, J Hu, Z Wu, S Kang, H Meng
Proc. Interspeech 2022, 5523-5527, 2022
2022
Articles 1–17