متابعة
Liumeng Xue
عنوان
عدد مرات الاقتباسات
عدد مرات الاقتباسات
السنة
Controllable emotion transfer for end-to-end speech synthesis
T Li, S Yang, L Xue, L Xie
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
1002021
Pre-alignment guided attention for improving training efficiency and model stability in end-to-end speech synthesis
X Zhu, Y Zhang, S Yang, L Xue, L Xie
IEEE Access 7, 65955-65964, 2019
392019
On the localness modeling for the self-attention based end-to-end speech synthesis
S Yang, H Lu, S Kang, L Xue, J Xiao, D Su, L Xie, D Yu
Neural networks 125, 121-130, 2020
382020
Building a mixed-lingual neural TTS system with only monolingual data
L Xue, W Song, G Xu, L Xie, Z Wu
arXiv preprint arXiv:1904.06063, 2019
372019
Chatmusician: Understanding and generating music intrinsically with llm
R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ...
arXiv preprint arXiv:2402.16153, 2024
362024
Amphion: an Open-Source Audio, Music, and Speech Generation Toolkit
X Zhang, L Xue, Y Gu, Y Wang, J Li, H He, C Wang, S Liu, X Chen, ...
2024 IEEE Spoken Language Technology Workshop (SLT), 879-884, 2024
272024
Paratts: Learning linguistic and prosodic cross-sentence information in paragraph-based tts
L Xue, FK Soong, S Zhang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2854-2864, 2022
262022
Cycle consistent network for end-to-end style transfer TTS training
L Xue, S Pan, L He, L Xie, FK Soong
Neural Networks 140, 223-236, 2021
242021
Building a controllable expressive speech synthesis system with multiple emotion strengths
X Zhu, L Xue
Cognitive Systems Research 59, 151-159, 2020
222020
Single-codec: Single-codebook speech codec towards high-performance speech generation
H Li, L Xue, H Guo, X Zhu, Y Lv, L Xie, Y Chen, H Yin, Z Li
arXiv preprint arXiv:2406.07422, 2024
212024
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
192023
Wenetspeech4tts: A 12,800-hour mandarin tts corpus for large speech generation model benchmark
L Ma, D Guo, K Song, Y Jiang, S Wang, L Xue, W Xu, H Zhao, B Zhang, ...
arXiv preprint arXiv:2406.05763, 2024
182024
Multi-scale sub-band constant-q transform discriminator for high-fidelity vocoder
Y Gu, X Zhang, L Xue, Z Wu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
132024
Leveraging content-based features from multiple acoustic models for singing voice conversion
X Zhang, Y Gu, H Chen, Z Fang, L Zou, L Xue, Z Wu
arXiv preprint arXiv:2310.11160, 2023
9*2023
A comparison of expressive speech synthesis approaches based on neural network
L Xue, X Zhu, X An, L Xie
Proceedings of the Joint Workshop of the 4th Workshop on Affective Social …, 2018
82018
Spontts: modeling and transferring spontaneous style for tts
H Li, X Zhu, L Xue, Y Song, Y Chen, L Xie
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
62024
SingVisio: Visual analytics of diffusion model for singing voice conversion
L Xue, C Wang, M Wang, X Zhang, J Han, Z Wu
Computers & Graphics 124, 104058, 2024
42024
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders
Y Gu, X Zhang, L Xue, H Li, Z Wu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
32024
Multi-level temporal-channel speaker retrieval for zero-shot voice conversion
Z Wang, L Xue, Q Kong, L Xie, Y Chen, Q Tian, Y Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
32024
Transfer the linguistic representations from tts to accent conversion with non-parallel data
X Chen, J Pei, L Xue, M Zhang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
32024
يتعذر على النظام إجراء العملية في الوقت الحالي. عاود المحاولة لاحقًا.
مقالات 1–20