Yanqing Liu

인용

	전체	2020년 이후
서지정보	3106	3048
h-index	19	19
i10-index	23	23

1500

750

375

1125

201920202021202220232024202547 121 261 336 708 1446 151

공개 액세스

모두 보기

자료 4개

자료 0개

공개

비공개

재정 지원 요구사항 기준

팔로우

Yanqing Liu

Microsoft Corporation

microsoft.com의 이메일 확인됨

Text-to-Speech Speech Recognition Speech Edition Overdubbing NLP


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Neural speech synthesis with transformer network N Li, S Liu, Y Liu, S Zhao, M Liu Proceedings of the AAAI conference on artificial intelligence 33 (01), 6706-6713, 2019	917	2019
Neural codec language models are zero-shot text to speech synthesizers C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2301.02111, 2023	623	2023
Naturalspeech: End-to-end text-to-speech synthesis with human-level quality X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	221	2024
Naturalspeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian arXiv preprint arXiv:2304.09116, 2023	212	2023
Adaspeech: Adaptive text to speech for custom voice M Chen, X Tan, B Li, Y Liu, T Qin, S Zhao, TY Liu arXiv preprint arXiv:2103.00993, 2021	197	2021
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2303.03926, 2023	157	2023
Naturalspeech 3: Zero-shot speech synthesis with factorized codec and diffusion models Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ... arXiv preprint arXiv:2403.03100, 2024	126	2024
Close to human quality TTS with transformer N Li, S Liu, Y Liu, S Zhao, M Liu, M Zhou arXiv preprint arXiv:1809.08895 2, 2018	121	2018
Developing RNN-T models surpassing high-performance hybrid models with customization capability J Li, R Zhao, Z Meng, Y Liu, W Wei, S Parthasarathy, V Mazalov, Z Wang, ... arXiv preprint arXiv:2007.15188, 2020	119	2020
Delightfultts: The microsoft speech synthesis system for blizzard challenge 2021 Y Liu, Z Xu, G Wang, K Chen, B Li, X Tan, J Li, L He, S Zhao arXiv preprint arXiv:2110.12612, 2021	67	2021
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers S Chen, S Liu, L Zhou, Y Liu, X Tan, J Li, S Zhao, Y Qian, F Wei arXiv e-prints, arXiv: 2406.05370, 2024	49	2024
Robutrans: A robust transformer-based text-to-speech model N Li, Y Liu, Y Wu, S Liu, S Zhao, M Liu Proceedings of the AAAI conference on artificial intelligence 34 (05), 8228-8235, 2020	47	2020
Prompttts 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... arXiv preprint arXiv:2309.02285, 2023	39	2023
Delightfultts 2: End-to-end speech synthesis with adversarial vector-quantized auto-encoders Y Liu, R Xue, L He, X Tan, S Zhao arXiv preprint arXiv:2207.04646, 2022	30	2022
Towards contextual spelling correction for customization of end-to-end speech recognition systems X Wang, Y Liu, J Li, V Miljanic, S Zhao, H Khalil IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 3089-3097, 2022	26	2022
Mixed-phoneme bert: Improving bert with mixed phoneme and sup-phoneme representations for text to speech G Zhang, K Song, X Tan, D Tan, Y Yan, Y Liu, G Wang, W Zhou, T Qin, ... arXiv preprint arXiv:2203.17190, 2022	25	2022
A light-weight contextual spelling correction model for customizing transducer-based speech recognition systems X Wang, Y Liu, S Zhao, J Li arXiv preprint arXiv:2108.07493, 2021	21	2021
Autoregressive Speech Synthesis without Vector Quantization L Meng, L Zhou, S Liu, S Chen, B Han, S Hu, Y Liu, J Li, S Zhao, X Wu, ... arXiv preprint arXiv:2407.08551, 2024	20	2024
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS SE Eskimez, X Wang, M Thakker, C Li, CH Tsai, Z Xiao, H Yang, Z Zhu, ... arXiv preprint arXiv:2406.18009, 2024	19	2024
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment B Han, L Zhou, S Liu, S Chen, L Meng, Y Qian, Y Liu, S Zhao, J Li, F Wei arXiv preprint arXiv:2406.07855, 2024	14	2024

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용