Ziyang Ma
Title · Cited by · Year
emotion2vec: Self-supervised pre-training for speech emotion representation
Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen
Proc. ACL 2024, 2024
Cited by 80 · 2024
LauraGPT: Listen, attend, understand, and regenerate audio with GPT
Z Du, J Wang, Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, ...
arXiv preprint arXiv:2310.04673, 2023
Cited by 68* · 2023
CosyVoice: A scalable multilingual zero-shot text-to-speech synthesizer based on supervised semantic tokens
Z Du, Q Chen, S Zhang, K Hu, H Lu, Y Yang, H Hu, S Zheng, Y Gu, Z Ma, ...
arXiv preprint arXiv:2407.05407, 2024
Cited by 62 · 2024
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity
Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ...
arXiv preprint arXiv:2402.08846, 2024
Cited by 41 · 2024
ChatMusician: Understanding and generating music intrinsically with LLM
R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ...
Proc. ACL 2024, 2024
Cited by 35 · 2024
ELLA-V: Stable neural codec language modeling with alignment-guided sequence reordering
Y Song, Z Chen, X Wang, Z Ma, X Chen
Proc. AAAI 2025, 2024
Cited by 31 · 2024
VoiceFlow: Efficient text-to-speech with rectified flow matching
Y Guo, C Du, Z Ma, X Chen, K Yu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 28* · 2024
Towards universal speech discrete tokens: A case study for asr and tts
Y Yang, F Shen, C Du, Z Ma, K Yu, D Povey, X Chen
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 28* · 2024
MAP-Neo: Highly capable and transparent bilingual large language model series
G Zhang, S Qu, J Liu, C Zhang, C Lin, CL Yu, D Pan, E Cheng, J Liu, ...
arXiv preprint arXiv:2405.19327, 2024
Cited by 27 · 2024
FunAudioLLM: Voice understanding and generation foundation models for natural interaction between humans and LLMs
K An, Q Chen, C Deng, Z Du, C Gao, Z Gao, Y Gu, T He, H Hu, K Hu, S Ji, ...
arXiv preprint arXiv:2407.04051, 2024
Cited by 25 · 2024
MT4SSL: Boosting self-supervised speech representation learning by integrating multiple targets
Z Ma, Z Zheng, C Tang, Y Wang, X Chen
Proc. Interspeech 2023 Best Student Paper Shortlist, 2023
Cited by 22 · 2023
MER 2024: Semi-supervised learning, noise robustness, and open-vocabulary multimodal emotion recognition
Z Lian, H Sun, L Sun, Z Wen, S Zhang, S Chen, H Gu, J Zhao, Z Ma, ...
Proceedings of the 2nd International Workshop on Multimodal and Responsible …, 2024
Cited by 21 · 2024
F5-TTS: A fairytaler that fakes fluent and faithful speech with flow matching
Y Chen, Z Niu, Z Ma, K Deng, C Wang, J Zhao, K Yu, X Chen
arXiv preprint arXiv:2410.06885, 2024
Cited by 20 · 2024
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark
Z Ma, M Chen, H Zhang, Z Zheng, W Chen, X Li, J Ye, X Chen, T Hain
Proc. Interspeech 2024, 2024
Cited by 17 · 2024
Leveraging speech PTM, text LLM, and emotional TTS for speech emotion recognition
Z Ma, W Wu, Z Zheng, Y Guo, Q Chen, S Zhang, X Chen
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 17 · 2024
EAT: Self-supervised pre-training with efficient audio transformer
W Chen, Y Liang, Z Ma, Z Zheng, X Chen
Proc. IJCAI 2024, 2024
Cited by 16 · 2024
Language Model Can Listen While Speaking
Z Ma, Y Song, C Du, J Cong, Z Chen, Y Wang, Y Wang, X Chen
Proc. AAAI 2025, 2024
Cited by 14* · 2024
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR
Z Song, J Zhuo, Y Yang, Z Ma, S Zhang, X Chen
Proc. Interspeech 2024, 2024
Cited by 13 · 2024
Chinese tiny llm: Pretraining a chinese-centric large language model
X Du, Z Yu, S Gao, D Pan, Y Cheng, Z Ma, R Yuan, X Qu, J Liu, T Zheng, ...
Proc. 1st COLM, 2024
Cited by 13 · 2024
Foundation models for music: A survey
Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis, C Donahue, C Lin, ...
arXiv preprint arXiv:2408.14340, 2024
Cited by 12 · 2024
Articles 1–20