Guangzhi Sun

Cited by

	All	Since 2020
Citations	910	907
h-index	13	13
i10-index	16	16

Citations per year: 2020: 34; 2021: 106; 2022: 99; 2023: 143; 2024: 432; 2025: 88

Public access: 4 articles available, 0 articles not available (based on funding mandates)

Guangzhi Sun

University of Cambridge

Verified email at cam.ac.uk - Homepage

Speech and language technology, conversational AI


Title	Cited by	Year
SALMONN: Towards generic hearing abilities for large language models; C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang; arXiv preprint arXiv:2310.13289, 2023	236	2023
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis; G Sun, Y Zhang, RJ Weiss, Y Cao, H Zen, Y Wu; ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	154	2020
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and autoregressive prosody prior; G Sun, Y Zhang, RJ Weiss, Y Cao, H Zen, A Rosenberg, B Ramabhadran, ...; ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	121*	2020
Connecting speech encoder and large language model for ASR; W Yu, C Tang, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang; ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	49	2024
Transformer language models with LSTM-based cross-utterance information representation; G Sun, C Zhang, PC Woodland; ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	44	2021
Speaker diarisation using 2D self-attentive combination of embeddings; G Sun, C Zhang, PC Woodland; ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	39	2019
Tree-constrained pointer generator for end-to-end contextual speech recognition; G Sun, C Zhang, PC Woodland; 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	28	2021
video-SALMONN: Speech-enhanced audio-visual large language models; G Sun, W Yu, C Tang, X Chen, T Tan, W Li, L Lu, Z Ma, Y Wang, C Zhang; arXiv preprint arXiv:2406.15704, 2024	25*	2024
Large language models surpass human experts in predicting neuroscience results; X Luo, A Rechardt, G Sun, KK Nejad, F Yáñez, B Yilmaz, K Lee, ...; Nature Human Behaviour, 1-11, 2024	22	2024
Combination of deep speaker embeddings for diarisation; G Sun, C Zhang, PC Woodland; Neural Networks 141, 372-384, 2021	22	2021
Can contextual biasing remain effective with Whisper and GPT-2?; G Sun, X Zheng, C Zhang, PC Woodland; arXiv preprint arXiv:2306.01942, 2023	18	2023
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch; J Hwang, M Hira, C Chen, X Zhang, Z Ni, G Sun, P Ma, R Huang, V Pratap, ...; 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-9, 2023	17	2023
Minimising biasing word errors for contextual ASR with the tree-constrained pointer generator; G Sun, C Zhang, PC Woodland; IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 345-354, 2022	16	2022
Building better AI agents: A provocation on the utilisation of persona in LLM-based conversational agents; G Sun, X Zhan, J Such; Proceedings of the 6th ACM Conference on Conversational User Interfaces, 1-6, 2024	12	2024
Affect recognition in conversations using large language models; S Feng, G Sun, N Lubis, W Wu, C Zhang, M Gašić; arXiv preprint arXiv:2309.12881, 2023	12	2023
Tree-constrained pointer generator with graph neural network encodings for contextual speech recognition; G Sun, C Zhang, PC Woodland; arXiv preprint arXiv:2207.00857, 2022	11	2022
Extending large language models for speech and audio captioning; C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang; ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	9	2024
End-to-end spoken language understanding with tree-constrained pointer generator; G Sun, C Zhang, PC Woodland; ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	9	2023
Knowledge-aware audio-grounded generative slot filling for limited annotated data; G Sun, C Zhang, I Vulić, P Budzianowski, PC Woodland; Computer Speech & Language 89, 101707, 2025	7	2025
Cross-utterance conditioned VAE for non-autoregressive text-to-speech; Y Li, C Yu, G Sun, H Jiang, F Sun, W Zu, Y Wen, Y Yang, J Wang; arXiv preprint arXiv:2205.04120, 2022	7	2022
