Zhihao Du

Процитировано

	Все	Начиная с 2020 г.
Статистика цитирования	599	599
h-индекс	13	13
i10-индекс	15	15

380

190

285

2020202120222023202420255 11 41 105 362 75

Общий доступ

Просмотреть все

5 статей

3 статьи

доступно

недоступно

На основе финансирования

Zhihao Du

Alibaba

Подтвержден адрес электронной почты в домене alibaba-inc.com

Speech separation speech enchancement speaker diarization


Название По числу цитат По году По названию	Процитировано Процитировано	Год
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	102	2022
Lauragpt: Listen, attend, understand, and regenerate audio with gpt Z Du, J Wang, Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, ... arXiv preprint arXiv:2310.04673, 2023	68	2023
Cosyvoice: A scalable multilingual zero-shot text-to-speech synthesizer based on supervised semantic tokens Z Du, Q Chen, S Zhang, K Hu, H Lu, Y Yang, H Hu, S Zheng, Y Gu, Z Ma, ... arXiv preprint arXiv:2407.05407, 2024	63	2024
Funcodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec Z Du, S Zhang, K Hu, S Zheng ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	53	2024
Funasr: A fundamental end-to-end speech recognition toolkit Z Gao, Z Li, J Wang, H Luo, X Shi, M Chen, Y Li, L Zuo, Z Du, Z Xiao, ... arXiv preprint arXiv:2305.11013, 2023	53	2023
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	31	2022
An embarrassingly simple approach for LLM with strong ASR capacity Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ... arXiv preprint arXiv:2402.08846, 2024	26	2024
Funaudiollm: Voice understanding and generation foundation models for natural interaction between humans and llms K An, Q Chen, C Deng, Z Du, C Gao, Z Gao, Y Gu, T He, H Hu, K Hu, S Ji, ... arXiv preprint arXiv:2407.04051, 2024	25	2024
A joint framework of denoising autoencoder and generative vocoder for monaural speech enhancement Z Du, X Zhang, J Han IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1493-1505, 2020	23	2020
Acoustic scene classification by implicitly identifying distinct sound events H Song, J Han, S Deng, Z Du arXiv preprint arXiv:1904.05204, 2019	18	2019
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis Z Du, S Zhang, S Zheng, Z Yan EMNLP 2022, 2022	15	2022
A comparative study on speaker-attributed automatic speech recognition in multi-party meetings F Yu, Z Du, S Zhang, Y Lin, L Xie arXiv preprint arXiv:2203.16834, 2022	14	2022
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario F Yu, S Zhang, P Guo, Y Liang, Z Du, Y Lin, L Xie 2022 IEEE Spoken Language Technology Workshop (SLT), 144-151, 2023	13	2023
Self-Supervised Adversarial Multi-Task Learning for Vocoder-Based Monaural Speech Enhancement. Z Du, M Lei, J Han, S Zhang Interspeech, 3271-3275, 2020	13	2020
An efficient joint training framework for robust small-footprint keyword spotting Y Gu, Z Du, H Zhang, X Zhang International Conference on Neural Information Processing, 12-23, 2020	12*	2020
Casa-asr: Context-aware speaker-attributed asr M Shi, Z Du, Q Chen, F Yu, Y Li, S Zhang, J Zhang, LR Dai arXiv preprint arXiv:2305.12459, 2023	7	2023
Cosyvoice 2: Scalable streaming speech synthesis with large language models Z Du, Y Wang, Q Chen, X Shi, X Lv, T Zhao, Z Gao, Y Yang, C Gao, ... arXiv preprint arXiv:2412.10117, 2024	6	2024
Omniflatten: An end-to-end gpt model for seamless voice conversation Q Zhang, L Cheng, C Deng, Q Chen, W Wang, S Zheng, J Liu, H Yu, ... arXiv preprint arXiv:2410.17799, 2024	6	2024
Told: A novel two-stage overlap-aware framework for speaker diarization J Wang, Z Du, S Zhang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	6	2023
Pan: Phoneme-aware network for monaural speech enhancement Z Du, M Lei, J Han, S Zhang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	6	2020

В данный момент система не может выполнить эту операцию. Повторите попытку позднее.

Статьи 1–20

Ссылок за год

Повторяющиеся цитирования

Объединенные цитирования

СоавторыСоавторы

Подписаться

Процитировано