Jian Wu (巫健)

Cited by

	All	Since 2020
Citations	4085	4072
h-index	18	18
i10-index	27	27

1800

900

450

1350

201920202021202220232024202511 71 284 663 1116 1779 150

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Zhuo ChenBytedance (formerly Microsoft, Columbia University)Verified email at columbia.edu
Jinyu LiPartner Applied Science Manager, MicrosoftVerified email at microsoft.com
Takuya YoshiokaAssemblyAIVerified email at assemblyai.com
Naoyuki KandaMetaVerified email at meta.com
Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
Shujie Liu (刘树杰）Microsoft Research AsiaVerified email at microsoft.com
Xiong XiaoPrincipal Applied scientist, MicrosoftVerified email at microsoft.com
Yihui FuTechnische Universität BraunschweigVerified email at tu-braunschweig.de
Shi-Xiong (Austin) ZhangSr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhDVerified email at capitalone.com
Xiaofei WangMicrosoftVerified email at jhu.edu
Yashesh GaurMeta, GenAI, Llama foundation modelsVerified email at cs.cmu.edu
Meng YUTencent AI LabVerified email at tencent.com
Yong XuAI Research Scientist, Meta, USAVerified email at meta.com
Yi LuoVerified email at columbia.edu
Rongzhi GuTencent AI LabVerified email at pku.edu.cn
Sanyuan Chen (陈三元)Meta FAIRVerified email at meta.com
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Fahimeh BahmaninezhadMicrosoftVerified email at microsoft.com
Yu Wu (吴俣)DeepSeek AIVerified email at deepseek.com

Jian Wu (巫健)

Research Scientist, ByteDance

Verified email at bytedance.com

Speech Recognition Speech Enhancement Speech Separation Speech Generation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Wavlm: Large-scale self-supervised pre-training for full stack speech processing S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022	1834	2022
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement Y Hu, Y Liu, S Lv, M Xing, S Zhang, Y Fu, J Wu, B Zhang, L Xie Proc. Interspeech 2020, 2472--2476, 2020	737	2020
Continuous Speech Separation: Dataset and Analysis Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	236	2020
Continuous speech separation with conformer S Chen, Y Wu, Z Chen, J Wu, J Li, T Yoshioka, C Wang, S Liu, M Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	151	2021
Time Domain Audio Visual Speech Separation J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	143	2019
Audio-visual Recognition of Overlapped Speech for the LRS2 dataset J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	107	2020
On decoder-only architecture for speech-to-text and large language model integration J Wu, Y Gaur, Z Chen, L Zhou, Y Zhu, T Wang, J Li, S Liu, B Ren, L Liu, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	104	2023
Unispeech-sat: Universal speech representation learning with speaker aware pre-training S Chen, Y Wu, C Wang, Z Chen, Z Chen, S Liu, J Wu, Y Qian, F Wei, J Li, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	99	2022
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu Interspeech 2019, 4574--4578, 2019	98	2019
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario Y Fu, L Cheng, S Lv, Y Jv, Y Kong, Z Chen, Y Hu, L Xie, J Wu, H Bu, X Xu, ... Proc. Interspeech 2021, 3665--3669, 2021	90	2021
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019	88	2019
Streaming multi-talker ASR with token-level serialized output training N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ... Proc. Interspeech 2022, 521--525, 2022	63	2022
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models P Anastassiou, J Chen, J Chen, Y Chen, Z Chen, Z Chen, J Cong, L Deng, ... arXiv preprint arXiv:2406.02430, 2024	52	2024
Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music H Liu, L Xie, J Wu, G Yang Proc. Interspeech 2020, 1241--1245, 2020	33	2020
Streaming speaker-attributed ASR with token-level speaker embeddings N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ... Proc. Interspeech 2022, 521--525, 2022	30	2022
Desnet: A multi-channel network for simultaneous speech dereverberation, enhancement and separation Y Fu, J Wu, Y Hu, M Xing, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 857-864, 2021	30	2021
An End-to-end Architecture of Online Multi-channel Speech Separation J Wu, Z Chen, J Li, T Yoshioka, Z Tan, E Lin, Y Luo, L Xie Proc. Interspeech 2020, 3066--3070, 2020	25	2020
Cosmic: Data efficient instruction-tuning for speech in-context learning J Pan, J Wu, Y Gaur, S Sivasankaran, Z Chen, S Liu, J Li arXiv preprint arXiv:2311.02248, 2023	22	2023
Investigation of Practical Aspects of Single Channel Speech Separation for ASR J Wu, Z Chen, S Chen, Y Wu, T Yoshioka, N Kanda, S Liu, J Li Proc. Interspeech 2021, 3066--3070, 2021	18	2021
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge L Zhang, J Wu, L Xie Proc. Interspeech 2020, 3471--3475, 2020	16	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors