Follow
Jian Wu (巫健)
Jian Wu (巫健)
Research Scientist, ByteDance
Verified email at bytedance.com
Title
Cited by
Cited by
Year
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
18342022
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
Y Hu, Y Liu, S Lv, M Xing, S Zhang, Y Fu, J Wu, B Zhang, L Xie
Proc. Interspeech 2020, 2472--2476, 2020
7372020
Continuous Speech Separation: Dataset and Analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2362020
Continuous speech separation with conformer
S Chen, Y Wu, Z Chen, J Wu, J Li, T Yoshioka, C Wang, S Liu, M Zhou
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1512021
Time Domain Audio Visual Speech Separation
J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
1432019
Audio-visual Recognition of Overlapped Speech for the LRS2 dataset
J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1072020
On decoder-only architecture for speech-to-text and large language model integration
J Wu, Y Gaur, Z Chen, L Zhou, Y Zhu, T Wang, J Li, S Liu, B Ren, L Liu, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
1042023
Unispeech-sat: Universal speech representation learning with speaker aware pre-training
S Chen, Y Wu, C Wang, Z Chen, Z Chen, S Liu, J Wu, Y Qian, F Wei, J Li, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
992022
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation
F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu
Interspeech 2019, 4574--4578, 2019
982019
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
Y Fu, L Cheng, S Lv, Y Jv, Y Kong, Z Chen, Y Hu, L Xie, J Wu, H Bu, X Xu, ...
Proc. Interspeech 2021, 3665--3669, 2021
902021
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
882019
Streaming multi-talker ASR with token-level serialized output training
N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ...
Proc. Interspeech 2022, 521--525, 2022
632022
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
P Anastassiou, J Chen, J Chen, Y Chen, Z Chen, Z Chen, J Cong, L Deng, ...
arXiv preprint arXiv:2406.02430, 2024
522024
Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music
H Liu, L Xie, J Wu, G Yang
Proc. Interspeech 2020, 1241--1245, 2020
332020
Streaming speaker-attributed ASR with token-level speaker embeddings
N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ...
Proc. Interspeech 2022, 521--525, 2022
302022
Desnet: A multi-channel network for simultaneous speech dereverberation, enhancement and separation
Y Fu, J Wu, Y Hu, M Xing, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 857-864, 2021
302021
An End-to-end Architecture of Online Multi-channel Speech Separation
J Wu, Z Chen, J Li, T Yoshioka, Z Tan, E Lin, Y Luo, L Xie
Proc. Interspeech 2020, 3066--3070, 2020
252020
Cosmic: Data efficient instruction-tuning for speech in-context learning
J Pan, J Wu, Y Gaur, S Sivasankaran, Z Chen, S Liu, J Li
arXiv preprint arXiv:2311.02248, 2023
222023
Investigation of Practical Aspects of Single Channel Speech Separation for ASR
J Wu, Z Chen, S Chen, Y Wu, T Yoshioka, N Kanda, S Liu, J Li
Proc. Interspeech 2021, 3066--3070, 2021
182021
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge
L Zhang, J Wu, L Xie
Proc. Interspeech 2020, 3471--3475, 2020
162020
The system can't perform the operation now. Try again later.
Articles 1–20