关注
Zhuo Chen
Zhuo Chen
Bytedance (formerly Microsoft, Columbia University)
在 columbia.edu 的电子邮件经过验证
标题
引用次数
引用次数
年份
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
18202022
Deep clustering: Discriminative embeddings for segmentation and separation
JR Hershey, Z Chen, J Le Roux, S Watanabe
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
16272016
Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation
Y Luo, Z Chen, T Yoshioka
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
8892020
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2301.02111, 2023
6202023
Deep attractor network for single-microphone speaker separation
Z Chen, Y Luo, N Mesgarani
arXiv preprint arXiv:1611.08930, 2016
5172016
Single-channel multi-speaker separation using deep clustering
Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey
arXiv preprint arXiv:1607.02173, 2016
5102016
Speaker-independent speech separation with deep attractor network
Y Luo, Z Chen, N Mesgarani
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (4), 787-796, 2018
3012018
BEATs: Audio Pre-Training with Acoustic Tokenizers
S Chen, Y Wu, C Wang, S Liu, D Tompkins, Z Chen, F Wei
arXiv preprint arXiv:2212.09058, 2022
2782022
Continuous speech separation: Dataset and analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2362020
Deep clustering and conventional networks for music separation: Stronger together
Y Luo, Z Chen, JR Hershey, J Le Roux, N Mesgarani
2017 IEEE international conference on acoustics, speech and signal …, 2017
2142017
End-to-end attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016
2082016
End-to-end microphone permutation and number invariant multi-channel speech separation
Y Luo, Z Chen, N Mesgarani, T Yoshioka
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1972020
Integration of speech enhancement and recognition using long-short term memory recurrent neural network
Z Chen, S Watanabe, H Erdogan, J Hershey
Proc. Interspeech, 1-7, 2015
1882015
Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network
Z Chen, X Xiao, T Yoshioka, H Erdogan, J Li, Y Gong
2018 IEEE Spoken Language Technology Workshop (SLT), 558-565, 2018
1612018
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2303.03926, 2023
1572023
Continuous speech separation with conformer
S Chen, Y Wu, Z Chen, J Wu, J Li, T Yoshioka, C Wang, S Liu, M Zhou
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1512021
Speaker-invariant training via adversarial learning
Z Meng, J Li, Z Chen, Y Zhao, V Mazalov, Y Gong, BH Juang
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1492018
Neural decoding of attentional selection in multi-speaker environments without access to clean sources
J O’Sullivan, Z Chen, J Herrero, GM McKhann, SA Sheth, AD Mehta, ...
Journal of Neural Engineering 14 (5), 056001, 2017
1392017
Multi-Microphone Neural Speech Separation for Far-Field Multi-Talker Speech Recognition
T Yoshioka, H Erdogan, Z Chen, F Alleva
ICASSP 2018, 2018
1372018
On decoder-only architecture for speech-to-text and large language model integration
J Wu, Y Gaur, Z Chen, L Zhou, Y Zhu, T Wang, J Li, S Liu, B Ren, L Liu, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
1032023
系统目前无法执行此操作,请稍后再试。
文章 1–20