Rongzhi Gu
Tencent AI Lab
Verified email at pku.edu.cn
Title
Cited by
Year
Multi-modal multi-channel target speech separation
R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020
Cited by: 116; Year: 2020
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
Interspeech, 4290-4294, 2019
Cited by: 105; Year: 2019
A comprehensive study of speech separation: spectrogram vs waveform separation
F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu
arXiv preprint arXiv:1905.07497, 2019
Cited by: 98; Year: 2019
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
Cited by: 95; Year: 2019
Enhancing end-to-end multi-channel speech separation via spatial feature learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 68; Year: 2020
Complex neural spatial filter: Enhancing multi-channel target speech separation in complex domain
R Gu, SX Zhang, Y Zou, D Yu
IEEE Signal Processing Letters 28, 1370-1374, 2021
Cited by: 44; Year: 2021
High fidelity speech enhancement with band-split rnn
J Yu, Y Luo, H Chen, R Gu, C Weng
arXiv preprint arXiv:2212.00406, 2022
Cited by: 29; Year: 2022
Towards unified all-neural beamforming for time and frequency domain speech separation
R Gu, SX Zhang, Y Zou, D Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 849-862, 2022
Cited by: 28; Year: 2022
Parameter-efficient transfer learning of pre-trained transformer models for speaker verification using adapters
J Peng, T Stafylakis, R Gu, O Plchot, L Mošner, L Burget, J Černocký
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 26; Year: 2023
Target confusion in end-to-end speaker extraction: Analysis and approaches
Z Zhao, D Yang, R Gu, H Zhang, Y Zou
arXiv preprint arXiv:2204.01355, 2022
Cited by: 22; Year: 2022
Audio-visual multi-channel recognition of overlapped speech
J Yu, B Wu, R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, D Yu, X Liu, H Meng
arXiv preprint arXiv:2005.08571, 2020
Cited by: 22; Year: 2020
Temporal-spatial neural filter: Direction informed end-to-end multi-channel target speech separation
R Gu, Y Zou
arXiv preprint arXiv:2001.00391, 2020
Cited by: 20; Year: 2020
3d spatial features for multi-channel target speech separation
R Gu, SX Zhang, M Yu, D Yu
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by: 14; Year: 2021
Rezero: Region-customizable sound extraction
R Gu, Y Luo
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 12; Year: 2024
Tspeech-ai system description to the 5th deep noise suppression (dns) challenge
J Yu, H Chen, Y Luo, R Gu, W Li, C Weng
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 12; Year: 2023
Improving dual-microphone speech enhancement by learning cross-channel features with multi-head attention
X Xu, R Gu, Y Zou
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 11; Year: 2022
The Sound Demixing Challenge 2023-Cinematic Demixing Track.
S Uhlich, G Fabbro, M Hirano, S Takahashi, G Wichern, J Le Roux, ...
Trans. Int. Soc. Music. Inf. Retr. 7 (1), 44-62, 2024
Cited by: 8; Year: 2024
Ultra dual-path compression for joint echo cancellation and noise suppression
H Chen, J Yu, Y Luo, R Gu, W Li, Z Lu, C Weng
arXiv preprint arXiv:2308.11053, 2023
Cited by: 8; Year: 2023
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.
J Peng, X Qu, J Wang, R Gu, J Xiao, L Burget, J Černocký
Interspeech, 511-515, 2021
Cited by: 8; Year: 2021
Learning a robust DOA estimation model with acoustic vector sensor cues
Y Zou, R Gu, D Wang, A Jiang, CH Ritz
2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017
Cited by: 8; Year: 2017
Articles 1–20