Rongzhi Gu

Посилання

	Усі	З 2020
Цитування	815	800
h-індекс	13	13
i10-індекс	16	16

260

130

195

201820192020202120222023202420255 10 71 147 129 170 250 33

Доступні для всіх

Переглянути всі

4 статті

1 стаття

доступні

недоступні

За умовами фінансування

Співавтори

Shi-Xiong (Austin) ZhangSr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhDПідтверджена електронна адреса в capitalone.com
Yi LuoПідтверджена електронна адреса в columbia.edu

Підписатись

Rongzhi Gu

Tencent AI Lab

Підтверджена електронна адреса в pku.edu.cn

Speech separation


Назва Сортувати за цитуваннями Сортувати за роком Сортувати за назвою	Посилання Посилання	Рік
Multi-modal multi-channel target speech separation R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020	117	2020
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu Interspeech, 4290-4294, 2019	104	2019
A comprehensive study of speech separation: spectrogram vs waveform separation F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu arXiv preprint arXiv:1905.07497, 2019	98	2019
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019	95	2019
Enhancing end-to-end multi-channel speech separation via spatial feature learning R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	69	2020
Complex neural spatial filter: Enhancing multi-channel target speech separation in complex domain R Gu, SX Zhang, Y Zou, D Yu IEEE Signal Processing Letters 28, 1370-1374, 2021	42	2021
Towards unified all-neural beamforming for time and frequency domain speech separation R Gu, SX Zhang, Y Zou, D Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 849-862, 2022	29	2022
High fidelity speech enhancement with band-split rnn J Yu, Y Luo, H Chen, R Gu, C Weng arXiv preprint arXiv:2212.00406, 2022	29	2022
Parameter-efficient transfer learning of pre-trained transformer models for speaker verification using adapters J Peng, T Stafylakis, R Gu, O Plchot, L Mošner, L Burget, J Černocký ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	27	2023
Target confusion in end-to-end speaker extraction: Analysis and approaches Z Zhao, D Yang, R Gu, H Zhang, Y Zou arXiv preprint arXiv:2204.01355, 2022	23	2022
Audio-visual multi-channel recognition of overlapped speech J Yu, B Wu, R Gu, SX Zhang, L Chen, YXM Yu, D Su, D Yu, X Liu, H Meng arXiv preprint arXiv:2005.08571, 2020	21	2020
Temporal-spatial neural filter: Direction informed end-to-end multi-channel target speech separation R Gu, Y Zou arXiv preprint arXiv:2001.00391, 2020	21	2020
Rezero: Region-customizable sound extraction R Gu, Y Luo IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024	13	2024
Tspeech-ai system description to the 5th deep noise suppression (dns) challenge J Yu, H Chen, Y Luo, R Gu, W Li, C Weng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	13	2023
3d spatial features for multi-channel target speech separation R Gu, SX Zhang, M Yu, D Yu 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	13	2021
Improving dual-microphone speech enhancement by learning cross-channel features with multi-head attention X Xu, R Gu, Y Zou ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	11	2022
The Sound Demixing Challenge 2023$\unicode {x2013} $ Cinematic Demixing Track S Uhlich, G Fabbro, M Hirano, S Takahashi, G Wichern, JL Roux, ... arXiv preprint arXiv:2308.06981, 2023	9	2023
Learning a robust DOA estimation model with acoustic vector sensor cues Y Zou, R Gu, D Wang, A Jiang, CH Ritz 2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017	9	2017
Ultra dual-path compression for joint echo cancellation and noise suppression H Chen, J Yu, Y Luo, R Gu, W Li, Z Lu, C Weng arXiv preprint arXiv:2308.11053, 2023	8	2023
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform. J Peng, X Qu, J Wang, R Gu, J Xiao, L Burget, J Cernocký Interspeech, 511-515, 2021	8	2021

У даний момент система не може виконати операцію. Спробуйте пізніше.

Статті 1–20

Кількість бібліографічних посилань на рік

Повторювані посилання

Об’єднані посилання

Додати співавторівСпівавтори

Підписатись

Посилання

Співавтори