Online direction of arrival estimation based on deep learning Q Li, X Zhang, H Li 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 79 | 2018 |
Speakerfilter: Deep learning-based target speaker extraction using anchor speech S He, H Li, X Zhang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 34 | 2020 |
A robust text-independent speaker verification method based on speech separation and deep speaker F Zhao, H Li, X Zhang ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 33 | 2019 |
Using optimal ratio mask as training target for supervised speech separation S Xia, H Li, X Zhang 2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017 | 32 | 2017 |
DBNet: A dual-branch network architecture processing on spectrum and waveform for single-channel speech enhancement K Zhang, S He, H Li, X Zhang arXiv preprint arXiv:2105.02436, 2021 | 18 | 2021 |
Exploiting spectro-temporal structures using NMF for DNN-based supervised speech separation S Nie, S Liang, H Li, XL Zhang, ZL Yang, WJ Liu, LK Dong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 16 | 2016 |
Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation. H Li, S Nie, X Zhang, H Zhang Interspeech, 550-554, 2016 | 14 | 2016 |
Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning. H Li, DL Wang, X Zhang, G Gao Interspeech, 4626-4630, 2020 | 13 | 2020 |
Recurrent neural networks and acoustic features for frame-level signal-to-noise ratio estimation H Li, DL Wang, X Zhang, G Gao IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2878-2887, 2021 | 9 | 2021 |
Neural multi-channel and multi-microphone acoustic echo cancellation C Zhang, J Liu, H Li, X Zhang IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2181-2192, 2023 | 8 | 2023 |
Model Compression by Iterative Pruning with Knowledge Distillation and Its Application to Speech Enhancement. Z Wei, L Hao, X Zhang Interspeech, 941-945, 2022 | 7 | 2022 |
Integrated speech enhancement method based on weighted prediction error and DNN for dereverberation and denoising H Li, X Zhang, H Zhang, G Gao arXiv preprint arXiv:1708.08251, 2017 | 6 | 2017 |
Speakerfilter-pro: an improved target speaker extractor combines the time domain and frequency domain S He, H Li, X Zhang 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 4 | 2022 |
Robust speech dereverberation based on WPE and deep learning H Li, X Zhang, G Gao 2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020 | 4 | 2020 |
Beamformed feature for learning-based dual-channel speech separation H Li, X Zhang, G Gao ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 4 | 2020 |
Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection T Xu, H Li, H Zhang, X Zhang 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 4 | 2019 |
Dynamic-attention based encoder-decoder model for speaker extraction with anchor speech H Li, X Zhang, G Gao 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 3 | 2019 |
Cross-Attention-Guided Wavenet for Mel Spectrogram Reconstruction in The ICASSP 2024 Auditory EEG Challenge Y Fang, H Li, X Zhang, F Chen, G Gao 2024 IEEE International Conference on Acoustics, Speech, and Signal …, 2024 | 2 | 2024 |
3S-TSE: Efficient three-stage target speaker extraction for real-time and low-resource applications S He, J Liu, H Li, Y Yang, F Chen, X Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
Cross-Subject Classification of Spoken Mandarin Vowels and Tones with EEG Signals: A Study of End-to-End CNN with Fine-Tuning X Wang, M Li, H Li, SH Pun, F Chen 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | 2 | 2023 |