Self-supervised speaker verification using dynamic loss-gate and label correction B Han, Z Chen, Y Qian arXiv preprint arXiv:2208.01928, 2022 | 38 | 2022 |
Local information modeling with self-attention for speaker verification B Han, Z Chen, Y Qian ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 28 | 2022 |
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design. B Liu, Z Chen, S Wang, H Wang, B Han, Y Qian INTERSPEECH, 296-300, 2022 | 27 | 2022 |
Self-supervised learning with cluster-aware-dino for high-performance robust speaker verification B Han, Z Chen, Y Qian IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 529-541, 2023 | 25 | 2023 |
Autoregressive speech synthesis without vector quantization L Meng, L Zhou, S Liu, S Chen, B Han, S Hu, Y Liu, J Li, S Zhao, X Wu, ... arXiv preprint arXiv:2407.08551, 2024 | 24 | 2024 |
The sjtu x-lance lab system for cnsrc 2022 Z Chen, B Liu, B Han, L Zhang, Y Qian arXiv preprint arXiv:2206.11699, 2022 | 23 | 2022 |
Attention-based encoder-decoder end-to-end neural diarization with embedding enhancer Z Chen, B Han, S Wang, Y Qian IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 1636-1649, 2024 | 22 | 2024 |
Attention-based encoder-decoder network for end-to-end neural speaker diarization with target speaker attractor Z Chen, B Han, S Wang, Y Qian arXiv preprint arXiv:2305.10704, 2023 | 20 | 2023 |
Instructme: An instruction guided music edit and remix framework with latent diffusion models B Han, J Dai, W Hao, X He, D Guo, J Chen, Y Wang, Y Qian, X Song arXiv preprint arXiv:2308.14360, 2023 | 17 | 2023 |
VALL-E R: Robust and efficient zero-shot text-to-speech synthesis via monotonic alignment B Han, L Zhou, S Liu, S Chen, L Meng, Y Qian, Y Liu, S Zhao, J Li, F Wei arXiv preprint arXiv:2406.07855, 2024 | 16 | 2024 |
Unsupervised anomalous detection based on unsupervised pretrained models Z Lv, B Han, Z Chen, Y Qian, J Ding, J Liu Tech. Rep., Technical report, DCASE2023 Challenge, 2023 | 16 | 2023 |
A comprehensive study on self-supervised distillation for speaker representation learning Z Chen, Y Qian, B Han, Y Qian, M Zeng 2022 IEEE Spoken Language Technology Workshop (SLT), 599-604, 2023 | 16 | 2023 |
Mlp-svnet: A multi-layer perceptrons based network for speaker verification B Han, Z Chen, B Liu, Y Qian ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 16 | 2022 |
Build a sre challenge system: Lessons from voxsrc 2022 and cnsrc 2022 Z Chen, B Han, X Xiang, H Huang, B Liu, Y Qian arXiv preprint arXiv:2211.00815, 2022 | 15 | 2022 |
Advancing speaker embedding learning: Wespeaker toolkit for research and production S Wang, Z Chen, B Han, H Wang, C Liang, B Zhang, X Xiang, W Ding, ... Speech Communication 162, 103104, 2024 | 11 | 2024 |
Exploring large scale pre-trained models for robust machine anomalous sound detection B Han, Z Lv, A Jiang, W Huang, Z Chen, Y Deng, J Ding, C Lu, WQ Zhang, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 11 | 2024 |
Exploring binary classification loss for speaker verification B Han, Z Chen, Y Qian ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
Synaug: Synthesis-based data augmentation for text-dependent speaker verification C Du, B Han, S Wang, Y Qian, K Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 10 | 2021 |
Sjtu-aispeech system for voxceleb speaker recognition challenge 2022 Z Chen, B Han, X Xiang, H Huang, B Liu, Y Qian arXiv preprint arXiv:2209.09076, 2022 | 8 | 2022 |
The SJTU System for Short-Duration Speaker Verification Challenge 2021 B Han, Z Chen, Z Zhou, Y Qian Interspeech, 2332-2336, 2021 | 8 | 2021 |