Recent developments on ESPnet toolkit boosted by Conformer P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 304 | 2021 |
ESPnet-SLU: Advancing spoken language understanding through ESPnet S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 81 | 2022 |
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification. J Villalba, Y Zhang, N Dehak Interspeech, 4233-4237, 2020 | 55 | 2020 |
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples. Y Zhang, Z Jiang, J Villalba, N Dehak Interspeech, 4238-4242, 2020 | 52 | 2020 |
SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition PK O'Neill, V Lavrukhin, S Majumdar, V Noroozi, Y Zhang, O Kuchaiev, ... arXiv preprint arXiv:2104.02014, 2021 | 47 | 2021 |
Tiny transducer: A highly-efficient speech recognition model on edge devices Y Zhang, S Sun, L Ma ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 34 | 2021 |
Sequence-to-sequence singing voice synthesis with perceptual entropy loss J Shi, S Guo, N Huo, Y Zhang, Q Jin ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 30 | 2021 |
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch J Hwang, M Hira, C Chen, X Zhang, Z Ni, G Sun, P Ma, R Huang, V Pratap, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-9, 2023 | 16 | 2023 |
TrimTail: Low-latency streaming ASR with simple but effective spectrogram-level length penalty X Song, D Wu, Z Wu, B Zhang, Y Zhang, Z Peng, W Li, F Pan, C Zhu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch X Song, M Xing, C Ma, S Li, D Wu, B Zhang, F Pan, D Zhou, Y Zhang, ... arXiv preprint arXiv:2412.08237, 2024 | 2 | 2024 |