Add 2022: the first audio deep synthesis detection challenge J Yi, R Fu, J Tao, S Nie, H Ma, C Wang, T Wang, Z Tian, Y Bai, C Fan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 202 | 2022 |
Gated recurrent fusion with joint training framework for robust end-to-end speech recognition C Fan, J Yi, J Tao, Z Tian, B Liu, Z Wen IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 198-209, 2020 | 93 | 2020 |
Half-truth: A partially fake audio detection dataset J Yi, Y Bai, J Tao, H Ma, Z Tian, C Wang, T Wang, R Fu arXiv preprint arXiv:2104.03617, 2021 | 92 | 2021 |
Self-attention transducers for end-to-end speech recognition Z Tian, J Yi, J Tao, Y Bai, Z Wen Interspeech 2019, 4395--4399, 2019 | 83 | 2019 |
Synchronous Transformers for End-to-End Speech Recognition Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen ICASSP 2020, 2019 | 79 | 2019 |
Fast end-to-end speech recognition via non-autoregressive models and cross-modal knowledge transferring from BERT Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1897-1911, 2021 | 71 | 2021 |
Spike-triggered non-autoregressive transformer for end-to-end speech recognition Z Tian, J Yi, J Tao, Y Bai, S Zhang, Z Wen arXiv preprint arXiv:2005.07903, 2020 | 67 | 2020 |
Listen attentively, and spell once: Whole sentence generation via a non-autoregressive architecture for low-latency speech recognition Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang arXiv preprint arXiv:2005.04862, 2020 | 47 | 2020 |
Adversarial transfer learning for punctuation restoration J Yi, J Tao, Y Bai, Z Tian, C Fan arXiv preprint arXiv:2004.00248, 2020 | 45 | 2020 |
A large-scale Chinese multimodal NER dataset with speech clues D Sui, Z Tian, Y Chen, K Liu, J Zhao Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 42 | 2021 |
Continual learning for fake audio detection H Ma, J Yi, J Tao, Y Bai, Z Tian, C Wang arXiv preprint arXiv:2104.07286, 2021 | 42 | 2021 |
Learn spelling from teachers: Transferring knowledge from language models to sequence-to-sequence speech recognition Y Bai, J Yi, J Tao, Z Tian, Z Wen arXiv preprint arXiv:1907.06017, 2019 | 41 | 2019 |
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting. Y Bai, J Yi, J Tao, Z Wen, Z Tian, C Zhao, C Fan INTERSPEECH, 2190-2194, 2019 | 37 | 2019 |
Fully automated end-to-end fake audio detection C Wang, J Yi, J Tao, H Sun, X Chen, Z Tian, H Ma, C Fan, R Fu Proceedings of the 1st International Workshop on Deepfake Detection for …, 2022 | 34 | 2022 |
Rnn-transducer with language bias for end-to-end mandarin-english code-switching speech recognition S Zhang, J Yi, Z Tian, J Tao, Y Bai 2021 12th international symposium on Chinese spoken language processing …, 2021 | 30 | 2021 |
Deep imitator: Handwriting calligraphy imitation via deep attention networks B Zhao, J Tao, M Yang, Z Tian, C Fan, Y Bai Pattern Recognition 104, 107080, 2020 | 29 | 2020 |
Focal Loss for Punctuation Prediction. J Yi, J Tao, Z Tian, Y Bai, C Fan Interspeech, 721-725, 2020 | 26 | 2020 |
Hybrid autoregressive and non-autoregressive transformer models for speech recognition Z Tian, J Yi, J Tao, S Zhang, Z Wen IEEE Signal Processing Letters 29, 762-766, 2022 | 25 | 2022 |
Fsr: Accelerating the inference process of transducer-based models by applying fast-skip regularization Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen arXiv preprint arXiv:2104.02882, 2021 | 17 | 2021 |
Reducing language context confusion for end-to-end code-switching automatic speech recognition S Zhang, J Yi, Z Tian, J Tao, YT Yeung, L Deng arXiv preprint arXiv:2201.12155, 2022 | 16 | 2022 |