Ye Bai (白 烨)

Cited by

	All	Since 2020
Citations	1222	1197
h-index	18	18
i10-index	28	28

Citations per year

Year	Citations
2018	7
2019	15
2020	111
2021	191
2022	230
2023	283
2024	335
2025	45

Public access

18 articles available
4 articles not available

Based on funding mandates

Co-authors

Jianhua TAO, Tsinghua University (verified email at tsinghua.edu.cn)
Jiangyan Yi, Tsinghua University (verified email at tsinghua.edu.cn)
Zhengkun Tian, Meituan Inc. (verified email at meituan.com)
Cunhang Fan（范存航）, School of Computer Science and Technology, Anhui University (verified email at ahu.edu.cn)
Ya LI, Associate Professor, Beijing University of Posts and Telecommunications (BUPT) (verified email at bupt.edu.cn)
BO ZHAO, 西安电子科技大学 (verified email at xidian.edu.cn)

Ye Bai (白烨)

Bytedance Inc.

Verified email at bytedance.com

Speech Recognition, Language Modeling, Audio Fake Detection


Title	Authors	Venue	Cited by	Year
ADD 2022: the first audio deep synthesis detection challenge	J Yi, R Fu, J Tao, S Nie, H Ma, C Wang, T Wang, Z Tian, Y Bai, C Fan, ...	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	206	2022
Half-truth: A partially fake audio detection dataset	J Yi, Y Bai, J Tao, H Ma, Z Tian, C Wang, T Wang, R Fu	arXiv preprint arXiv:2104.03617, 2021	92	2021
Synchronous transformers for end-to-end speech recognition	Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	80	2020
Self-attention transducers for end-to-end speech recognition	Z Tian, J Yi, J Tao, Y Bai, Z Wen	arXiv preprint arXiv:1909.13037, 2019	80	2019
Fast end-to-end speech recognition via non-autoregressive models and cross-modal knowledge transferring from BERT	Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang	IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1897-1911, 2021	71	2021
Language-adversarial transfer learning for low-resource speech recognition	J Yi, J Tao, Z Wen, Y Bai	IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (3), 621-630, 2018	68	2018
Spike-triggered non-autoregressive transformer for end-to-end speech recognition	Z Tian, J Yi, J Tao, Y Bai, S Zhang, Z Wen	arXiv preprint arXiv:2005.07903, 2020	67	2020
Listen attentively, and spell once: Whole sentence generation via a non-autoregressive architecture for low-latency speech recognition	Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang	arXiv preprint arXiv:2005.04862, 2020	46	2020
Adversarial transfer learning for punctuation restoration	J Yi, J Tao, Y Bai, Z Tian, C Fan	arXiv preprint arXiv:2004.00248, 2020	46	2020
Continual learning for fake audio detection	H Ma, J Yi, J Tao, Y Bai, Z Tian, C Wang	arXiv preprint arXiv:2104.07286, 2021	42	2021
Learn spelling from teachers: Transferring knowledge from language models to sequence-to-sequence speech recognition	Y Bai, J Yi, J Tao, Z Tian, Z Wen	arXiv preprint arXiv:1907.06017, 2019	41	2019
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting	Y Bai, J Yi, J Tao, Z Wen, Z Tian, C Zhao, C Fan	INTERSPEECH, 2190-2194, 2019	37	2019
RNN-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition	S Zhang, J Yi, Z Tian, J Tao, Y Bai	2021 12th international symposium on Chinese spoken language processing …, 2021	31	2021
Adversarial multilingual training for low-resource speech recognition	J Yi, J Tao, Z Wen, Y Bai	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	31	2018
End-to-end keywords spotting based on connectionist temporal classification for Mandarin	Y Bai, J Yi, H Ni, Z Wen, B Liu, Y Li, J Tao	2016 10th international symposium on Chinese spoken language processing …, 2016	31	2016
Deep imitator: Handwriting calligraphy imitation via deep attention networks	B Zhao, J Tao, M Yang, Z Tian, C Fan, Y Bai	Pattern Recognition 104, 107080, 2020	30	2020
Focal Loss for Punctuation Prediction	J Yi, J Tao, Z Tian, Y Bai, C Fan	Interspeech, 721-725, 2020	27	2020
PolyVoice: Language models for speech to speech translation	Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ...	arXiv preprint arXiv:2306.02982, 2023	24	2023
FSR: Accelerating the inference process of transducer-based models by applying fast-skip regularization	Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen	arXiv preprint arXiv:2104.02882, 2021	16	2021
Utterance-level permutation invariant training with discriminative learning for single channel speech separation	C Fan, B Liu, J Tao, Z Wen, J Yi, Y Bai	2018 11th International Symposium on Chinese Spoken Language Processing …, 2018	16	2018
