Følg
Ye Bai (白 烨)
Ye Bai (白 烨)
Bytedance Inc.
Verificeret mail på bytedance.com - Startside
Titel
Citeret af
Citeret af
År
Add 2022: the first audio deep synthesis detection challenge
J Yi, R Fu, J Tao, S Nie, H Ma, C Wang, T Wang, Z Tian, Y Bai, C Fan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
2062022
Half-truth: A partially fake audio detection dataset
J Yi, Y Bai, J Tao, H Ma, Z Tian, C Wang, T Wang, R Fu
arXiv preprint arXiv:2104.03617, 2021
922021
Synchronous transformers for end-to-end speech recognition
Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
802020
Self-attention transducers for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, Z Wen
arXiv preprint arXiv:1909.13037, 2019
802019
Fast end-to-end speech recognition via non-autoregressive models and cross-modal knowledge transferring from BERT
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1897-1911, 2021
712021
Language-adversarial transfer learning for low-resource speech recognition
J Yi, J Tao, Z Wen, Y Bai
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (3), 621-630, 2018
682018
Spike-triggered non-autoregressive transformer for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, S Zhang, Z Wen
arXiv preprint arXiv:2005.07903, 2020
672020
Listen attentively, and spell once: Whole sentence generation via a non-autoregressive architecture for low-latency speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
arXiv preprint arXiv:2005.04862, 2020
462020
Adversarial transfer learning for punctuation restoration
J Yi, J Tao, Y Bai, Z Tian, C Fan
arXiv preprint arXiv:2004.00248, 2020
462020
Continual learning for fake audio detection
H Ma, J Yi, J Tao, Y Bai, Z Tian, C Wang
arXiv preprint arXiv:2104.07286, 2021
422021
Learn spelling from teachers: Transferring knowledge from language models to sequence-to-sequence speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen
arXiv preprint arXiv:1907.06017, 2019
412019
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting.
Y Bai, J Yi, J Tao, Z Wen, Z Tian, C Zhao, C Fan
INTERSPEECH, 2190-2194, 2019
372019
Rnn-transducer with language bias for end-to-end mandarin-english code-switching speech recognition
S Zhang, J Yi, Z Tian, J Tao, Y Bai
2021 12th international symposium on Chinese spoken language processing …, 2021
312021
Adversarial multilingual training for low-resource speech recognition
J Yi, J Tao, Z Wen, Y Bai
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
312018
End-to-end keywords spotting based on connectionist temporal classification for mandarin
Y Bai, J Yi, H Ni, Z Wen, B Liu, Y Li, J Tao
2016 10th international symposium on Chinese spoken language processing …, 2016
312016
Deep imitator: Handwriting calligraphy imitation via deep attention networks
B Zhao, J Tao, M Yang, Z Tian, C Fan, Y Bai
Pattern Recognition 104, 107080, 2020
302020
Focal Loss for Punctuation Prediction.
J Yi, J Tao, Z Tian, Y Bai, C Fan
Interspeech, 721-725, 2020
272020
Polyvoice: Language models for speech to speech translation
Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ...
arXiv preprint arXiv:2306.02982, 2023
242023
Fsr: Accelerating the inference process of transducer-based models by applying fast-skip regularization
Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen
arXiv preprint arXiv:2104.02882, 2021
162021
Utterance-level permutation invariant training with discriminative learning for single channel speech separation
C Fan, B Liu, J Tao, Z Wen, J Yi, Y Bai
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
162018
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–20