Följ
Ye Bai (白 烨)
Ye Bai (白 烨)
Bytedance Inc.
Verifierad e-postadress på bytedance.com - Startsida
Titel
Citeras av
Citeras av
År
Add 2022: the first audio deep synthesis detection challenge
J Yi, R Fu, J Tao, S Nie, H Ma, C Wang, T Wang, Z Tian, Y Bai, C Fan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
2072022
Half-truth: A partially fake audio detection dataset
J Yi, Y Bai, J Tao, H Ma, Z Tian, C Wang, T Wang, R Fu
arXiv preprint arXiv:2104.03617, 2021
942021
Self-attention transducers for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, Z Wen
arXiv preprint arXiv:1909.13037, 2019
832019
Synchronous transformers for end-to-end speech recognition
Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
802020
Fast end-to-end speech recognition via non-autoregressive models and cross-modal knowledge transferring from BERT
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1897-1911, 2021
712021
Language-adversarial transfer learning for low-resource speech recognition
J Yi, J Tao, Z Wen, Y Bai
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (3), 621-630, 2018
702018
Spike-triggered non-autoregressive transformer for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, S Zhang, Z Wen
arXiv preprint arXiv:2005.07903, 2020
672020
Listen attentively, and spell once: Whole sentence generation via a non-autoregressive architecture for low-latency speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
arXiv preprint arXiv:2005.04862, 2020
472020
Adversarial transfer learning for punctuation restoration
J Yi, J Tao, Y Bai, Z Tian, C Fan
arXiv preprint arXiv:2004.00248, 2020
452020
Continual learning for fake audio detection
H Ma, J Yi, J Tao, Y Bai, Z Tian, C Wang
arXiv preprint arXiv:2104.07286, 2021
422021
Learn spelling from teachers: Transferring knowledge from language models to sequence-to-sequence speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen
arXiv preprint arXiv:1907.06017, 2019
412019
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting.
Y Bai, J Yi, J Tao, Z Wen, Z Tian, C Zhao, C Fan
INTERSPEECH, 2190-2194, 2019
372019
Adversarial multilingual training for low-resource speech recognition
J Yi, J Tao, Z Wen, Y Bai
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
312018
End-to-end keywords spotting based on connectionist temporal classification for mandarin
Y Bai, J Yi, H Ni, Z Wen, B Liu, Y Li, J Tao
2016 10th international symposium on Chinese spoken language processing …, 2016
312016
Rnn-transducer with language bias for end-to-end mandarin-english code-switching speech recognition
S Zhang, J Yi, Z Tian, J Tao, Y Bai
2021 12th international symposium on Chinese spoken language processing …, 2021
302021
Deep imitator: Handwriting calligraphy imitation via deep attention networks
B Zhao, J Tao, M Yang, Z Tian, C Fan, Y Bai
Pattern Recognition 104, 107080, 2020
302020
Focal Loss for Punctuation Prediction.
J Yi, J Tao, Z Tian, Y Bai, C Fan
Interspeech, 721-725, 2020
272020
Polyvoice: Language models for speech to speech translation
Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ...
arXiv preprint arXiv:2306.02982, 2023
242023
Fsr: Accelerating the inference process of transducer-based models by applying fast-skip regularization
Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen
arXiv preprint arXiv:2104.02882, 2021
172021
Noise prior knowledge learning for speech enhancement via gated convolutional generative adversarial network
C Fan, B Liu, J Tao, J Yi, Z Wen, Y Bai
2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019
172019
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20