Ruchao Fan
Speech Scientist, Microsoft
Verified email at microsoft.com
Title
Cited by
Year
An online attention-based model for speech recognition
R Fan, P Zhou, W Chen, J Jia, G Liu
Proc. Interspeech 2019, 4390-4394, 2019
Cited by: 61; Year: 2019
CASS-NAT: CTC alignment-based single step non-autoregressive transformer for speech recognition
R Fan, W Chu, P Chang, J Xiao
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 47; Year: 2021
Towards better domain adaptation for self-supervised models: A case study of child ASR
R Fan, Y Zhu, J Wang, A Alwan
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1242-1252, 2022
Cited by: 35; Year: 2022
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR
R Fan, A Alwan
Proc. Interspeech 2022, 4900-4904, 2022
Cited by: 35; Year: 2022
Improving generalization of transformer for speech recognition with parallel schedule sampling and relative positional embedding
P Zhou, R Fan, W Chen, J Jia
arXiv preprint arXiv:1911.00203, 2019
Cited by: 32; Year: 2019
Fundamental frequency feature normalization and data augmentation for child speech recognition
G Yeung, R Fan, A Alwan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 29; Year: 2021
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
R Fan, W Chu, P Chang, J Xiao, A Alwan
Proc. Interspeech 2021, 3715-3719, 2021
Cited by: 23; Year: 2021
A CTC alignment-based non-autoregressive transformer for end-to-end automatic speech recognition
R Fan, W Chu, P Chang, A Alwan
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1436-1448, 2023
Cited by: 18; Year: 2023
Bi-APC: Bidirectional autoregressive predictive coding for unsupervised pre-training and its application to children's ASR
R Fan, A Afshan, A Alwan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 18; Year: 2021
Exploring the use of an unsupervised autoregressive model as a shared encoder for text-dependent speaker verification
V Ravi, R Fan, A Afshan, H Lu, A Alwan
Proc. Interspeech 2020, 766-770, 2020
Cited by: 16; Year: 2020
LPC augment: an LPC-based ASR data augmentation algorithm for low and zero-resource children's dialects
A Johnson, R Fan, R Morris, A Alwan
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 15; Year: 2022
Fundamental frequency feature warping for frequency normalization and data augmentation in child automatic speech recognition
G Yeung, R Fan, A Alwan
Speech Communication 135, 1-10, 2021
Cited by: 13; Year: 2021
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System
J Wang, Y Zhu, R Fan, W Chu, A Alwan
Proc. Interspeech 2021, 1279-1283, 2021
Cited by: 13; Year: 2021
CTCBERT: Advancing hidden-unit BERT with CTC objectives
R Fan, Y Wang, Y Gaur, J Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 10; Year: 2023
CNN-based audio front end processing on speech recognition
R Fan, G Liu
2018 International Conference on Audio, Language and Image Processing …, 2018
Cited by: 9; Year: 2018
Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models
R Fan, NB Shankar, A Alwan
Proc. Interspeech 2024, 5173-5177, 2024
Cited by: 8; Year: 2024
Acoustic-aware non-autoregressive spell correction with mask sample decoding
R Fan, G Ye, Y Gaur, J Li
arXiv preprint arXiv:2210.08665, 2022
Cited by: 5; Year: 2022
Towards better meta-initialization with task augmentation for kindergarten-aged speech recognition
Y Zhu, R Fan, A Alwan
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 5; Year: 2022
UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models
R Fan, NB Shankar, A Alwan
IEEE Signal Processing Letters 31, 711-715, 2024
Cited by: 2; Year: 2024
Research on end-to-end speech recognition [D]
R Fan
Beijing University of Posts and Telecommunications, 2-5, 2019
Cited by: 2*; Year: 2019