Ruchao Fan

Cited by

	All	Since 2020
Citations	398	392
h-index	13	13
i10-index	14	14

Citations per year: 2019: 5, 2020: 20, 2021: 50, 2022: 72, 2023: 104, 2024: 131, 2025: 15

Public access

9 articles

0 articles*

Available

Not available

Based on funding mandates

Co-authors

Abeer Alwan, Professor of Electrical Engineering, UCLA; verified email at ee.ucla.edu
Wei Chu, Olewave; verified email at olewave.com
Peng Chang, PAII Inc.; verified email at paii-labs.com
Jia Jia (贾珈), Professor, 清华大学 (Tsinghua University); verified email at tsinghua.edu.cn
Pan Zhou, University of Science and Technology of China; verified email at mail.ustc.edu.cn
Jing Xiao, Group Chief Scientist, Ping An Insurance Group; verified email at pingan.com.cn
Amber Afshan, Applied Scientist, AWS; verified email at ucla.edu
Jinhan Wang, NVIDIA; verified email at nvidia.com
Jinyu Li, Partner Applied Science Manager, Microsoft; verified email at microsoft.com
Gary Yeung, University of California, Los Angeles; verified email at g.ucla.edu
Natarajan Balaji Shankar, Graduate Student in Electrical and Computer Engineering, University of California Los Angeles; verified email at ucla.edu
Yashesh Gaur, Meta, GenAI, Llama foundation models; verified email at cs.cmu.edu
VIJAY RAVI, AI Researcher, Amplifier Health; verified email at amplifierhealth.com
Yiming Wang, Microsoft; verified email at microsoft.com
Rui Zhao, Microsoft; verified email at microsoft.com
Matt Post, Microsoft Translator; verified email at cs.jhu.edu
Bo Ren, Microsoft Corporation; verified email at microsoft.com
Shujie Liu (刘树杰), Microsoft Research Asia; verified email at microsoft.com


Ruchao Fan

Speech Scientist, Microsoft

Verified email at microsoft.com

speech processing, representation learning, multi-modal LLM


Title	Cited by	Year
An online attention-based model for speech recognition. R Fan, P Zhou, W Chen, J Jia, G Liu. Proc. Interspeech 2019, 4390--4394, 2019	61	2019
CASS-NAT: CTC alignment-based single step non-autoregressive transformer for speech recognition. R Fan, W Chu, P Chang, J Xiao. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	47	2021
Towards better domain adaptation for self-supervised models: A case study of child ASR. R Fan, Y Zhu, J Wang, A Alwan. IEEE Journal of Selected Topics in Signal Processing 16 (6), 1242-1252, 2022	35	2022
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR. R Fan, A Alwan. Proc. Interspeech 2022, 4900--4904, 2022	35	2022
Improving generalization of transformer for speech recognition with parallel schedule sampling and relative positional embedding. P Zhou, R Fan, W Chen, J Jia. arXiv preprint arXiv:1911.00203, 2019	32	2019
Fundamental frequency feature normalization and data augmentation for child speech recognition. G Yeung, R Fan, A Alwan. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	29	2021
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition. R Fan, W Chu, P Chang, J Xiao, A Alwan. Proc. Interspeech 2021, 3715--3719, 2021	23	2021
A CTC alignment-based non-autoregressive transformer for end-to-end automatic speech recognition. R Fan, W Chu, P Chang, A Alwan. IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1436-1448, 2023	18	2023
Bi-apc: Bidirectional autoregressive predictive coding for unsupervised pre-training and its application to children’s asr. R Fan, A Afshan, A Alwan. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	18	2021
Exploring the use of an unsupervised autoregressive model as a shared encoder for text-dependent speaker verification. V Ravi, R Fan, A Afshan, H Lu, A Alwan. Proc. Interspeech 2020, 766--770, 2020	16	2020
LPC augment: an LPC-based ASR data augmentation algorithm for low and zero-resource children’s dialects. A Johnson, R Fan, R Morris, A Alwan. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	15	2022
Fundamental frequency feature warping for frequency normalization and data augmentation in child automatic speech recognition. G Yeung, R Fan, A Alwan. Speech Communication 135, 1-10, 2021	13	2021
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System. J Wang, Y Zhu, R Fan, W Chu, A Alwan. Proc. Interspeech 2021, 1279--1283, 2021	13	2021
CTCBERT: Advancing hidden-unit bert with CTC objectives. R Fan, Y Wang, Y Gaur, J Li. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	10	2023
CNN-based audio front end processing on speech recognition. R Fan, G Liu. 2018 International Conference on Audio, Language and Image Processing …, 2018	9	2018
Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models. R Fan, NB Shankar, A Alwan. Proc. Interspeech 2024, 5173-5177, 2024	8	2024
Acoustic-aware non-autoregressive spell correction with mask sample decoding. R Fan, G Ye, Y Gaur, J Li. arXiv preprint arXiv:2210.08665, 2022	5	2022
Towards better meta-initialization with task augmentation for kindergarten-aged speech recognition. Y Zhu, R Fan, A Alwan. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	5	2022
UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models. R Fan, NB Shankar, A Alwan. IEEE Signal Processing Letters 31, 711-715, 2024	2	2024
Research on end-to-end speech recognition [D]. R Fan. Beijing University of Posts and Telecommunications, 2-5, 2019	2*	2019
