Підписатись
Ann Lee
Ann Lee
Meta AI
Підтверджена електронна адреса в csail.mit.edu
Назва
Посилання
Посилання
Рік
Voxpopuli: A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
C Wang, M Rivière, A Lee, A Wu, C Talnikar, D Haziza, M Williamson, ...
arXiv preprint arXiv:2101.00390, 2021
5062021
Self-training for end-to-end speech recognition
J Kahn, A Lee, A Hannun
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2692020
Robust wav2vec 2.0: Analyzing domain shift in self-supervised pre-training
WN Hsu, A Sriram, A Baevski, T Likhomanenko, Q Xu, V Pratap, J Kahn, ...
arXiv preprint arXiv:2104.01027, 2021
2592021
Direct speech-to-speech translation with discrete units
A Lee, PJ Chen, C Wang, J Gu, S Popuri, X Ma, A Polyak, Y Adi, Q He, ...
arXiv preprint arXiv:2107.05604, 2021
1722021
Textless speech-to-speech translation on real data
A Lee, H Gong, PA Duquenne, H Schwenk, PJ Chen, C Wang, S Popuri, ...
arXiv preprint arXiv:2112.08352, 2021
1472021
Text-free prosody-aware generative spoken language modeling
E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ...
arXiv preprint arXiv:2109.03264, 2021
1292021
Sequence-to-sequence speech recognition with time-depth separable convolutions
A Hannun, A Lee, Q Xu, R Collobert
arXiv preprint arXiv:1904.02619, 2019
1212019
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ...
arXiv preprint arXiv:2308.11596, 2023
1142023
Seamless: Multilingual Expressive and Streaming Speech Translation
L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ...
arXiv preprint arXiv:2312.05187, 2023
1092023
A comparison-based approach to mispronunciation detection
A Lee, J Glass
2012 IEEE Spoken Language Technology Workshop (SLT), 382-387, 2012
762012
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation
S Popuri, PJ Chen, C Wang, J Pino, Y Adi, J Gu, WN Hsu, A Lee
arXiv preprint arXiv:2204.02967, 2022
662022
Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams
A Lee, Y Zhang, J Glass
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
652013
Discriminative Reranking for Neural Machine Translation
A Lee, M Auli, MA Ranzato
Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021
502021
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
H Inaguma, S Popuri, I Kulikov, PJ Chen, C Wang, YA Chung, Y Tang, ...
arXiv preprint arXiv:2212.08055, 2022
462022
Semi-supervised speech recognition via local prior matching
WN Hsu, A Lee, G Synnaeve, A Hannun
arXiv preprint arXiv:2002.10336, 2020
442020
Exploiting depth and highway connections in convolutional recurrent deep neural networks for speech recognition
WN Hsu, Y Zhang, A Lee, J Glass
cell 50 (1), 2016
422016
Mispronunciation detection without nonnative training data
A Lee, J Glass
Sixteenth Annual Conference of the International Speech Communication …, 2015
342015
fairseq S^ 2: A Scalable and Integrable Speech Synthesis Toolkit
C Wang, WN Hsu, Y Adi, A Polyak, A Lee, PJ Chen, J Gu, J Pino
arXiv preprint arXiv:2109.06912, 2021
332021
Pronunciation assessment via a comparison-based system
A Lee, J Glass
Speech and Language Technology in Education, 2013
332013
Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery
A Lee, NF Chen, J Glass
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
322016
У даний момент система не може виконати операцію. Спробуйте пізніше.
Статті 1–20