Accent-robust automatic speech recognition using supervised and unsupervised wav2vec embeddings J Li, V Manohar, P Chitkara, A Tjandra, M Picheny, F Zhang, X Zhang, ... arXiv preprint arXiv:2110.03520, 2021 | 18 | 2021 |
Analysis of acoustic and voice quality features for the classification of infant and mother vocalizations J Li, M Hasegawa-Johnson, NL McElwain Speech communication 133, 41-61, 2021 | 17 | 2021 |
Towards robust family-infant audio analysis based on unsupervised pretraining of wav2vec 2.0 on large-scale unlabeled family audio J Li, M Hasegawa-Johnson, NL McElwain Interspeech 2023, 2023 | 12 | 2023 |
Autosegmental neural nets: Should phones and tones be synchronous or asynchronous? J Li, M Hasegawa-Johnson Interspeech 2020, 2020 | 9 | 2020 |
Listen, decipher and sign: Toward unsupervised speech-to-sign language recognition L Wang, J Ni, H Gao, J Li, KC Chang, X Fan, J Wu, M Hasegawa-Johnson, ... Findings of the Association for Computational Linguistics: ACL 2023, 6785-6800, 2023 | 7 | 2023 |
An embodied, platform-invariant architecture for connecting high-level spatial commands to platform articulation AJ Sher, U Huzaifa, J Li, V Jain, A Zurawski, A LaViers Robotics and Autonomous Systems 119, 263-277, 2019 | 5 | 2019 |
Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations J Li, M Hasegawa-Johnson, NL McElwain ICASSP 2024 SASB Workshop, 2024 | 4 | 2024 |
Preliminary technical validation of LittleBeats™: A multimodal sensing platform to capture cardiac physiology, motion, and vocalizations B Islam, NL McElwain, J Li, MI Davila, Y Hu, K Hu, JM Bodway, A Dhekne, ... Sensors 24 (3), 901, 2024 | 4 | 2024 |
Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis J Li, M Hasegawa-Johnson, K Karahalios arXiv preprint arXiv:2309.07287, 2023 | 2* | 2023 |
Sound tagging in infant-centric home soundscapes MNH Khan, J Li, NL McElwain, M Hasegawa–Johnson, B Islam 2024 IEEE/ACM Conference on Connected Health: Applications, Systems and …, 2024 | 1 | 2024 |
Autosegmental Neural Nets 2.0: An Extensive Study of Training Synchronous and Asynchronous Phones and Tones for Under-Resourced Tonal Languages J Li, M Hasegawa-Johnson IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1918-1926, 2022 | 1 | 2022 |
Visualizations of Complex Sequences of Family-Infant Vocalizations Using Bag-of-Audio-Words Approach Based on Wav2vec 2.0 Features J Li, M Hasegawa-Johnson, NL McElwain arXiv preprint arXiv:2203.15183, 2022 | 1 | 2022 |
A comparable phone set for the timit dataset discovered in clustering of listen, attend and spell J Li, M Hasegawa-Johnson NIPS 2018 Workshop IRASL, 2018 | 1 | 2018 |
Breaking down barriers: advancing interdisciplinary speech applications in early children’s development J Li University of Illinois at Urbana-Champaign, 2024 | | 2024 |