DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT HJ Chang, S Yang, H Lee ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 173 | 2022 |
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ... Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 106 | 2022 |
Towards Lifelong Learning of End-to-end ASR HJ Chang, H Lee, L Lee Proc. Interspeech 2021, 2021 | 41 | 2021 |
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model YJ Shih, HF Wang, HJ Chang, L Berry, H Lee, D Harwath 2022 IEEE Spoken Language Technology Workshop (SLT), 715-722, 2023 | 37 | 2023 |
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning AH Liu, HJ Chang, M Auli, WN Hsu, J Glass Advances in Neural Information Processing Systems 36, 2024 | 19 | 2024 |
Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models LH Tseng, YK Fu, HJ Chang, H Lee arXiv preprint arXiv:2110.03504, 2021 | 19 | 2021 |
End-to-end Whispered Speech Recognition with Frequency-weighted Approaches and Pseudo Whisper Pre-training HJ Chang, AH Liu, H Lee, L Lee 2021 IEEE Spoken Language Technology Workshop (SLT), 2021 | 19* | 2021 |
A Large-Scale Evaluation of Speech Foundation Models S Yang, HJ Chang, Z Huang, AT Liu, CI Lai, H Wu, J Shi, X Chang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 18 | 2024 |
Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering HJ Chang, AH Liu, J Glass Proc. Interspeech 2023, 2023 | 17 | 2023 |
Non-autoregressive Mandarin-English Code-switching Speech Recognition SP Chuang, HJ Chang, SF Huang, H Lee 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021 | 14 | 2021 |
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval L Berry, YJ Shih, HF Wang, HJ Chang, H Lee, D Harwath ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
CoLLD: Contrastive Layer-to-Layer Distillation for Compressing Multilingual Pre-Trained Speech Encoders HJ Chang, N Dong, R Mavlyutov, S Popuri, YA Chung ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models HJ Chang, H Gong, C Wang, J Glass, YA Chung arXiv preprint arXiv:2410.24177, 2024 | 2 | 2024 |
SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data HF Wang, YJ Shih, HJ Chang, L Berry, P Peng, H Lee, HM Wang, ... arXiv preprint arXiv:2402.06959, 2024 | 2 | 2024 |
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces HJ Chang, J Glass Proceedings of the 2024 Conference of the North American Chapter of the …, 2023 | 1 | 2023 |
Perturbation-invariant Speech Representation Learning by Online Clustering HJ Chang Massachusetts Institute of Technology, 2024 | | 2024 |