Kwanghee Choi
Carnegie Mellon University, Language Technologies Institute
Verified email at andrew.cmu.edu
Title
Cited by
Year
Disentangling label distribution for long-tailed visual recognition
Y Hong, S Han, K Choi, S Seo, B Kim, B Chang
CVPR 2021, 6626-6636, 2021
Cited by: 298; Year: 2021
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study
X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 37; Year: 2024
OWSM v3.1: Better and faster open Whisper-style speech models based on E-Branchformer
Y Peng, J Tian, W Chen, S Arora, B Yan, Y Sudo, M Shakeel, K Choi, ...
arXiv preprint arXiv:2401.16658, 2024
Cited by: 36; Year: 2024
Temporal Knowledge Distillation for On-device Audio Classification
K Choi, M Kersner, J Morton, B Chang
ICASSP 2022, 486-490, 2022
Cited by: 26; Year: 2022
Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection
SM Kye, K Choi, J Yi, B Chang
ECCV 2022, 2022
Cited by: 22; Year: 2022
Opening the black box of wav2vec feature encoder
K Choi, EJ Yeo
arXiv preprint arXiv:2210.15386, 2022
Cited by: 19; Year: 2022
Combating the Instability of Mutual Information-based Losses via Regularization
K Choi, S Lee
UAI 2022, 2022
Cited by: 18*; Year: 2022
EVCMR: a tool for the quantitative evaluation and visualization of cardiac MRI data
YC Kim, KR Kim, K Choi, M Kim, Y Chung, YH Choe
Computers in Biology and Medicine 111, 103334, 2019
Cited by: 16; Year: 2019
Distilling a Pretrained Language Model to a Multilingual ASR Model
K Choi, HM Park
Interspeech 2022, 2203-2207, 2022
Cited by: 15; Year: 2022
Automatic severity classification of dysarthric speech by using self-supervised model with multi-task learning
EJ Yeo, K Choi, S Kim, M Chung
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 14*; Year: 2023
Self-supervised speech representations are more phonetic than semantic
K Choi, A Pasad, T Nakamura, S Fukayama, K Livescu, S Watanabe
arXiv preprint arXiv:2406.08619, 2024
Cited by: 13; Year: 2024
TiDAL: Learning training dynamics for active learning
SM Kye, K Choi, H Byun, B Chang
Proceedings of the IEEE/CVF international conference on computer vision …, 2023
Cited by: 13; Year: 2023
Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil
EJ Yeo, K Choi, S Kim, M Chung
APSIPA 2022, 2022
Cited by: 9; Year: 2022
OLKAVS: an open large-scale Korean audio-visual speech dataset
J Park, JW Hwang, K Choi, SH Lee, JH Ahn, RH Park, HM Park
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 7; Year: 2024
Reliable decision from multiple subtasks through threshold optimization: Content moderation in the wild
D Son, B Lew, K Choi, Y Baek, S Choi, B Shin, S Ha, B Chang
WSDM 2023, 2022
Cited by: 7; Year: 2022
Speech intelligibility assessment of dysarthric speech by using goodness of pronunciation with uncertainty quantification
EJ Yeo, K Choi, S Kim, M Chung
arXiv preprint arXiv:2305.18392, 2023
Cited by: 6; Year: 2023
On the effects of heterogeneous data sources on speech-to-text foundation models
J Tian, Y Peng, W Chen, K Choi, K Livescu, S Watanabe
arXiv preprint arXiv:2406.09282, 2024
Cited by: 5; Year: 2024
Dynamic-SUPERB Phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks
C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ...
arXiv preprint arXiv:2411.05361, 2024
Cited by: 4; Year: 2024
Understanding probe behaviors through variational bounds of mutual information
K Choi, J Jung, S Watanabe
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 4; Year: 2024
Correcting faulty road maps by image inpainting
S Hong, K Choi
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 3*; Year: 2024