How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics. J Park, S Takamichi, T Nakamura, K Seki, D Xin, H Saruwatari. Proc. INTERSPEECH 2023, 2023 | 3 | 2023 |
Analyzing The Language of Visual Tokens. DM Chan, R Corona, J Park, CJ Cho, Y Bai, T Darrell. arXiv preprint arXiv:2411.05001, 2024 | 1 | 2024 |
Do Learned Speech Symbols Follow Zipf’s Law? S Takamichi, H Maeda, J Park, D Saito, H Saruwatari. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Analytic Study of Text-Free Speech Synthesis for Raw Audio using a Self-Supervised Learning Model. J Park, D Saito, N Minematsu. Proc. APSIPA ASC 2024, 2024 | | 2024 |
A Pilot Study of GSLM-based Simulation of Foreign Accentuation Only Using Native Speech Corpora. K Onda, J Park, N Minematsu, D Saito. Proc. INTERSPEECH 2024, 2024 | | 2024 |