Fre-GAN: Adversarial frequency-consistent audio synthesis JH Kim, SH Lee, JH Lee, SW Lee arXiv preprint arXiv:2106.02297, 2021 | 68 | 2021 |
Multi-SpectroGAN: High-diversity and high-fidelity spectrogram generation with adversarial style combination for speech synthesis SH Lee, HW Yoon, HR Noh, JH Kim, SW Lee Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13198 …, 2021 | 64 | 2021 |
VoiceMixer: Adversarial voice style mixup SH Lee, JH Kim, H Chung, SW Lee Advances in Neural Information Processing Systems 34, 294-308, 2021 | 43 | 2021 |
PVAE-TTS: Adaptive text-to-speech via progressive style adaptation JH Lee, SH Lee, JH Kim, SW Lee ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 17 | 2022 |
Fre-GAN 2: Fast and efficient frequency-consistent audio synthesis SH Lee, JH Kim, KE Lee, SW Lee ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 15 | 2022 |
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe Interspeech, 16-20, 2022 | 11 | 2022 |
CrossSpeech: Speaker-independent acoustic representation for cross-lingual speech synthesis JH Kim, HS Yang, YC Ju, IH Kim, BY Kim ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
FreGrad: Lightweight and fast frequency-aware diffusion vocoder TD Nguyen, JH Kim, Y Jang, J Kim, JS Chung ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 7 | 2024 |
Faces that speak: Jointly synthesising talking face and speech from text Y Jang, JH Kim, J Ahn, D Kwak, HS Yang, YC Ju, IH Kim, BY Kim, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 7 | 2024 |
GC-TTS: Few-shot speaker adaptation with geometric constraints JH Kim, SH Lee, JH Lee, HG Jung, SW Lee 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC …, 2021 | 7 | 2021 |
FlowAVSE: Efficient audio-visual speech enhancement with conditional flow matching C Jung, S Lee, JH Kim, JS Chung arXiv preprint arXiv:2406.09286, 2024 | 3 | 2024 |
Let there be sound: reconstructing high quality speech from silent videos JH Kim, J Kim, JS Chung Proceedings of the AAAI Conference on Artificial Intelligence 38 (3), 2759-2767, 2024 | 3 | 2024 |
FACTSpeech: Speaking a foreign language pronunciation using only your native characters HS Yang, JH Kim, YC Ju, IH Kim, BY Kim, SJ Choi, HY Kim Proc. INTERSPEECH 2023, 606-610, 2023 | 3 | 2023 |
Text-to-speech synthesis in the wild J Jung, W Zhang, S Maiti, Y Wu, X Wang, JH Kim, Y Matsunaga, S Um, ... arXiv preprint arXiv:2409.08711, 2024 | 1 | 2024 |
VoxSim: A perceptual voice similarity dataset J Ahn, Y Kim, Y Choi, D Kwak, JH Kim, S Mun, JS Chung arXiv preprint arXiv:2407.18505, 2024 | 1 | 2024 |
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild J Jung, Y Wu, X Wang, JH Kim, S Maiti, Y Matsunaga, H Shim, J Tian, ... IEEE Open Journal of Signal Processing, 2025 | | 2025 |
AdaptVC: High Quality Voice Conversion with Adaptive Learning J Kim, JH Kim, Y Choi, TD Nguyen, S Mun, JS Chung arXiv preprint arXiv:2501.01347, 2025 | | 2025 |
CrossSpeech++: Cross-lingual Speech Synthesis with Decoupled Language and Speaker Generation JH Kim, HS Yang, YC Ju, IH Kim, BY Kim, JS Chung arXiv preprint arXiv:2412.20048, 2024 | | 2024 |
V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow J Choi, JH Kim, J Li, JS Chung, S Liu ICASSP, 2024 | | 2024 |
Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding TD Nguyen, JH Kim, J Choi, S Choi, J Park, Y Lee, JS Chung arXiv preprint arXiv:2410.13839, 2024 | | 2024 |