Ji-Hoon Kim

Cited by

	All	Since 2020
Citations	260	257
h-index	7	7
i10-index	7	7

Citations per year
Year	Citations
2021	8
2022	54
2023	86
2024	93
2025	13

Co-authors

Sang-Hoon Lee, Ajou University (verified email at ajou.ac.kr)
Joon Son Chung, KAIST (verified email at kaist.ac.kr)
Youngjoon Jang, KAIST (verified email at kaist.ac.kr)
Tan Dat Nguyen, Student, KAIST (verified email at kaist.ac.kr)
Junseok Ahn, KAIST (verified email at kaist.ac.kr)
Jee-weon Jung, Apple, Carnegie Mellon University (verified email at ieee.org)
Jeongsoo Choi, KAIST (verified email at kaist.ac.kr)


Ji-Hoon Kim

KAIST

Verified email at kaist.ac.kr

Research interests: speech processing, speech synthesis, multimodal learning


Title	Cited by	Year
Fre-GAN: Adversarial frequency-consistent audio synthesis. JH Kim, SH Lee, JH Lee, SW Lee. arXiv preprint arXiv:2106.02297, 2021	68	2021
Multi-SpectroGAN: High-diversity and high-fidelity spectrogram generation with adversarial style combination for speech synthesis. SH Lee, HW Yoon, HR Noh, JH Kim, SW Lee. Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13198 …, 2021	64	2021
VoiceMixer: Adversarial voice style mixup. SH Lee, JH Kim, H Chung, SW Lee. Advances in Neural Information Processing Systems 34, 294-308, 2021	43	2021
PVAE-TTS: Adaptive text-to-speech via progressive style adaptation. JH Lee, SH Lee, JH Kim, SW Lee. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022	17	2022
Fre-GAN 2: Fast and efficient frequency-consistent audio synthesis. SH Lee, JH Kim, KE Lee, SW Lee. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022	15	2022
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner. Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe. Interspeech, 16-20, 2022	11	2022
CrossSpeech: Speaker-independent acoustic representation for cross-lingual speech synthesis. JH Kim, HS Yang, YC Ju, IH Kim, BY Kim. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023	10	2023
FreGrad: Lightweight and fast frequency-aware diffusion vocoder. TD Nguyen, JH Kim, Y Jang, J Kim, JS Chung. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024	7	2024
Faces that speak: Jointly synthesising talking face and speech from text. Y Jang, JH Kim, J Ahn, D Kwak, HS Yang, YC Ju, IH Kim, BY Kim, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	7	2024
GC-TTS: Few-shot speaker adaptation with geometric constraints. JH Kim, SH Lee, JH Lee, HG Jung, SW Lee. 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC …, 2021	7	2021
FlowAVSE: Efficient audio-visual speech enhancement with conditional flow matching. C Jung, S Lee, JH Kim, JS Chung. arXiv preprint arXiv:2406.09286, 2024	3	2024
Let there be sound: Reconstructing high quality speech from silent videos. JH Kim, J Kim, JS Chung. Proceedings of the AAAI Conference on Artificial Intelligence 38 (3), 2759-2767, 2024	3	2024
FACTSpeech: Speaking a foreign language pronunciation using only your native characters. HS Yang, JH Kim, YC Ju, IH Kim, BY Kim, SJ Choi, HY Kim. Proc. INTERSPEECH 2023, 606-610, 2023	3	2023
Text-to-speech synthesis in the wild. J Jung, W Zhang, S Maiti, Y Wu, X Wang, JH Kim, Y Matsunaga, S Um, ... arXiv preprint arXiv:2409.08711, 2024	1	2024
VoxSim: A perceptual voice similarity dataset. J Ahn, Y Kim, Y Choi, D Kwak, JH Kim, S Mun, JS Chung. arXiv preprint arXiv:2407.18505, 2024	1	2024
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild. J Jung, Y Wu, X Wang, JH Kim, S Maiti, Y Matsunaga, H Shim, J Tian, ... IEEE Open Journal of Signal Processing, 2025		2025
AdaptVC: High Quality Voice Conversion with Adaptive Learning. J Kim, JH Kim, Y Choi, TD Nguyen, S Mun, JS Chung. arXiv preprint arXiv:2501.01347, 2025		2025
CrossSpeech++: Cross-lingual Speech Synthesis with Decoupled Language and Speaker Generation. JH Kim, HS Yang, YC Ju, IH Kim, BY Kim, JS Chung. arXiv preprint arXiv:2412.20048, 2024		2024
V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow. J Choi, JH Kim, J Li, JS Chung, S Liu. ICASSP, 2024		2024
Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding. TD Nguyen, JH Kim, J Choi, S Choi, J Park, Y Lee, JS Chung. arXiv preprint arXiv:2410.13839, 2024		2024
