Ji-Hoon Kim
Title
Cited by
Year
Fre-GAN: Adversarial frequency-consistent audio synthesis
JH Kim, SH Lee, JH Lee, SW Lee
arXiv preprint arXiv:2106.02297, 2021
Cited by 68, 2021
Multi-spectrogan: High-diversity and high-fidelity spectrogram generation with adversarial style combination for speech synthesis
SH Lee, HW Yoon, HR Noh, JH Kim, SW Lee
Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13198 …, 2021
Cited by 64, 2021
Voicemixer: Adversarial voice style mixup
SH Lee, JH Kim, H Chung, SW Lee
Advances in Neural Information Processing Systems 34, 294-308, 2021
Cited by 43, 2021
PVAE-TTS: Adaptive text-to-speech via progressive style adaptation
JH Lee, SH Lee, JH Kim, SW Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 17, 2022
Fre-gan 2: Fast and efficient frequency-consistent audio synthesis
SH Lee, JH Kim, KE Lee, SW Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 15, 2022
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner.
Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe
Interspeech, 16-20, 2022
Cited by 11, 2022
Crossspeech: Speaker-independent acoustic representation for cross-lingual speech synthesis
JH Kim, HS Yang, YC Ju, IH Kim, BY Kim
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 10, 2023
Fregrad: Lightweight and fast frequency-aware diffusion vocoder
TD Nguyen, JH Kim, Y Jang, J Kim, JS Chung
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 7, 2024
Faces that speak: Jointly synthesising talking face and speech from text
Y Jang, JH Kim, J Ahn, D Kwak, HS Yang, YC Ju, IH Kim, BY Kim, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 7, 2024
GC-TTS: Few-shot speaker adaptation with geometric constraints
JH Kim, SH Lee, JH Lee, HG Jung, SW Lee
2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC …, 2021
Cited by 7, 2021
FlowAVSE: Efficient audio-visual speech enhancement with conditional flow matching
C Jung, S Lee, JH Kim, JS Chung
arXiv preprint arXiv:2406.09286, 2024
Cited by 3, 2024
Let there be sound: reconstructing high quality speech from silent videos
JH Kim, J Kim, JS Chung
Proceedings of the AAAI Conference on Artificial Intelligence 38 (3), 2759-2767, 2024
Cited by 3, 2024
FACTSpeech: Speaking a foreign language pronunciation using only your native characters
HS Yang, JH Kim, YC Ju, IH Kim, BY Kim, SJ Choi, HY Kim
Proc. INTERSPEECH 2023, 606-610, 2023
Cited by 3, 2023
Text-to-speech synthesis in the wild
J Jung, W Zhang, S Maiti, Y Wu, X Wang, JH Kim, Y Matsunaga, S Um, ...
arXiv preprint arXiv:2409.08711, 2024
Cited by 1, 2024
VoxSim: A perceptual voice similarity dataset
J Ahn, Y Kim, Y Choi, D Kwak, JH Kim, S Mun, JS Chung
arXiv preprint arXiv:2407.18505, 2024
Cited by 1, 2024
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild
J Jung, Y Wu, X Wang, JH Kim, S Maiti, Y Matsunaga, H Shim, J Tian, ...
IEEE Open Journal of Signal Processing, 2025
2025
AdaptVC: High Quality Voice Conversion with Adaptive Learning
J Kim, JH Kim, Y Choi, TD Nguyen, S Mun, JS Chung
arXiv preprint arXiv:2501.01347, 2025
2025
CrossSpeech++: Cross-lingual Speech Synthesis with Decoupled Language and Speaker Generation
JH Kim, HS Yang, YC Ju, IH Kim, BY Kim, JS Chung
arXiv preprint arXiv:2412.20048, 2024
2024
V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow
J Choi, JH Kim, J Li, JS Chung, S Liu
ICASSP, 2024
2024
Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding
TD Nguyen, JH Kim, J Choi, S Choi, J Park, Y Lee, JS Chung
arXiv preprint arXiv:2410.13839, 2024
2024