Sledovat
Heeseung Kim
Název
Citace
Citace
Rok
Priorgrad: Improving conditional denoising diffusion models with data-driven adaptive prior
S Lee, H Kim, C Shin, X Tan, C Liu, Q Meng, T Qin, W Chen, S Yoon, ...
The Tenth International Conference on Learning Representations (ICLR), 2021
120*2021
Guided-tts: A diffusion model for text-to-speech via classifier guidance
H Kim, S Kim, S Yoon
International Conference on Machine Learning, 11119-11133, 2022
1052022
Edit-A-Video: Single Video Editing with Object-Aware Consistency
C Shin, H Kim, CH Lee, S Lee, S Yoon
Asian Conference on Machine Learning (ACML), Best Paper Award, 2023
512023
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
S Kim, H Kim, S Yoon
arXiv preprint arXiv:2205.15370, 2022
472022
Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings
S Yu, J Song, H Kim, S Lee, WJ Ryu, S Yoon
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
342022
UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data
H Kim, S Kim, J Yeom, S Yoon
INTERSPEECH 2023, 2023
252023
Silent Speech Recognition with Strain Sensors and Deep Learning Analysis of Directional Facial Muscle Movement
H Yoo, E Kim, JW Chung, H Cho, S Jeong, H Kim, D Jang, H Kim, J Yoon, ...
ACS Applied Materials & Interfaces 14 (48), 54157-54169, 2022
192022
Paralinguistics-Aware Speech-Empowered Large Language Models for Natural Conversation
H Kim, S Seo, K Jeong, O Kwon, J Kim, J Lee, E Song, M Oh, S Yoon, ...
The Thirty-Eighth Annual Conference on Neural Information Processing Systems …, 2024
8*2024
HyperCLOVA X Technical Report
KM Yoo, J Han, S In, H Jeon, J Jeong, J Kang, H Kim, KM Kim, M Kim, ...
arXiv preprint arXiv:2404.01954, 2024
62024
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
C Shin, J Choi, H Kim, S Yoon
arXiv preprint arXiv:2411.15466, 2024
32024
Stein Latent Optimization for Generative Adversarial Networks
U Hwang, H Kim, D Jung, H Jang, H Lee, S Yoon
The Tenth International Conference on Learning Representations (ICLR), 2021
32021
VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech
H Kim, S Lee, J Yeom, CH Lee, S Kim, S Yoon
INTERSPEECH 2024, 2024
22024
NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers
N Park, H Kim, CH Lee, J Choi, J Yeom, S Yoon
arXiv preprint arXiv:2409.15760, 2024
12024
VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance
J Yeom, H Kim, J Choi, CH Lee, N Park, S Yoon
arXiv preprint arXiv:2409.15759, 2024
12024
Speech recognition using facial skin strain data
Y Sungroh, E Kim, KIM Heeseung
US Patent 11,810,549, 2023
12023
Style-Friendly SNR Sampler for Style-Driven Generation
J Choi, C Shin, Y Oh, H Kim, S Yoon
arXiv preprint arXiv:2411.14793, 2024
2024
Method and apparatus for training an unsupervised conditional generative model
Y Sungroh, U Hwang, KIM Heeseung
US Patent App. 18/204,457, 2023
2023
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–17