BigVGAN: A Universal Neural Vocoder with Large-Scale Training S Lee, W Ping, B Ginsburg, B Catanzaro, S Yoon International Conference on Learning Representations (ICLR), 2023 | 247 | 2023 |
FloWaveNet: A generative flow for raw audio S Kim, S Lee, J Song, J Kim, S Yoon International Conference on Machine Learning (ICML), 2018 | 215 | 2018 |
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior S Lee, H Kim, C Shin, X Tan, C Liu, Q Meng, T Qin, W Chen, S Yoon, ... International Conference on Learning Representations (ICLR), 2022 | 122 | 2022 |
Liver Lesion Detection from Weakly-Labeled Multi-phase CT Volumes with a Grouped Single Shot MultiBox Detector S Lee, JS Bae, H Kim, JH Kim, S Yoon International Conference on Medical Image Computing and Computer-Assisted …, 2018 | 50 | 2018 |
Polyphonic Music Generation with Sequence Generative Adversarial Networks S Lee, U Hwang, S Min, S Yoon Journal of KIISE 51 (1), 78-85, 2017 | 48* | 2017 |
Edit-A-Video: Single Video Editing with Object-Aware Consistency C Shin, H Kim, CH Lee, S Lee, S Yoon Asian Conference on Machine Learning (ACML), Best Paper Award, 2023 | 45 | 2023 |
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity S Lee, S Kim, S Yoon Neural Information Processing Systems (NeurIPS), 2020 | 17 | 2020 |
One-Shot Learning for Text-to-SQL Generation D Lee, J Yoon, J Song, S Lee, S Yoon arXiv preprint arXiv:1905.11499, 2019 | 15 | 2019 |
Robust End-to-End Focal Liver Lesion Detection using Unregistered Multiphase Computed Tomography Images S Lee, E Kim, JS Bae, JH Kim, S Yoon IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), 2021 | 12 | 2021 |
Improving Text-To-Audio Models with Synthetic Captions Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ... Interspeech SynData4GenAI, 2024 | 10 | 2024 |
Small RNA transcriptome of hibiscus Syriacus provides insights into the potential influence of microRNAs in flower development and terpene synthesis T Kim, JH Park, S Lee, S Kim, J Kim, J Lee, C Shin Molecules and cells 40 (8), 587-597, 2017 | 9 | 2017 |
Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference E Casanova, R Langman, P Neekhara, S Hussain, J Li, S Ghosh, A Jukić, ... arXiv preprint arXiv:2409.12117, 2024 | 3 | 2024 |
VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech H Kim, S Lee, J Yeom, CH Lee, S Kim, S Yoon Interspeech, 2024 | 2 | 2024 |
An efficient approach to boosting performance of deep spiking network training S Park, S Lee, H Nam, S Yoon Neural Information Processing Systems (NIPS) Workshop on Computing with Spikes, 2016 | 2 | 2016 |
A2SB: Audio-to-Audio Schrodinger Bridges Z Kong, KJ Shih, W Nie, A Vahdat, S Lee, JF Santos, A Jukic, R Valle, ... arXiv preprint arXiv:2501.11311, 2025 | | 2025 |
Fugatto 1: Foundational Generative Audio Transformer Opus 1 R Valle, R Badlani, Z Kong, S Lee, A Goel, JF Santos, A Aljafari, S Kim, ... The Thirteenth International Conference on Learning Representations, 2025 | | 2025 |
ETTA: Elucidating the Design Space of Text-to-Audio Models S Lee, Z Kong, A Goel, S Kim, R Valle, B Catanzaro arXiv preprint arXiv:2412.19351, 2024 | | 2024 |
Deep Generative Model for Waveform Synthesis S Lee Ph.D. Dissertation, Seoul National University. Link: https://snu-primo …, 2023 | | 2023 |