Eunwoo Song

Cited by

	All	Since 2020
Citations	1578	1501
h-index	14	14
i10-index	17	16

Citations per year

Year	Citations
2015	4
2016	7
2017	8
2018	10
2019	44
2020	117
2021	283
2022	348
2023	339
2024	365
2025	35


Eunwoo Song

Voice, Naver Cloud

Verified email at navercorp.com

Speech Synthesis


Title	Authors	Venue	Cited by	Year
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram	R Yamamoto, E Song, JM Kim	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	1028	2020
Effective spectral and excitation modeling techniques for LSTM-RNN-based speech synthesis systems	E Song, FK Soong, HG Kang	IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (11 …, 2017	72	2017
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation	R Yamamoto, E Song, JM Kim	arXiv preprint arXiv:1904.04472, 2019	62	2019
HierSpeech: Bridging the gap between text and speech by hierarchical variational inference using self-supervised representations for speech synthesis	SH Lee, SB Kim, JH Lee, E Song, MJ Hwang, SW Lee	Advances in Neural Information Processing Systems 35, 16624-16636, 2022	49	2022
ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems	E Song, K Byun, HG Kang	2019 27th European Signal Processing Conference (EUSIPCO), 1-5, 2019	44	2019
TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis	MJ Hwang, R Yamamoto, E Song, JM Kim	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	41	2021
LP-WaveNet: Linear prediction-based WaveNet speech synthesis	MJ Hwang, F Soong, E Song, X Wang, H Kang, HG Kang	2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020	33	2020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators	R Yamamoto, E Song, MJ Hwang, JM Kim	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	24	2021
Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss	E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim	2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021	24	2021
Cross-speaker emotion transfer for low-resource text-to-speech using non-parallel voice conversion with pitch-shift data augmentation	R Terashima, R Yamamoto, E Song, Y Shirahata, HW Yoon, JM Kim, ...	arXiv preprint arXiv:2204.10020, 2022	23	2022
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model	MJ Hwang, R Yamamoto, E Song, JM Kim	Interspeech, 2227-2231, 2021	19	2021
Period VITS: Variational inference with explicit pitch modeling for end-to-end emotional speech synthesis	Y Shirahata, R Yamamoto, E Song, R Terashima, JM Kim, K Tachibana	ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	18	2023
Improving LPCNet-based text-to-speech with linear prediction-structured mixture density network	MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	16	2020
Language model-based emotion prediction methods for emotional speech synthesis systems	HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang	arXiv preprint arXiv:2206.15067, 2022	15	2022
Neural text-to-speech with a modeling-by-generation excitation vocoder	E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim	arXiv preprint arXiv:2008.00132, 2020	11	2020
Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system	E Song, YS Joo, HG Kang	2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	11	2015
LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks	HK Nguyen, K Jeong, SY Um, MJ Hwang, E Song, HG Kang	Interspeech, 3595-3599, 2021	10	2021
Unified Speech-Text Pretraining for Spoken Dialog Modeling	H Kim, S Seo, K Jeong, O Kwon, J Kim, J Lee, E Song, M Oh, S Yoon, ...	arXiv preprint arXiv:2402.05706, 2024	8	2024
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder	E Song, R Yamamoto, O Kwon, CH Song, MJ Hwang, S Oh, HW Yoon, ...	arXiv preprint arXiv:2206.14984, 2022	8	2022
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis	JY Lee, SJ Cheon, BJ Choi, NS Kim, E Song	INTERSPEECH, 917-921, 2018	8	2018
