Obserwuj
Yi-Chiao, Wu
Tytuł
Cytowane przez
Cytowane przez
Rok
Voice conversion from unaligned corpora using variational autoencoding wasserstein generative adversarial networks
CC Hsu, HT Hwang, YC Wu, Y Tsao, HM Wang
arXiv preprint arXiv:1704.00849, 2017
4642017
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language 64, 101114, 2020
4322020
Voice conversion from non-parallel corpora using variational auto-encoder
CC Hsu, HT Hwang, YC Wu, Y Tsao, HM Wang
2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016
3762016
Voice transformer network: Sequence-to-sequence voice conversion using transformer with text-to-speech pretraining
WC Huang, T Hayashi, YC Wu, H Kameoka, T Toda
arXiv preprint arXiv:1912.06813, 2019
1152019
Non-parallel voice conversion with cyclic variational autoencoder
PL Tobing, YC Wu, T Hayashi, K Kobayashi, T Toda
arXiv preprint arXiv:1907.10185, 2019
902019
Audiobox: Unified audio generation with natural language prompts
A Vyas, B Shi, M Le, A Tjandra, YC Wu, B Guo, J Zhang, X Zhang, ...
arXiv preprint arXiv:2312.15821, 2023
892023
AudioDec: An Open-Source Streaming High-Fidelity Neural Audio Codec
YC Wu, ID Gebru, D Marković, A Richard
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
842023
Movie gen: A cast of media foundation models
A Polyak, A Zohar, A Brown, A Tjandra, A Sinha, A Lee, A Vyas, B Shi, ...
arXiv preprint arXiv:2410.13720, 2024
822024
Pretraining techniques for sequence-to-sequence voice conversion
WC Huang, T Hayashi, YC Wu, H Kameoka, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 745-755, 2021
482021
Locally Linear Embedding for Exemplar-Based Spectral Conversion.
YC Wu, HT Hwang, CC Hsu, Y Tsao, HM Wang
INTERSPEECH, 1652-1656, 2016
432016
Any-to-one sequence-to-sequence voice conversion using self-supervised discrete speech representations
WC Huang, YC Wu, T Hayashi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
402021
Source-Filter HiFi-GAN: Fast and pitch controllable high-fidelity neural vocoder
R Yoneyama, YC Wu, T Toda
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
352023
The NU non-parallel voice conversion system for the voice conversion challenge 2018
YC Wu, PL Tobing, T Hayashi, K Kobayashi, T Toda
WORLD 2 (m3), a1, 2018
322018
Collapsed speech segment detection and suppression for WaveNet vocoder
YC Wu, K Kobayashi, T Hayashi, PL Tobing, T Toda
arXiv preprint arXiv:1804.11055, 2018
302018
Refined wavenet vocoder for variational autoencoder based voice conversion
WC Huang, YC Wu, HT Hwang, PL Tobing, T Hayashi, K Kobayashi, ...
2019 27th European Signal Processing Conference (EUSIPCO), 1-5, 2019
292019
Human identification system by fusion of face recognition and speaker recognition, method and service robot thereof
KT Song, SC Chien, CY Lin, YW Chen, SH Chen, CY Chiang, YC Wu
US Patent 8,879,799, 2014
282014
Quasi-periodic parallel WaveGAN: A non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network
YC Wu, T Hayashi, T Okamoto, H Kawai, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 792-806, 2021
272021
crank: An open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder
K Kobayashi, WC Huang, YC Wu, PL Tobing, T Hayashi, T Toda
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
252021
Quasi-periodic WaveNet: An autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network
YC Wu, T Hayashi, PL Tobing, K Kobayashi, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1134-1148, 2021
252021
Voice conversion with cyclic recurrent neural network and fine-tuned WaveNet vocoder
PL Tobing, YC Wu, T Hayashi, K Kobayashi, T Toda
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
222019
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20