Urmăriți
Jason Fong
Jason Fong
PhD Student, The University of Edinburgh
Adresă de e-mail confirmată pe ed.ac.uk
Titlu
Citat de
Citat de
Anul
A comparison between letters and phones as input to sequence-to-sequence models for speech synthesis
J Fong, J Taylor, K Richmond, S King
The 10th ISCA Speech Synthesis Workshop, 223-227, 2019
372019
Where do the improvements come from in sequence-to-sequence neural TTS?
O Watts, GE Henter, J Fong, C Valentini-Botinhao
2019 ISCA speech synthesis workshop (SSW) 10, 217-222, 2019
352019
Multilingual text-to-speech training using cross language voice conversion and self-supervised learning of speech representations
J Wu, A Polyak, Y Taigman, J Fong, P Agrawal, Q He
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
152022
Investigating the Robustness of Sequence-to-Sequence Text-to-Speech Models to Imperfectly-Transcribed Training Data.
J Fong, PO Gallegos, Z Hodari, S King
Interspeech, 1546-1550, 2019
142019
Exploring disentanglement with multilingual and monolingual vq-vae
J Williams, J Fong, E Cooper, J Yamagishi
Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021
122021
Testing the limits of representation mixing for pronunciation correction in end-to-end speech synthesis
J Fong, J Taylor, S King
21st Annual Conference of the International Speech Communication Association …, 2020
72020
Analysing Temporal Sensitivity of VQ-VAE Sub-Phone Codebooks
J Fong, J Williams, S King
Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 227-231, 2021
42021
Improving Polyglot Speech Synthesis through Multi-task and Adversarial Learning
J Fong, J Wu, P Agrawal, A Gibiansky, T Koehler, Q He
Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 172-176, 2021
32021
Spell4TTS: Acoustically-informed spellings for improving text-to-speech pronunciations
J Fong, H Tang, S King
12th Speech Synthesis Workshop (SSW) 2023, 2023
22023
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech
J Fong, D Lyth, GE Henter, H Tang, S King
Proc. Interspeech 2022, 1213-1217, 2022
22022
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders
J Fong, Y Wang, P Agrawal, V Manohar, J Wu, T Köhler, Q He
arXiv preprint arXiv:2210.16045, 2022
12022
Controlling text-to-speech pronunciation using limited linguistic resources
J Fong
The University of Edinburgh, 2024
2024
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–12