Jason Fong

Citat de

	Toate	Din 2020
Referințe bibliografice	132	129
h-index	6	6
i10-index	5	5

20192020202120222023202420253 15 23 36 22 31 2

Acces public

Afișați-le pe toate

4 articole

0 articole

disponibile

indisponibile

Pe baza cerințelor privind finanțarea

Coautori

Simon KingProfessor of Speech Processing, University of EdinburghAdresă de e-mail confirmată pe ed.ac.uk
Jason TaylorSchool of Informatics, University of EdinburghAdresă de e-mail confirmată pe ed.ac.uk
Gustav Eje HenterKTH Royal Institute of Technology, Stockholm, SwedenAdresă de e-mail confirmată pe kth.se
Jennifer WilliamsAssistant Professor at University of Southampton (UK)Adresă de e-mail confirmată pe soton.ac.uk
Korin RichmondCentre for Speech Technology Research, University of EdinburghAdresă de e-mail confirmată pe cstr.ed.ac.uk
Cassia Valentini-BotinhaoUniversity of EdinburghAdresă de e-mail confirmată pe inf.ed.ac.uk
Zack HodariResearch Engineer, PapercupAdresă de e-mail confirmată pe papercup.com

Urmăriți

Jason Fong

PhD Student, The University of Edinburgh

Adresă de e-mail confirmată pe ed.ac.uk

Speech Synthesis Natural Language Processing Machine Learning


Titlu Sortați după descrierea bibliografică Sortați după an Sortați după titlu	Citat de Citat de	Anul
A comparison between letters and phones as input to sequence-to-sequence models for speech synthesis J Fong, J Taylor, K Richmond, S King The 10th ISCA Speech Synthesis Workshop, 223-227, 2019	37	2019
Where do the improvements come from in sequence-to-sequence neural TTS? O Watts, GE Henter, J Fong, C Valentini-Botinhao 2019 ISCA speech synthesis workshop (SSW) 10, 217-222, 2019	35	2019
Multilingual text-to-speech training using cross language voice conversion and self-supervised learning of speech representations J Wu, A Polyak, Y Taigman, J Fong, P Agrawal, Q He ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	15	2022
Investigating the Robustness of Sequence-to-Sequence Text-to-Speech Models to Imperfectly-Transcribed Training Data. J Fong, PO Gallegos, Z Hodari, S King Interspeech, 1546-1550, 2019	14	2019
Exploring disentanglement with multilingual and monolingual vq-vae J Williams, J Fong, E Cooper, J Yamagishi Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021	12	2021
Testing the limits of representation mixing for pronunciation correction in end-to-end speech synthesis J Fong, J Taylor, S King 21st Annual Conference of the International Speech Communication Association …, 2020	7	2020
Analysing Temporal Sensitivity of VQ-VAE Sub-Phone Codebooks J Fong, J Williams, S King Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 227-231, 2021	4	2021
Improving Polyglot Speech Synthesis through Multi-task and Adversarial Learning J Fong, J Wu, P Agrawal, A Gibiansky, T Koehler, Q He Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 172-176, 2021	3	2021
Spell4TTS: Acoustically-informed spellings for improving text-to-speech pronunciations J Fong, H Tang, S King 12th Speech Synthesis Workshop (SSW) 2023, 2023	2	2023
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech J Fong, D Lyth, GE Henter, H Tang, S King Proc. Interspeech 2022, 1213-1217, 2022	2	2022
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders J Fong, Y Wang, P Agrawal, V Manohar, J Wu, T Köhler, Q He arXiv preprint arXiv:2210.16045, 2022	1	2022
Controlling text-to-speech pronunciation using limited linguistic resources J Fong The University of Edinburgh, 2024		2024

Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.

Articole 1–12

Referințe bibliografice pe an

Citate duplicat

Citate fuzionate

Adăugați coautoriCoautori

Urmăriți

Citat de

Coautori