Yusuke Yasuda

Cited by

	All	Since 2020
Citations	724	626
h-index	10	10
i10-index	10	10

Citations per year: 2005: 3, 2006: 4, 2007: 4, 2008: 2, 2009: 4, 2010: 6, 2011: 3, 2012: 4, 2013: 9, 2014: 7, 2015: 6, 2016: 7, 2017: 3, 2018: 5, 2019: 26, 2020: 52, 2021: 109, 2022: 122, 2023: 132, 2024: 173, 2025: 31

Public access

5 articles available
0 articles not available

Based on funding mandates

Yusuke Yasuda

Nagoya University

Verified email at g.sp.m.is.nagoya-u.ac.jp

Speech synthesis


Title	Cited by	Year
Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	226	2020
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language Y Yasuda, X Wang, S Takaki, J Yamagishi ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	112	2019
Multivariate analysis of risk factors for hemorrhagic cystitis after hematopoietic stem cell transplantation K Tsuboi, K Kishi, K Ohmachi, Y Yasuda, T Shimizu, H Inoue, ... Bone marrow transplantation 32 (9), 903-907, 2003	83	2003
Espnet2-tts: Extending the edge of tts research T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ... arXiv preprint arXiv:2110.07840, 2021	70	2021
The singing voice conversion challenge 2023 WC Huang, LP Violeta, S Liu, J Shi, T Toda 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	58	2023
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis Y Yasuda, X Wang, J Yamagishi Computer Speech & Language 67, 101183, 2021	37	2021
Can speaker augmentation improve multi-speaker end-to-end TTS? E Cooper, CI Lai, Y Yasuda, J Yamagishi arXiv preprint arXiv:2005.01245, 2020	28	2020
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments Y Yasuda, X Wang, J Yamagishi arXiv preprint arXiv:1908.11535, 2019	25	2019
End-to-end text-to-speech using latent duration based on vq-vae Y Yasuda, X Wang, J Yamagishi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	22	2021
Modeling of Rakugo speech and its limitations: Toward speech synthesis that entertains audiences S Kato, Y Yasuda, X Wang, E Cooper, S Takaki, J Yamagishi IEEE Access 8, 138149-138161, 2020	13	2020
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder Y Yasuda, T Toda ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	8	2023
Rakugo speech synthesis using segment-to-segment neural transduction and style tokens—toward speech synthesis for entertaining audiences S Kato, Y Yasuda, X Wang, E Cooper, S Takaki, J Yamagishi Proc. 10th ISCA Speech Synth. Workshop, 111-116, 2019	8	2019
Analysis of mean opinion scores in subjective evaluation of synthetic speech based on tail probabilities Y Yasuda, T Toda Proc. Interspeech, 5491-5495, 2023	6	2023
Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language Y Yasuda, T Toda IEEE Journal of Selected Topics in Signal Processing 16 (6), 1319-1328, 2022	6	2022
Pretraining strategies, waveform model choice, and acoustic configurations for multi-speaker end-to-end speech synthesis E Cooper, X Wang, Y Zhao, Y Yasuda, J Yamagishi arXiv preprint arXiv:2011.04839, 2020	6	2020
Preference-based training framework for automatic speech quality assessment using deep neural network CH Hu, Y Yasuda, T Toda arXiv preprint arXiv:2308.15203, 2023	5	2023
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment Y Yasuda, X Wang, J Yamagishi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	5	2020
Tts tutorial at ieice sp workshop X Wang, Y Yasuda	4	2019
Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage. D Yoshioka, Y Yasuda, N Matsunaga, Y Ohtani, T Toda INTERSPEECH, 4576-4580, 2022	1	2022
Use and evaluation of Tacotron and context features in Rakugo speech synthesis S Kato, S Takaki, J Yamagishi, Y Yasuda IEICE Technical Report 118 (495 …, 2019	1	2019

Articles 1–20
