Takaaki Saeki
Google DeepMind
Verified email at google.com
Title
Cited by
Year
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
T Saeki, D Xin, W Nakata, T Koriyama, S Takamichi, H Saruwatari
arXiv preprint arXiv:2204.02152, 2022
Cited by: 184, Year: 2022
ESPnet2-TTS: Extending the Edge of TTS Research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
Cited by: 69, Year: 2021
SpeechLMScore: Evaluating speech generation using speech language model
S Maiti, Y Peng, T Saeki, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 33, Year: 2023
JTubeSpeech: corpus of Japanese speech collected from YouTube for speech recognition and speaker verification
S Takamichi, L Kürzinger, T Saeki, S Shiota, S Watanabe
arXiv preprint arXiv:2112.09323, 2021
Cited by: 24, Year: 2021
Incremental text-to-speech synthesis using pseudo lookahead with large pretrained language model
T Saeki, S Takamichi, H Saruwatari
IEEE Signal Processing Letters 28, 857-861, 2021
Cited by: 23, Year: 2021
Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU.
T Saeki, Y Saito, S Takamichi, H Saruwatari
INTERSPEECH, 1021-1022, 2020
Cited by: 16, Year: 2020
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
T Saeki, S Maiti, S Takamichi, S Watanabe, H Saruwatari
arXiv preprint arXiv:2401.16812, 2024
Cited by: 15, Year: 2024
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech
T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 15, Year: 2023
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining
T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari
arXiv preprint arXiv:2301.12596, 2023
Cited by: 15, Year: 2023
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech
D Yang, T Koriyama, Y Saito, T Saeki, D Xin, H Saruwatari
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 14, Year: 2023
Yodas: Youtube-Oriented Dataset for Audio and Speech
X Li, S Takamichi, T Saeki, W Chen, S Shiota, S Watanabe
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 13, Year: 2023
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
T Saeki, K Tachibana, R Yamamoto
arXiv preprint arXiv:2203.15683, 2022
Cited by: 13, Year: 2022
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
T Saeki, G Wang, N Morioka, I Elias, K Kastner, A Rosenberg, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 12, Year: 2024
Text-to-speech synthesis from dark data with evaluation-in-the-loop data selection
K Seki, S Takamichi, T Saeki, H Saruwatari
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 12, Year: 2023
End-to-End Deep Learning Speech Recognition Model for Silent Speech Challenge.
N Kimura, Z Su, T Saeki
INTERSPEECH, 1025-1026, 2020
Cited by: 8, Year: 2020
Heiga Zen, Zhehuai Chen, Nobuyuki Morioka, Gary Wang, Yu Zhang, Ankur Bapna, Andrew Rosenberg, and Bhuvana Ramabhadran. Virtuoso: Massive multilingual speech-text joint semi …
T Saeki
ICASSP 2023, 1-5, 2023
Cited by: 6, Year: 2023
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
T Saeki, S Takamichi, T Nakamura, N Tanji, H Saruwatari
arXiv preprint arXiv:2203.12937, 2022
Cited by: 6, Year: 2022
Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis
T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 5, Year: 2024
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network
T Saeki, S Takamichi, H Saruwatari
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by: 5, Year: 2021
Diversity-based core-set selection for text-to-speech with linguistic and acoustic features
K Seki, S Takamichi, T Saeki, H Saruwatari
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 4, Year: 2024