Text-to-speech synthesis from dark data with evaluation-in-the-loop data selection K Seki, S Takamichi, T Saeki, H Saruwatari ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Diversity-based core-set selection for text-to-speech with linguistic and acoustic features K Seki, S Takamichi, T Saeki, H Saruwatari ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
How generative spoken language modeling encodes noisy speech: Investigation from phonetics to syntactics J Park, S Takamichi, T Nakamura, K Seki, D Xin, H Saruwatari arXiv preprint arXiv:2306.00697, 2023 | 3 | 2023 |
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment T Igarashi, Y Saito, K Seki, S Takamichi, R Yamamoto, K Tachibana, ... arXiv preprint arXiv:2406.07280, 2024 | 1 | 2024 |
Analysis of Degraded Noisty Voice and Application to Other Languages Using Generative Spoken Language Model J PARK, S TAKAMICHI, T NAKAMURA, K SEKI, T SHIN, H SARUWATARI 日本音響学会研究発表会講演論文集 (CD-ROM) 2023, 1-3, 2023 | 1 | 2023 |
J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling W Nakata, K Seki, H Yanaka, Y Saito, S Takamichi, H Saruwatari arXiv preprint arXiv:2407.15828, 2024 | | 2024 |
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals K Seki, S Takamichi, N Takamune, Y Saito, K Imamura, H Saruwatari arXiv preprint arXiv:2406.17722, 2024 | | 2024 |
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark Y Saito, T Igarashi, K Seki, S Takamichi, R Yamamoto, K Tachibana, ... arXiv preprint arXiv:2406.07254, 2024 | | 2024 |
音声品質と音響環境の潜在変数で条件付けた Denoising Training によるノイズロバスト音声変換 五十嵐琢斗, 齋藤佑樹, 関健太郎, 高道慎之介, 山本龍一, 橘健太郎, ... 研究報告音声言語情報処理 (SLP) 2024 (3), 1-6, 2024 | | 2024 |
SRC4VC データセット: 多話者音声変換モデルのベンチマークを目的とした実デバイス収録音声コーパス 齋藤佑樹, 五十嵐琢斗, 関健太郎, 高道慎之介, 山本龍一, 橘健太郎, ... 研究報告音声言語情報処理 (SLP) 2024 (23), 1-1, 2024 | | 2024 |
音環境に適応するテキスト音声合成のための一人称視点コーパス構築 武伯寒, 高道慎之介, 関健太郎, 坂東宜昭, 猿渡洋 研究報告音声言語情報処理 (SLP) 2024 (9), 1-9, 2024 | | 2024 |