Autoregressive diffusion transformer for text-to-speech synthesis Z Liu, S Wang, S Inoue, Q Bai, H Li arXiv preprint arXiv:2406.05551, 2024 | 12 | 2024 |
Hierarchical emotion prediction and control in text-to-speech synthesis S Inoue, K Zhou, S Wang, H Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 7 | 2024 |
Fine-Grained Quantitative Emotion Editing for Speech Generation S Inoue, K Zhou, S Wang, H Li Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2024 | 3 | 2024 |
Hierarchical Control of Emotion Rendering in Speech Synthesis S Inoue, K Zhou, S Wang, H Li arXiv preprint arXiv:2412.12498, 2024 | | 2024 |
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion S Inoue, S Wang, W Wang, P Zhu, M Bi, H Li ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Activation Maximization with a Prior in Speech Data S Inoue, T Gonsalves American Journal of Computer Science and Technology 4 (3), 75-82, 2021 | | 2021 |
Style-Restricted GAN: Multi-Modal Translation with Style Restriction Using Generative Adversarial Networks S Inoue, T Gonsalves arXiv.org, 2021 | | 2021 |
LCGAN: Conditional GAN with Multiple Discrete Classes S Inoue, T Gonsalves Proceedings of the Annual Conference of JSAI, 34th (2020), 2K4ES202-2K4ES202, 2020 | | 2020 |