Filled pauses as cues to the complexity of upcoming phrases for native and non-native listeners M Watanabe, K Hirose, Y Den, N Minematsu Speech Communication 50 (2), 81-94, 2008 | 212 | 2008 |
Free software toolkit for Japanese large vocabulary continuous speech recognition T Kawahara, A Lee, T Kobayashi, K Takeda, N Minematsu, S Sagayama, ... | 162 | 2000 |
WFST-based grapheme-to-phoneme conversion: Open source tools for alignment, model-building and decoding JR Novak, N Minematsu, K Hirose Proceedings of the 10th International Workshop on Finite State Methods and …, 2012 | 149 | 2012 |
Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework JR Novak, N Minematsu, K Hirose Natural Language Engineering 22 (6), 907-938, 2016 | 140 | 2016 |
Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons Y Qiao, N Shimomura, N Minematsu 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 113 | 2008 |
One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space. D Saito, K Yamamoto, N Minematsu, K Hirose Interspeech, 653-656, 2011 | 105 | 2011 |
A Study on Invariance of f-Divergence and Its Application to Speech Recognition Y Qiao, N Minematsu IEEE Transactions on Signal Processing 58 (7), 3884-3890, 2010 | 105 | 2010 |
Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers N Minematsu, M Sekiguchi, K Hirose 2002 IEEE International Conference on Acoustics, Speech, and Signal …, 2002 | 101 | 2002 |
Mathematical evidence of the acoustic universal structure in speech N Minematsu Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005 | 95 | 2005 |
A method for automatic extraction of model parameters from fundamental frequency contours of speech S Narusawa, N Minematsu, K Hirose, H Fujisaki 2002 IEEE International conference on acoustics, speech, and signal …, 2002 | 93 | 2002 |
Development of English speech database read by Japanese to support CALL research N Minematsu, Y Tomiyama, K Yoshimoto, K Shimizu, S Nakagawa, ... Proc. ICA 1 (2004), 557-560, 2004 | 88 | 2004 |
Wasserstein GAN and waveform loss-based acoustic model training for multi-speaker text-to-speech synthesis systems using a WaveNet vocoder Y Zhao, S Takaki, HT Luong, J Yamagishi, D Saito, N Minematsu IEEE Access 6, 60478-60488, 2018 | 75 | 2018 |
Sharable software repository for Japanese large vocabulary continuous speech recognition T Kawahara, T Kobayashi, K Takeda, N Minematsu, K Itou, M Yamamoto, ... | 73 | 1998 |
Synthesis of F0 contours using generation process model parameters predicted from unlabeled corpora: Application to emotional speech synthesis K Hirose, K Sato, Y Asano, N Minematsu Speech Communication 46 (3-4), 385-404, 2005 | 55 | 2005 |
English Speech Database Read by Japanese Learners for CALL System Development. N Minematsu, Y Tomiyama, K Yoshimoto, K Shimizu, S Nakagawa, ... LREC, 2002 | 54 | 2002 |
Yet another acoustic representation of speech sounds N Minematsu 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 53 | 2004 |
Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring. JR Novak, N Minematsu, K Hirose, C Hori, H Kashioka, PR Dixon Interspeech, 2526-2529, 2012 | 50 | 2012 |
Measurement of Objective Intelligibility of Japanese Accented English Using ERJ (English Read by Japanese) Database. N Minematsu, K Okabe, K Ogaki, K Hirose Interspeech, 1481-1484, 2011 | 48 | 2011 |
Japanese dictation toolkit-1997 version T Kawahara, A Lee, T Kobayashi, K Takeda, N Minematsu, K Itou, A Ito, ... Journal of the Acoustical Society of Japan (E) 20 (3), 233-239, 1999 | 45 | 1999 |
Role of prosodic features in the human process of perceiving spoken words and sentences in Japanese N Minematsu, K Hirose Journal of the Acoustical Society of Japan (E) 16 (5), 311-320, 1995 | 45 | 1995 |