Attentive statistics pooling for deep speaker embedding K Okabe, T Koshinaka, K Shinoda arXiv preprint arXiv:1803.10963, 2018 | 670 | 2018 |
MDL-based context-dependent subword modeling for speech recognition K Shinoda, T Watanabe Acoustical Science and Technology 21 (2), 79-86, 2000 | 373 | 2000 |
A structural Bayes approach to speaker adaptation K Shinoda, CH Lee IEEE Transactions on Speech and Audio Processing 9 (3), 276-287, 2001 | 221 | 2001 |
Acoustic modeling based on the MDL principle for speech recognition. K Shinoda, T Watanabe Eurospeech, 99-102, 1997 | 202 | 1997 |
Multimodal fusion of BERT-CNN and gated CNN representations for depression detection M Rodrigues Makiuchi, T Warnita, K Uto, K Shinoda Proceedings of the 9th International on Audio/Visual Emotion Challenge and …, 2019 | 155 | 2019 |
Structural MAP speaker adaptation using hierarchical priors K Shinoda, CH Lee 1997 IEEE Workshop on Automatic Speech Recognition and Understanding …, 1997 | 114 | 1997 |
An online attention-based model for speech recognition R Fan, P Zhou, W Chen, J Jia, G Liu arXiv preprint arXiv:1811.05247, 2018 | 90* | 2018 |
Multimodal emotion recognition with high-level speech and text features MR Makiuchi, K Uto, K Shinoda 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 88 | 2021 |
GINGA observation of the X-ray pulsar 1E 2259+586 in the supernova remnant G109.1-1.0 K Koyama, F Nagase, Y Ogawara, K Shinoda, N Kawai, MH Jones, ... Astronomical Society of Japan, Publications (ISSN 0004-6264), vol. 41, no. 3 …, 1989 | 82 | 1989 |
Technique for adaptation of hidden Markov models for speech recognition CH Lee, K Shinoda US Patent 6,151,574, 2000 | 77 | 2000 |
Speaker adaptation with autonomous model complexity control by MDL principle K Shinoda, T Watanabe 1996 IEEE International Conference on Acoustics, Speech, and Signal …, 1996 | 69 | 1996 |
A fast and accurate video semantic-indexing system using fast MAP adaptation and GMM supervectors N Inoue, K Shinoda IEEE Transactions on Multimedia 14 (4), 1196-1205, 2012 | 59 | 2012 |
Implicit neural representations for variable length human motion generation P Cervantes, Y Sekikawa, I Sato, K Shinoda European Conference on Computer Vision, 356-372, 2022 | 58 | 2022 |
Detecting Alzheimer's disease using gated convolutional neural network from audio data T Warnita, N Inoue, K Shinoda arXiv preprint arXiv:1803.11344, 2018 | 58 | 2018 |
User adaptation of convolutional neural network for human activity recognition S Matsui, N Inoue, Y Akagi, G Nagino, K Shinoda 2017 25th European Signal Processing Conference (EUSIPCO), 753-757, 2017 | 54 | 2017 |
Speaker adaptation techniques for automatic speech recognition K Shinoda Proc. APSIPA ASC 2011, 2011 | 53 | 2011 |
High speed speech recognition using tree-structured probability density function T Watanabe, K Shinoda, K Takagi, KI Iso 1995 International Conference on Acoustics, Speech, and Signal Processing 1 …, 1995 | 50 | 1995 |
Speaker adaptation with autonomous control using tree structure K Shinoda, T Watanabe Proc. Eurospeech 1995, 1143-1146, 1995 | 49 | 1995 |
Spectral graph skeletons for 3D action recognition T Kerola, N Inoue, K Shinoda Asian Conference on Computer Vision, 417-432, 2014 | 47 | 2014 |
Speech recognition apparatus K Shinoda US Patent 7,437,288, 2008 | 47 | 2008 |