Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 39 | 2021 |
Neural Dialogue Context Online End-of-Turn Detection R Masumura, T Tanaka, A Ando, R Ishii, R Higashinaka, Y Aono Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue …, 2018 | 38 | 2018 |
Neural Error Corrective Language Models for Automatic Speech Recognition. T Tanaka, R Masumura, H Masataki, Y Aono Interspeech, 401-405, 2018 | 35 | 2018 |
Large Context End-to-end Automatic Speech Recognition via Extension of Hierarchical Recurrent Encoder-decoder Models R Masumura, T Tanaka, T Moriya, Y Shinohara, T Oba, Y Aono ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 33 | 2019 |
Automation of system building for state-of-the-art large vocabulary speech recognition using evolution strategy T Moriya, T Tanaka, T Shinozaki, S Watanabe, K Duh 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 33 | 2015 |
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models T Ashihara, T Moriya, K Matsuura, T Tanaka arXiv preprint arXiv:2207.06867, 2022 | 30 | 2022 |
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020 | 25 | 2020 |
Automated structure discovery and parameter tuning of neural network language model based on evolution strategy T Tanaka, T Moriya, T Shinozaki, S Watanabe, T Hori, K Duh 2016 IEEE Spoken Language Technology Workshop (SLT), 665-671, 2016 | 21 | 2016 |
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. T Tanaka, R Masumura, T Moriya, T Oba, Y Aono INTERSPEECH, 2210-2214, 2019 | 20 | 2019 |
Evolution-strategy-based automation of system development for high-performance speech recognition T Moriya, T Tanaka, T Shinozaki, S Watanabe, K Duh IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (1), 77-88, 2018 | 18 | 2018 |
Distilling Attention Weights for CTC-Based ASR Systems T Moriya, H Sato, T Tanaka, T Ashihara, R Masumura, Y Shinohara ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 16 | 2020 |
Leveraging Large Text Corpora For End-To-End Speech Summarization K Matsuura, T Ashihara, T Moriya, T Tanaka, A Ogawa, M Delcroix, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 15 | 2023 |
Multi-task and Multi-lingual Joint Learning of Neural Lexical Utterance Classification based on Partially-shared Modeling R Masumura, T Tanaka, R Higashinaka, H Masataki, Y Aono Proceedings of the 27th International Conference on Computational …, 2018 | 15 | 2018 |
Neural Speech-to-Text Language Models for Rescoring Hypotheses of DNN-HMM Hybrid Automatic Speech Recognition Systems T Tanaka, R Masumura, T Moriya, Y Aono 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 13 | 2018 |
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition. R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi INTERSPEECH, 2822-2826, 2020 | 12 | 2020 |
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge? T Ashihara, T Moriya, K Matsuura, T Tanaka, Y Ijima, T Asami, M Delcroix, ... arXiv preprint arXiv:2306.08374, 2023 | 11 | 2023 |
Enrollment-less training for personalized voice activity detection N Makishima, M Ihori, T Tanaka, A Takashima, S Orihashi, R Masumura arXiv preprint arXiv:2106.12132, 2021 | 11 | 2021 |
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi, R Masumura ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
Role Play Dialogue Aware Language Models Based on Conditional Hierarchical Recurrent Encoder-Decoder. R Masumura, T Tanaka, A Ando, H Masataki, Y Aono Interspeech, 1259-1263, 2018 | 11 | 2018 |
Evolutionary optimization of long short-term memory neural network language model T Tanaka, T Moriya, T Shinozaki, S Watanabe, T Hori, K Duh Journal of the Acoustical Society of America 140 (4_Supplement), 3062-3062, 2016 | 11 | 2016 |