Title | Authors | Venue | Cited by | Year |
Voiced/nonvoiced detection based on robustness of voiced epochs | N Dhananjaya, B Yegnanarayana | IEEE Signal Processing Letters 17 (3), 273-276, 2009 | 121 | 2009 |
Spectro-temporal analysis of speech signals using zero-time windowing and group delay function | Y Bayya, DN Gowda | Speech Communication 55 (6), 782-795, 2013 | 72 | 2013 |
Attention based on-device streaming speech recognition with large speech corpus | K Kim, K Lee, D Gowda, J Park, S Kim, S Jin, YY Lee, J Yeo, D Kim, ... | 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 67 | 2019 |
Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System | C Kim, M Shin, A Garg, D Gowda | Interspeech, 739-743, 2019 | 51 | 2019 |
Acoustic analysis of trill sounds | N Dhananjaya, B Yegnanarayana, P Bhaskararao | The Journal of the Acoustical Society of America 131 (4), 3141-3152, 2012 | 46 | 2012 |
Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction | V Vestman, D Gowda, M Sahidullah, P Alku, T Kinnunen | Speech Communication 99, 62-79, 2018 | 45 | 2018 |
A review of on-device fully neural end-to-end automatic speech recognition algorithms | C Kim, D Gowda, D Lee, J Kim, A Kumar, S Kim, A Garg, C Han | 2020 54th Asilomar Conference on Signals, Systems, and Computers, 277-283, 2020 | 40 | 2020 |
End-to-end training of a large vocabulary end-to-end speech recognition system | C Kim, S Kim, K Kim, M Kumar, J Kim, K Lee, C Han, A Garg, E Kim, ... | 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 30 | 2019 |
Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios | A Kumar, S Singh, D Gowda, A Garg, S Singh, C Kim | Interspeech 2020, 4357-4361, 2020 | 26 | 2020 |
Improved multi-stage training of online attention-based encoder-decoder models | A Garg, D Gowda, A Kumar, K Kim, M Kumar, C Kim | 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 70-77, 2019 | 25 | 2019 |
Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition | D Gowda, A Garg, K Kim, M Kumar, C Kim | Interspeech, 2783-2787, 2019 | 22 | 2019 |
Hierarchical Multi-Stage Word-to-Grapheme Named Entity Corrector for Automatic Speech Recognition | A Garg, A Gupta, D Gowda, S Singh, C Kim | Interspeech, 1793-1797, 2020 | 21 | 2020 |
Analysis of breathy, modal and pressed phonation based on low frequency spectral density | DN Gowda, M Kurimo | Interspeech, 3206-3210, 2013 | 21 | 2013 |
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing | A Garg, GP Vadisetti, D Gowda, S Jin, A Jayasimha, Y Han, J Kim, J Park, ... | Interspeech, 3371-3375, 2020 | 17 | 2020 |
Quasi-closed phase forward-backward linear prediction analysis of speech for accurate formant detection and estimation | D Gowda, M Airaksinen, P Alku | The Journal of the Acoustical Society of America 142 (3), 1542-1553, 2017 | 16 | 2017 |
The Simple4All entry to the Blizzard Challenge 2014 | A Suni, T Raitio, D Gowda, R Karhila, M Gibson, O Watts | Proc. Blizzard Challenge, 2014 | 15 | 2014 |
Signal processing for excitation-based analysis of acoustic events in speech | N Dhananjaya | PhD thesis, Dept. of Computer Science and Engineering, IIT Madras, Chennai, 2011 | 15 | 2011 |
Acoustic-phonetic information from excitation source for refining manner hypotheses of a phone recognizer | N Dhananjaya, B Yegnanarayana, VG Suryakanth | 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 15 | 2011 |
Video shot segmentation using late fusion technique | CK Mohan, N Dhananjaya, B Yegnanarayana | 2008 Seventh International Conference on Machine Learning and Applications …, 2008 | 15 | 2008 |
Speaker change detection in casual conversations using excitation source features | N Dhananjaya, B Yegnanarayana | Speech Communication 50 (2), 153-161, 2008 | 15 | 2008 |