Acoustic Modeling for Google Home. B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ... Interspeech, 399-403, 2017 | 208 | 2017 |
Recurrent neural aligner: An encoder-decoder neural network model for sequence to sequence mapping. H Sak, M Shannon, K Rao, F Beaufays Interspeech 8, 1298-1302, 2017 | 154 | 2017 |
Location-relative attention mechanisms for robust long-form speech synthesis E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 135 | 2020 |
Autoregressive models for statistical parametric speech synthesis M Shannon, H Zen, W Byrne IEEE transactions on audio, speech, and language processing 21 (3), 587-597, 2012 | 85 | 2012 |
Optimizing expected word error rate via sampling for speech recognition M Shannon arXiv preprint arXiv:1706.02776, 2017 | 64 | 2017 |
Semi-supervised generative modeling for controllable speech synthesis R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ... arXiv preprint arXiv:1910.01709, 2019 | 61 | 2019 |
Effective use of variational embedding capacity in expressive end-to-end speech synthesis E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ... arXiv preprint arXiv:1906.03402, 2019 | 59 | 2019 |
Improved End-of-Query Detection for Streaming Speech Recognition. M Shannon, G Simko, SY Chang, C Parada Interspeech, 1909-1913, 2017 | 57 | 2017 |
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech GE Henter, T Merritt, M Shannon, C Mayo, S King INTERSPEECH 2014 15th Annual Conference of the International Speech …, 2014 | 52 | 2014 |
Speaker generation D Stanton, M Shannon, S Mariooryad, RJ Skerry-Ryan, E Battenberg, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 36 | 2022 |
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project M Wester, J Dines, M Gibson, H Liang, YJ Wu, L Saheer, S King, K Oura, ... Proc. of 7th ISCA Speech Synthesis Workshop, 2010 | 27 | 2010 |
Non-saturating GAN training as divergence minimization M Shannon, B Poole, S Mariooryad, T Bagby, E Battenberg, D Kao, ... arXiv preprint arXiv:2010.08029, 2020 | 21 | 2020 |
The effect of using normalized models in statistical speech synthesis M Shannon, H Zen, W Byrne ISCA (International Speech Communication Association), 2011 | 19 | 2011 |
Personalising speech-to-speech translation in the EMIME project M Kurimo, B Byrne, J Dines, PN Garner, M Gibson, Y Guan, T Hirsimäki, ... Proceedings of the ACL 2010 System Demonstrations, 48-53, 2010 | 17 | 2010 |
Fast, low-artifact speech synthesis considering global variance M Shannon, W Byrne 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 16 | 2013 |
Autoregressive HMMs for speech synthesis SM Shannon, WJ Byrne ISCA (International Speech Communication Association), 2009 | 16 | 2009 |
A formulation of the autoregressive HMM for speech synthesis M Shannon, W Byrne Department of Engineering, University of Cambridge, 2009 | 13 | 2009 |
Global normalization for streaming speech recognition in a modular framework E Variani, K Wu, MD Riley, D Rybach, M Shannon, C Allauzen Advances in Neural Information Processing Systems 35, 4257-4269, 2022 | 9 | 2022 |
Autoregressive clustering for HMM speech synthesis M Shannon, W Byrne Proc. Interspeech, 2011 | 8 | 2011 |
Properties of f-divergences and f-GAN training M Shannon arXiv preprint arXiv:2009.00757, 2020 | 6 | 2020 |