[PDF][PDF] Recent advances in end-to-end automatic speech recognition

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Contrastive representation distillation

Y Tian, D Krishnan, P Isola - arxiv preprint arxiv:1910.10699, 2019 - arxiv.org
Often we wish to transfer representational knowledge from one neural network to another.
Examples include distilling a large network into a smaller one, transferring knowledge from …

Split computing and early exiting for deep learning applications: Survey and research challenges

Y Matsubara, M Levorato, F Restuccia - ACM Computing Surveys, 2022 - dl.acm.org
Mobile devices such as smartphones and autonomous vehicles increasingly rely on deep
neural networks (DNNs) to execute complex inference tasks such as image classification …

Sequence-level knowledge distillation

Y Kim, AM Rush - arxiv preprint arxiv:1606.07947, 2016 - arxiv.org
Neural machine translation (NMT) offers a novel alternative formulation of translation that is
potentially simpler than statistical approaches. However to reach competitive performance …

Emotion recognition in speech using cross-modal transfer in the wild

S Albanie, A Nagrani, A Vedaldi… - Proceedings of the 26th …, 2018 - dl.acm.org
Obtaining large, human labelled speech datasets to train models for emotion recognition is a
notoriously challenging task, hindered by annotation cost and label ambiguity. In this work …

Policy distillation

AA Rusu, SG Colmenarejo, C Gulcehre… - arxiv preprint arxiv …, 2015 - arxiv.org
Policies for complex visual tasks have been successfully learned with deep reinforcement
learning, using an approach called deep Q-networks (DQN), but relatively large (task …

Interpretable deep models for ICU outcome prediction

Z Che, S Purushotham, R Khemani… - AMIA annual symposium …, 2017 - pmc.ncbi.nlm.nih.gov
Exponential surge in health care data, such as longitudinal data from electronic health
records (EHR), sensor data from intensive care unit (ICU), etc., is providing new …

Distillation-based training for multi-exit architectures

M Phuong, CH Lampert - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Multi-exit architectures, in which a stack of processing layers is interleaved with early output
layers, allow the processing of a test example to stop early and thus save computation time …

Deep learning based regression for optically inactive inland water quality parameter estimation using airborne hyperspectral imagery

C Niu, K Tan, X Jia, X Wang - Environmental Pollution, 2021 - Elsevier
Airborne hyperspectral remote sensing has the characteristics of high spatial and spectral
resolutions, and provides an opportunity for accurate and efficient inland water qauality …