Machine learning paradigms for speech recognition: An overview

L Deng, X Li - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) has historically been a driving force behind many
machine learning (ML) techniques, including the ubiquitously used hidden Markov model …

Speech recognition by machine, a review

MA Anusuya, SK Katti - arxiv preprint arxiv:1001.2267, 2010 - arxiv.org
This paper presents a brief survey on Automatic Speech Recognition and discusses the
major themes and advances made in the past 60 years of research, so as to provide a …

Plex: Towards reliability using pretrained large model extensions

D Tran, J Liu, MW Dusenberry, D Phan… - arxiv preprint arxiv …, 2022 - arxiv.org
A recent trend in artificial intelligence is the use of pretrained models for language and
vision tasks, which have achieved extraordinary performance but also puzzling failures …

Towards natural language-based visualization authoring

Y Wang, Z Hou, L Shen, T Wu, J Wang… - … on Visualization and …, 2022 - ieeexplore.ieee.org
A key challenge to visualization authoring is the process of getting familiar with the complex
user interfaces of authoring tools. Natural Language Interface (NLI) presents promising …

Active learning from imbalanced data: A solution of online weighted extreme learning machine

H Yu, X Yang, S Zheng, C Sun - IEEE transactions on neural …, 2018 - ieeexplore.ieee.org
It is well known that active learning can simultaneously improve the quality of the
classification model and decrease the complexity of training instances. However, several …

Anomalous sound event detection: A survey of machine learning based methods and applications

Z Mnasri, S Rovetta, F Masulli - Multimedia Tools and Applications, 2022 - Springer
With the development of multi-modal man-machine interaction, audio signal analysis is
gaining importance in a field traditionally dominated by video. In particular, anomalous …

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arxiv preprint arxiv …, 2020 - arxiv.org
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

Active learning through density clustering

M Wang, F Min, ZH Zhang, YX Wu - Expert systems with applications, 2017 - Elsevier
Active learning is used for classification when labeling data are costly, while the main
challenge is to identify the critical instances that should be labeled. Clustering-based …

Spoken language understanding: A survey

R De Mori - 2007 IEEE Workshop on Automatic Speech …, 2007 - ieeexplore.ieee.org
A survey of research on spoken language understanding is presented. It covers aspects of
knowledge representation, automatic interpretation strategies, semantic grammars …

Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion

D Yu, B Varadarajan, L Deng, A Acero - Computer Speech & Language, 2010 - Elsevier
We propose a unified global entropy reduction maximization (GERM) framework for active
learning and semi-supervised learning for speech recognition. Active learning aims to select …