An empirical survey of data augmentation for time series classification with neural networks
In recent times, deep artificial neural networks have achieved many successes in pattern
recognition. Part of this success can be attributed to the reliance on big data to increase …
recognition. Part of this success can be attributed to the reliance on big data to increase …
Towards end-to-end speech recognition with recurrent neural networks
This paper presents a speech recognition system that directly transcribes audio data with
text, without requiring an intermediate phonetic representation. The system is based on a …
text, without requiring an intermediate phonetic representation. The system is based on a …
A review of recent advances in visual speech decoding
Visual speech information plays an important role in automatic speech recognition (ASR)
especially when audio is corrupted or even inaccessible. Despite the success of audio …
especially when audio is corrupted or even inaccessible. Despite the success of audio …
Data augmentation for deep neural network acoustic modeling
This paper investigates data augmentation for deep neural network acoustic modeling
based on label-preserving transformations to deal with data sparsity. Two data …
based on label-preserving transformations to deal with data sparsity. Two data …
Large-vocabulary continuous speech recognition systems: A look at some recent advances
Over the past decade or so, several advances have been made to the design of modern
large vocabulary continuous speech recognition (LVCSR) systems to the point where their …
large vocabulary continuous speech recognition (LVCSR) systems to the point where their …
[PDF][PDF] Vocal tract length perturbation (VTLP) improves speech recognition
Augmenting datasets by transforming inputs in a way that does not change the label is a
crucial ingredient of the state of the art methods for object recognition using neural networks …
crucial ingredient of the state of the art methods for object recognition using neural networks …
[BOOK][B] Speech synthesis and recognition
W Holmes - 2002 - taylorfrancis.com
With the growing impact of information technology on daily life, speech is becoming
increasingly important for providing a natural means of communication between humans …
increasingly important for providing a natural means of communication between humans …
Lung sounds classification using convolutional neural networks
D Bardou, K Zhang, SM Ahmad - Artificial intelligence in medicine, 2018 - Elsevier
Lung sounds convey relevant information related to pulmonary disorders, and to evaluate
patients with pulmonary conditions, the physician or the doctor uses the traditional …
patients with pulmonary conditions, the physician or the doctor uses the traditional …
Child speech recognition in human-robot interaction: evaluations and recommendations
An increasing number of human-robot interaction (HRI) studies are now taking place in
applied settings with children. These interactions often hinge on verbal interaction to …
applied settings with children. These interactions often hinge on verbal interaction to …
Speaker anonymisation using the McAdams coefficient
Anonymisation has the goal of manipulating speech signals in order to degrade the
reliability of automatic approaches to speaker recognition, while preserving other aspects of …
reliability of automatic approaches to speaker recognition, while preserving other aspects of …