Intelligent personal assistants: A systematic literature review
Abstract Natural Language Interfaces allow human-computer interaction through the
translation of human intention into devices' control commands, analyzing the user's speech …
translation of human intention into devices' control commands, analyzing the user's speech …
SpeechBrain: A general-purpose speech toolkit
SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …
research and development of neural speech processing technologies by being simple …
From feedforward to recurrent LSTM neural networks for language modeling
Language models have traditionally been estimated based on relative frequencies, using
count statistics that can be extracted from huge amounts of text data. More recently, it has …
count statistics that can be extracted from huge amounts of text data. More recently, it has …
Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers
This work presents a statistical recognition approach performing large vocabulary
continuous sign language recognition across different signers. Automatic sign language …
continuous sign language recognition across different signers. Automatic sign language …
The pytorch-kaldi speech recognition toolkit
The availability of open-source software is playing a remarkable role in the popularization of
speech recognition and deep learning. Kaldi, for instance, is nowadays an established …
speech recognition and deep learning. Kaldi, for instance, is nowadays an established …
Deep hand: How to train a cnn on 1 million hand images when your data is continuous and weakly labelled
This work presents a new approach to learning a frame-based classifier on weakly labelled
sequence data by embedding a CNN within an iterative EM algorithm. This allows the CNN …
sequence data by embedding a CNN within an iterative EM algorithm. This allows the CNN …
Re-sign: Re-aligned end-to-end sequence modelling with deep recurrent CNN-HMMs
This work presents an iterative re-alignment approach applicable to visual sequence
labelling tasks such as gesture recognition, activity recognition and continuous sign …
labelling tasks such as gesture recognition, activity recognition and continuous sign …
Handwriting recognition with large multidimensional long short-term memory recurrent neural networks
Multidimensional long short-term memory recurrent neural networks achieve impressive
results for handwriting recognition. However, with current CPU-based implementations, their …
results for handwriting recognition. However, with current CPU-based implementations, their …
[PDF][PDF] Deep Sign: Hybrid CNN-HMM for Continuous Sign Language Recognition.
This paper introduces the end-to-end embedding of a CNN into a HMM, while interpreting
the outputs of the CNN in a Bayesian fashion. The hybrid CNN-HMM combines the strong …
the outputs of the CNN in a Bayesian fashion. The hybrid CNN-HMM combines the strong …
Deep sign: Enabling robust statistical continuous sign language recognition via hybrid CNN-HMMs
This manuscript introduces the end-to-end embedding of a CNN into a HMM, while
interpreting the outputs of the CNN in a Bayesian framework. The hybrid CNN-HMM …
interpreting the outputs of the CNN in a Bayesian framework. The hybrid CNN-HMM …