Intelligent personal assistants: A systematic literature review

A de Barcelos Silva, MM Gomes, CA da Costa… - Expert Systems with …, 2020 - Elsevier
Abstract Natural Language Interfaces allow human-computer interaction through the
translation of human intention into devices' control commands, analyzing the user's speech …

SpeechBrain: A general-purpose speech toolkit

M Ravanelli, T Parcollet, P Plantinga, A Rouhe… - arxiv preprint arxiv …, 2021 - arxiv.org
SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …

From feedforward to recurrent LSTM neural networks for language modeling

M Sundermeyer, H Ney… - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
Language models have traditionally been estimated based on relative frequencies, using
count statistics that can be extracted from huge amounts of text data. More recently, it has …

Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers

O Koller, J Forster, H Ney - Computer Vision and Image Understanding, 2015 - Elsevier
This work presents a statistical recognition approach performing large vocabulary
continuous sign language recognition across different signers. Automatic sign language …

The pytorch-kaldi speech recognition toolkit

M Ravanelli, T Parcollet… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
The availability of open-source software is playing a remarkable role in the popularization of
speech recognition and deep learning. Kaldi, for instance, is nowadays an established …

Deep hand: How to train a cnn on 1 million hand images when your data is continuous and weakly labelled

O Koller, H Ney, R Bowden - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
This work presents a new approach to learning a frame-based classifier on weakly labelled
sequence data by embedding a CNN within an iterative EM algorithm. This allows the CNN …

Re-sign: Re-aligned end-to-end sequence modelling with deep recurrent CNN-HMMs

O Koller, S Zargaran, H Ney - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
This work presents an iterative re-alignment approach applicable to visual sequence
labelling tasks such as gesture recognition, activity recognition and continuous sign …

Handwriting recognition with large multidimensional long short-term memory recurrent neural networks

P Voigtlaender, P Doetsch… - 2016 15th international …, 2016 - ieeexplore.ieee.org
Multidimensional long short-term memory recurrent neural networks achieve impressive
results for handwriting recognition. However, with current CPU-based implementations, their …

[PDF][PDF] Deep Sign: Hybrid CNN-HMM for Continuous Sign Language Recognition.

O Koller, S Zargaran, H Ney, R Bowden - BMVC, 2016 - openresearch.surrey.ac.uk
This paper introduces the end-to-end embedding of a CNN into a HMM, while interpreting
the outputs of the CNN in a Bayesian fashion. The hybrid CNN-HMM combines the strong …

Deep sign: Enabling robust statistical continuous sign language recognition via hybrid CNN-HMMs

O Koller, S Zargaran, H Ney, R Bowden - International Journal of …, 2018 - Springer
This manuscript introduces the end-to-end embedding of a CNN into a HMM, while
interpreting the outputs of the CNN in a Bayesian framework. The hybrid CNN-HMM …