Learning to forget: Continual prediction with LSTM

FA Gers, J Schmidhuber, F Cummins - Neural computation, 2000 - ieeexplore.ieee.org
Long short-term memory (LSTM; Hochreiter & Schmidhuber, 1997) can solve numerous
tasks not solvable by previous learning algorithms for recurrent neural networks (RNNs). We …

[PDF][PDF] Gradient flow in recurrent nets: the difficulty of learning long-term dependencies

S Hochreiter, Y Bengio, P Frasconi, J Schmidhuber - 2001 - researchgate.net
Recurrent networks (crossreference Chapter 12) can, in principle, use their feedback
connections to store representations of recent input events in the form of activations. The …

Learning precise timing with LSTM recurrent networks

FA Gers, NN Schraudolph, J Schmidhuber - Journal of machine learning …, 2002 - jmlr.org
The temporal distance between events conveys information essential for numerous
sequential tasks such as motor control and rhythm detection. While Hidden Markov Models …

Sign language recognition from digital videos using feature pyramid network with detection transformer

Y Liu, P Nand, MA Hossain, M Nguyen… - Multimedia Tools and …, 2023 - Springer
Sign language recognition is one of the fundamental ways to assist deaf people to
communicate with others. An accurate vision-based sign language recognition system using …

[КНИГА][B] Multilingual speech processing

T Schultz, K Kirchhoff - 2006 - books.google.com
Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech
processing from a multilingual perspective. By taking this all-inclusive approach to speech …

Video dynamics detection using deep neural networks

K Zheng, WQ Yan, P Nand - IEEE Transactions on Emerging …, 2017 - ieeexplore.ieee.org
In recent years, deep neural networks (DNNs) have achieved a remarkable progression in
solving many complex problems. DNNs are suitable for dealing with the problems related to …

The rhythms of rhythm

D Gibbon - Journal of the International Phonetic Association, 2023 - cambridge.org
The low frequency (LF) spectral analysis or 'rhythm spectrum'approach to the quantitative
analysis and comparison of speech rhythms is extended beyond syllable or word rhythms to …

Rhythmic unit extraction and modelling for automatic language identification

JL Rouas, J Farinas, F Pellegrino… - Speech …, 2005 - Elsevier
This paper deals with an approach to automatic language identification based on rhythmic
modelling. Beside phonetics and phonotactics, rhythm is actually one of the most promising …

Language identification using pitch contour information

CY Lin, HC Wang - Proceedings.(ICASSP'05). IEEE …, 2005 - ieeexplore.ieee.org
An approach to automatic language identification (LID) using pitch contour information is
proposed. A segment of pitch contour is approximated by a set of Legendre polynomials so …

The future of prosody: it's about time

D Gibbon - arxiv preprint arxiv:1804.09543, 2018 - arxiv.org
Prosody is usually defined in terms of the three distinct but interacting domains of pitch,
intensity and duration patterning, or, more generally, as phonological and phonetic …