Automatic speech recognition and speech variability: A review

M Benzeghiba, R De Mori, O Deroo, S Dupont… - Speech …, 2007 - Elsevier
Major progress is being recorded regularly on both the technology and exploitation of
automatic speech recognition (ASR) and spoken language systems. However, there are still …

Human language technology: Opportunities and challenges

M Ostendorf, E Shriberg… - Proceedings.(ICASSP'05) …, 2005 - ieeexplore.ieee.org
In recent years, there has been dramatic progress in both speech and language processing,
in many cases leveraging some of the same underlying methods. This progress and the …

Automated generation of 'good enough'transcripts as a first step to transcription of audio-recorded data

C Bokhove, C Downey - Methodological innovations, 2018 - journals.sagepub.com
In the last decade, automated captioning services have appeared in mainstream technology
use. Until now, the focus of these services have been on the technical aspects, supporting …

How might we create better benchmarks for speech recognition?

A Aksënova, D van Esch, J Flynn… - Proceedings of the 1st …, 2021 - aclanthology.org
The applications of automatic speech recognition (ASR) systems are proliferating, in part
due to recent significant quality improvements. However, as recent work indicates, even …

[LLIBRE][B] Multilingual speech processing

T Schultz, K Kirchhoff - 2006 - books.google.com
Tanja Schultz and Katrin Kirchhoff have compiled a comprehensive overview of speech
processing from a multilingual perspective. By taking this all-inclusive approach to speech …

[LLIBRE][B] Multilingual information retrieval: From research to practice

C Peters, M Braschler, P Clough - 2012 - Springer
We are living in a multilingual world and the diversity in languages which are used to
interact with information access systems has generated a wide variety of challenges to be …

Spoken content retrieval: A survey of techniques and technologies

M Larson, GJF Jones - Foundations and Trends® in …, 2012 - nowpublishers.com
Speech media, that is, digital audio and video containing spoken content, has blossomed in
recent years. Large collections are accruing on the Internet as well as in private and …

Exploring capabilities of monolingual audio transformers using large datasets in automatic speech recognition of Czech

J Lehečka, J Švec, A Pražák, JV Psutka - arxiv preprint arxiv:2206.07627, 2022 - arxiv.org
In this paper, we present our progress in pretraining Czech monolingual audio transformers
from a large dataset containing more than 80 thousand hours of unlabeled speech, and …

Speechfind: Advances in spoken document retrieval for a national gallery of the spoken word

JHL Hansen, R Huang, B Zhou… - … on Speech and …, 2005 - ieeexplore.ieee.org
Advances in formulating spoken document retrieval for a new National Gallery of the Spoken
Word (NGSW) are addressed. NGSW is the first large-scale repository of its kind, consisting …

Computational intelligence in processing of speech acoustics: a survey

A Singh, N Kaur, V Kukreja, V Kadyan… - Complex & Intelligent …, 2022 - Springer
Speech recognition of a language is a key area in the field of pattern recognition. This paper
presents a comprehensive survey on the speech recognition techniques for non-Indian and …