Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE Access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Spoken language recognition: from fundamentals to practice

H Li, B Ma, KA Lee - Proceedings of the IEEE, 2013 - ieeexplore.ieee.org
Spoken language recognition refers to the automatic process through which we determine
or verify the identity of the language spoken in a speech sample. We study a computational …

Ensemble deep learning in speech signal tasks: a review

M Tanveer, A Rastogi, V Paliwal, MA Ganaie, AK Malik… - Neurocomputing, 2023 - Elsevier
Machine learning methods are extensively used for processing and analysing
speech signals by virtue of their performance gains over multiple domains. Deep learning …

Method and system for efficient spoken term detection using confusion networks

BED Kingsbury, HK Kuo, L Mangu, H Soltau - US Patent 9,196,243, 2015 - Google Patents
US9196243B2 - Method and system for efficient spoken term detection using confusion
networks - Google Patents

Query-by-example spoken term detection using phonetic posteriorgram templates

TJ Hazen, W Shen, C White - 2009 IEEE Workshop on …, 2009 - ieeexplore.ieee.org
This paper examines a query-by-example approach to spoken term detection in audio files.
The approach is designed for low-resource situations in which limited or no in-domain …

Spoken content retrieval—beyond cascading speech recognition with text retrieval

L Lee, J Glass, H Lee, C Chan - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
Spoken content retrieval refers to directly indexing and retrieving spoken content based on
the audio rather than text descriptions. This potentially eliminates the requirement of …

Current challenges and future directions in podcast information access

R Jones, H Zamani, M Schedl, CW Chen… - Proceedings of the 44th …, 2021 - dl.acm.org
Podcasts are spoken documents across a wide range of genres and styles, with growing
listenership across the world, and a rapidly lowering barrier to entry for both listeners and …

Spoken language identification system using convolutional recurrent neural network

AA Alashban, MA Qamhan, AH Meftah, YA Alotaibi - Applied Sciences, 2022 - mdpi.com
Following recent advancements in deep learning and artificial intelligence, spoken
language identification applications are playing an increasingly significant role in our day-to …

Hybrid feature selection method based on harmony search and naked mole-rat algorithms for spoken language identification from audio signals

S Guha, A Das, PK Singh, A Ahmadian, N Senu… - IEEE …, 2020 - ieeexplore.ieee.org
This era is dominated by artificial intelligence and its various applications, one of which is
Spoken Language Identification (S-LID) which has always been a challenging issue and an …

Spoken content retrieval: A survey of techniques and technologies

M Larson, GJF Jones - Foundations and Trends® in …, 2012 - nowpublishers.com
Speech media, that is, digital audio and video containing spoken content, has blossomed in
recent years. Large collections are accruing on the Internet as well as in private and …