[HTML][HTML] Automatic Speech Recognition: A survey of deep learning techniques and approaches

H Ahlawat, N Aggarwal, D Gupta - International Journal of Cognitive …, 2025 - Elsevier
Significant research has been conducted during the last decade on the application of
machine learning for speech processing, particularly speech recognition. However, in recent …

[HTML][HTML] Recent advancements and challenges of nlp-based sentiment analysis: A state-of-the-art review

JR Jim, MAR Talukder, P Malakar, MM Kabir… - Natural Language …, 2024 - Elsevier
Sentiment analysis is a method within natural language processing that evaluates and
identifies the emotional tone or mood conveyed in textual data. Scrutinizing words and …

Kid-whisper: Towards bridging the performance gap in automatic speech recognition for children vs. adults

AA Attia, J Liu, W Ai, D Demszky… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Abstract Recent advancements in Automatic Speech Recognition (ASR) systems,
exemplified by Whisper, have demonstrated the potential of these systems to approach …

Deep representation learning: Fundamentals, technologies, applications, and open challenges

A Payandeh, KT Baghaei, P Fayyazsanavi… - IEEE …, 2023 - ieeexplore.ieee.org
Machine learning algorithms have had a profound impact on the field of computer science
over the past few decades. The performance of these algorithms heavily depends on the …

Harnessing the power of Wav2Vec2 and CNNs for Robust Speaker Identification on the VoxCeleb and LibriSpeech Datasets

OH Anidjar, R Marbel, R Yozevitch - Expert Systems with Applications, 2024 - Elsevier
Speaker identification, a cornerstone of speech processing, involves associating individuals
with spoken segments within a known speaker pool. This paper presents a significant AI …

Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers

T Reitmaier, E Wallington, O Klejch, N Markl… - Proceedings of the …, 2023 - dl.acm.org
In this paper we develop approaches to automatic speech recognition (ASR) development
that suit the needs and functions of under-heard language speakers. Our novel contribution …

A parameter-efficient learning approach to arabic dialect identification with pre-trained general-purpose speech model

S Radhakrishnan, CHH Yang, SA Khan… - arxiv preprint arxiv …, 2023 - arxiv.org
In this work, we explore Parameter-Efficient-Learning (PEL) techniques to repurpose a
General-Purpose-Speech (GSM) model for Arabic dialect identification (ADI). Specifically …

Pqlm-multilingual decentralized portable quantum language model

SS Li, X Zhang, S Zhou, H Shu, R Liang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
With careful manipulation, malicious agents can reverse engineer private information
encoded in pre-trained language models. Security concerns motivate the development of …

Deep representation learning: Fundamentals, perspectives, applications, and open challenges

KT Baghaei, A Payandeh, P Fayyazsanavi… - arxiv preprint arxiv …, 2022 - arxiv.org
Machine Learning algorithms have had a profound impact on the field of computer science
over the past few decades. These algorithms performance is greatly influenced by the …

A Systematic Literature Review of Human-Centered, Ethical, and Responsible AI

M Tahaei, M Constantinides, D Quercia… - arxiv preprint arxiv …, 2023 - arxiv.org
As Artificial Intelligence (AI) continues to advance rapidly, it becomes increasingly important
to consider AI's ethical and societal implications. In this paper, we present a bottom-up …