Ensemble deep learning in speech signal tasks: a review

M Tanveer, A Rastogi, V Paliwal, MA Ganaie, AK Malik… - Neurocomputing, 2023 - Elsevier
Abstract Machine learning methods are extensively used for processing and analysing
speech signals by virtue of their performance gains over multiple domains. Deep learning …

[PDF][PDF] A review on voice-based interface for human-robot interaction

AA Badr, AK Abdul-Hassan - Iraqi Journal for Electrical and Electronic …, 2020 - iasj.net
With the recent developments of technology and the advances in artificial intelligence and
machine learning techniques, it has become possible for the robot to understand and …

Paralinguistics in speech and language—state-of-the-art and the challenge

B Schuller, S Steidl, A Batliner, F Burkhardt… - Computer Speech & …, 2013 - Elsevier
Paralinguistic analysis is increasingly turning into a mainstream topic in speech and
language processing. This article aims to provide a broad overview of the constantly …

Voice-based age, gender, and language recognition based on ResNet deep model and transfer learning in spectro-temporal domain

S Mavaddati - Neurocomputing, 2024 - Elsevier
In personal identity recognition systems, detecting a person's age, gender, and language
using voice signal characteristics is a crucial issue, especially because of the importance of …

Automatic speaker age and gender recognition using acoustic and prosodic level information fusion

M Li, KJ Han, S Narayanan - Computer Speech & Language, 2013 - Elsevier
The paper presents a novel automatic speaker age and gender identification approach
which combines seven different methods at both acoustic and prosodic levels to improve the …

Minimalistic CNN-based ensemble model for gender prediction from face images

G Antipov, SA Berrani, JL Dugelay - Pattern recognition letters, 2016 - Elsevier
Despite being extensively studied in the literature, the problem of gender recognition from
face images remains difficult when dealing with unconstrained images in a cross-dataset …

Age group classification and gender recognition from speech with temporal convolutional neural networks

HA Sánchez-Hevia, R Gil-Pita, M Utrilla-Manso… - Multimedia Tools and …, 2022 - Springer
This paper analyses the performance of different types of Deep Neural Networks to jointly
estimate age and identify gender from speech, to be applied in Interactive Voice Response …

An effective gender recognition approach using voice data via deeper LSTM networks

F Ertam - Applied Acoustics, 2019 - Elsevier
It is not difficult to estimate the gender of the human from other people's audio files. In
general, people can easily identify the gender of the owner of a conversation with the …

Automatic speaker, age-group and gender identification from children's speech

S Safavi, M Russell, P Jančovič - Computer Speech & Language, 2018 - Elsevier
A speech signal contains important paralinguistic information, such as the identity, age,
gender, language, accent, and the emotional state of the speaker. Automatic recognition of …

Deep neural network framework and transformed MFCCs for speaker's age and gender classification

Z Qawaqneh, AA Mallouh, BD Barkana - Knowledge-Based Systems, 2017 - Elsevier
Speaker age and gender classification is one of the most challenging problems in speech
processing. Although many studies have been carried out focusing on feature extraction and …