Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges

R Jahangir, YW Teh, HF Nweke, G Mujtaba… - Expert Systems with …, 2021 - Elsevier
Speech is a powerful medium of communication that always conveys rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …

Speaker Recognition through Deep Learning Techniques: A Comprehensive Review and Research Challenges

N Shome, A Sarkar, AK Ghosh, RH Laskar… - … and Computer Science, 2023 - pp.bme.hu
Deep learning has now become an integral part of today's world and advancement in the
field of deep learning has gained a huge development. Due to the extensive use and fast …

[BOOK] Emotion recognition using speech features

KS Rao, SG Koolagudi - 2012 - books.google.com
“Emotion Recognition Using Speech Features” provides coverage of emotion-specific
features present in speech. The author also discusses suitable models for capturing emotion …

Disentanglement for audio-visual emotion recognition using multitask setup

R Peri, S Parthasarathy, C Bradshaw… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Deep learning models trained on audio-visual data have been successfully used to achieve
state-of-the-art performance for emotion recognition. In particular, models trained with …

A study of speaker verification performance with expressive speech

S Parthasarathy, C Zhang… - … on Acoustics, Speech …, 2017 - ieeexplore.ieee.org
Expressive speech introduces variations in the acoustic features affecting the performance
of speech technology such as speaker verification systems. It is important to identify the …

Building a speech recognition system with privacy identification information based on Google Voice for social robots

PC Lin, B Yankson, V Chauhan, M Tsukada - The Journal of …, 2022 - Springer
Currently, many smart speakers, even social robots, appear on the market to help people's
lives become more convenient. Usually, people use smart speakers to check their daily …

Emotional voice conversion using a hybrid framework with speaker-adaptive DNN and particle-swarm-optimized neural network

S Vekkot, D Gupta, M Zakariah, YA Alotaibi - IEEE Access, 2020 - ieeexplore.ieee.org
We propose a hybrid network-based learning framework for speaker-adaptive vocal emotion
conversion, tested on three different datasets (languages), namely, EmoDB (German) …

Speaker-independent expressive voice synthesis using learning-based hybrid network model

S Vekkot, D Gupta - International Journal of Speech Technology, 2020 - Springer
Emotional voice conversion systems are used to formulate mapping functions to transform
the neutral speech from output of text-to-speech systems to that of target emotion …

Predicting speaker recognition reliability by considering emotional content

S Parthasarathy, C Busso - 2017 seventh international …, 2017 - ieeexplore.ieee.org
Studies have shown that emotional variability in speech degrades the performance of
speaker recognition tasks. Of particular interest is the error produced due to mismatch …

Hybrid framework for speaker-independent emotion conversion using i-vector PLDA and neural network

S Vekkot, D Gupta, M Zakariah, YA Alotaibi - IEEE Access, 2019 - ieeexplore.ieee.org
Expressive speech can be synthesized using acoustic feature modeling by mapping the
spectral and fundamental frequency parameters between neutral speech and target …