Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges
Speech is a powerful medium of communication that always convey rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …
information, such as gender, accent, and other unique characteristics of a speaker. These …
Speaker Recognition through Deep Learning Techniques: A Comprehensive Review and Research Challenges
Deep learning has now become an integral part of today's world and advancement in the
field of deep learning has gained a huge development. Due to the extensive use and fast …
field of deep learning has gained a huge development. Due to the extensive use and fast …
[BOOK][B] Emotion recognition using speech features
KS Rao, SG Koolagudi - 2012 - books.google.com
“Emotion Recognition Using Speech Features” provides coverage of emotion-specific
features present in speech. The author also discusses suitable models for capturing emotion …
features present in speech. The author also discusses suitable models for capturing emotion …
Disentanglement for audio-visual emotion recognition using multitask setup
Deep learning models trained on audio-visual data have been successfully used to achieve
state-of-the-art performance for emotion recognition. In particular, models trained with …
state-of-the-art performance for emotion recognition. In particular, models trained with …
A study of speaker verification performance with expressive speech
Expressive speech introduces variations in the acoustic features affecting the performance
of speech technology such as speaker verification systems. It is important to identify the …
of speech technology such as speaker verification systems. It is important to identify the …
Building a speech recognition system with privacy identification information based on Google Voice for social robots
Currently, many smart speakers, even social robots, appear on the market to help people's
lives become more convenient. Usually, people use smart speakers to check their daily …
lives become more convenient. Usually, people use smart speakers to check their daily …
Emotional voice conversion using a hybrid framework with speaker-adaptive DNN and particle-swarm-optimized neural network
We propose a hybrid network-based learning framework for speaker-adaptive vocal emotion
conversion, tested on three different datasets (languages), namely, EmoDB (German) …
conversion, tested on three different datasets (languages), namely, EmoDB (German) …
Speaker-independent expressive voice synthesis using learning-based hybrid network model
Emotional voice conversion systems are used to formulate map** functions to transform
the neutral speech from output of text-to-speech systems to that of target emotion …
the neutral speech from output of text-to-speech systems to that of target emotion …
Predicting speaker recognition reliability by considering emotional content
Studies have shown that emotional variability in speech degrades the performance of
speaker recognition tasks. Of particular interest is the error produced due to mismatch …
speaker recognition tasks. Of particular interest is the error produced due to mismatch …
Hybrid framework for speaker-independent emotion conversion using i-vector PLDA and neural network
Expressive speech can be synthesized using acoustic feature modeling by map** the
spectral and fundamental frequency parameters between neutral speech and target …
spectral and fundamental frequency parameters between neutral speech and target …