Classical and deep learning data processing techniques for speech and speaker recognitions

A Mittal, M Dua, S Dua - Deep learning approaches for spoken and …, 2021 - Springer
Both information of context and speaker-specific information are involved in the speech
utterance in the form of its features; hence, speech is noticed as a potential human …

A fractal approach to characterize emotions in audio and visual domain: A study on cross-modal interaction

S Nag, U Sarkar, S Sanyal, A Banerjee, S Roy… - arxiv preprint arxiv …, 2021 - arxiv.org
It is already known that both auditory and visual stimulus is able to convey emotions in
human mind to different extent. The strength or intensity of the emotional arousal vary …

Language Independent Emotion Quantification using Non linear Modelling of Speech

U Sarkar, S Nag, C Bhattacharya, S Sanyal… - arxiv preprint arxiv …, 2021 - arxiv.org
At present emotion extraction from speech is a very important issue due to its diverse
applications. Hence, it becomes absolutely necessary to obtain models that take into …