A review on five recent and near-future developments in computational processing of emotion in the human voice

DM Schuller, BW Schuller - Emotion Review, 2021 - journals.sagepub.com
We provide a short review on the recent and near-future developments of computational
processing of emotion in the voice, highlighting (a) self-learning of representations moving …

Deep speaker conditioning for speech emotion recognition

A Triantafyllopoulos, S Liu… - 2021 IEEE international …, 2021 - ieeexplore.ieee.org
In this work, we explore the use of speaker conditioning sub-networks for speaker
adaptation in a deep neural network (DNN) based speech emotion recognition (SER) …

[PDF][PDF] Towards robust speech emotion recognition using deep residual networks for speech enhancement

A Triantafyllopoulos, G Keren, J Wagner, I Steiner… - 2019 - opus.bibliothek.uni-augsburg.de
The use of deep learning (DL) architectures for speech enhancement has recently improved
the robustness of voice applications under diverse noise conditions. These improvements …

Dual-stream noise and speech information perception based speech enhancement

N Li, L Wang, Q Zhang, J Dang - Expert Systems with Applications, 2025 - Elsevier
In real-world scenarios, dynamic ambient noise often degrades speech quality, highlighting
the need for advanced speech enhancement techniques. Traditional methods, which rely on …

Multistage linguistic conditioning of convolutional layers for speech emotion recognition

A Triantafyllopoulos, U Reichel, S Liu… - Frontiers in Computer …, 2023 - frontiersin.org
Introduction The effective fusion of text and audio information for categorical and
dimensional speech emotion recognition (SER) remains an open issue, especially given the …

N-HANS: A neural network-based toolkit for in-the-wild audio enhancement

S Liu, G Keren, E Parada-Cabaleiro… - Multimedia Tools and …, 2021 - Springer
The unprecedented growth of noise pollution over the last decades has raised an always
increasing need for develo** efficient audio enhancement technologies. Yet, the variety of …

Neural noise embedding for end-to-end speech enhancement with conditional layer normalization

Z Zhang, X Li, Y Li, Y Dong, D Wang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Most of the deep learning based speech enhancement methods focus on the modeling of
complicated relationship between the noisy speech and the clean speech without the …

DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids

I Tsangko, A Triantafyllopoulos, M Müller… - arxiv preprint arxiv …, 2025 - arxiv.org
The DeepFilterNet (DFN) architecture was recently proposed as a deep learning model
suited for hearing aid devices. Despite its competitive performance on numerous …

Hierarchical component-attention based speaker turn embedding for emotion recognition

S Liu, J Jiao, Z Zhao, J Dineley… - … Joint Conference on …, 2020 - ieeexplore.ieee.org
Traditional discrete-time Speech Emotion Recognition (SER) modelling techniques typically
assume that an entire speaker chunk or turn is indicative of its corresponding label. An …

[PDF][PDF] ESTIMADOR DE RELACIÓN SEÑAL A RUIDO USANDO REDES NEURONALES PARA RECONOCEDORES AUTOMÁTICOS DE VOZ EN USO …

FJ Bautista - Jornadas de Acústica, Audio y Sonido, UNTREF, 2023 - researchgate.net
Se desarrolla un estimador de relación señal a ruido (SNR) con el fin de determinar la
calidad de registros sonoros de habla que ingresen a sistemas de reconocimiento …