Social signal processing: Survey of an emerging domain

A Vinciarelli, M Pantic, H Bourlard - Image and vision computing, 2009 - Elsevier
The ability to understand and manage social signals of a person we are communicating with
is the core of social intelligence. Social intelligence is a facet of human intelligence that has …

Speaker segmentation and clustering

M Kotti, V Moschou, C Kotropoulos - Signal processing, 2008 - Elsevier
This survey focuses on two challenging speech processing topics, namely: speaker
segmentation and speaker clustering. Speaker segmentation aims at finding speaker …

Classification of audio signals using SVM and RBFNN

P Dhanalakshmi, S Palanivel, V Ramalingam - Expert systems with …, 2009 - Elsevier
In the age of digital information, audio data has become an important part in many modern
computer applications. Audio classification has been becoming a focus in the research of …

Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

P Gimeno, I Viñals, A Ortega, A Miguel… - EURASIP Journal on …, 2020 - Springer
This paper presents a new approach based on recurrent neural networks (RNN) to the
multiclass audio segmentation task whose goal is to classify an audio signal as speech …

Classification of audio signals using AANN and GMM

P Dhanalakshmi, S Palanivel, V Ramalingam - Applied soft computing, 2011 - Elsevier
Today, digital audio applications are part of our everyday lives. Audio classification can
provide powerful tools for content management. If an audio clip automatically can be …

[PDF][PDF] A decision-tree-based algorithm for speech/music classification and segmentation

Y Lavner, D Ruinskiy - EURASIP Journal on Audio, Speech, and Music …, 2009 - Springer
We present an efficient algorithm for segmentation of audio signals into speech or music.
The central motivation to our study is consumer audio applications, where various real-time …

Learning sparse dictionaries for music and speech classification

M Srinivas, D Roy, CK Mohan - 2014 19th International …, 2014 - ieeexplore.ieee.org
The field of music and speech classification is quite mature with researchers having settled
on the approximate best discriminative representation. In this regard, Zubair et al. showed …

Speech-music discrimination using deep visual feature extractors

M Papakostas, T Giannakopoulos - Expert Systems with Applications, 2018 - Elsevier
Speech music discrimination is a traditional task in audio analytics, useful for a wide range
of applications, such as automatic speech recognition and radio broadcast monitoring, that …

Audio-based semantic concept classification for consumer video

K Lee, DPW Ellis - IEEE Transactions on Audio, Speech, and …, 2009 - ieeexplore.ieee.org
This paper presents a novel method for automatically classifying consumer video clips
based on their soundtracks. We use a set of 25 overlap** semantic classes, chosen for …

Social signal processing: Understanding social interactions through nonverbal behavior analysis

A Vinciarelli, H Salamin, M Pantic - 2009 IEEE computer society …, 2009 - ieeexplore.ieee.org
This paper introduces social signal processing (SSP), the domain aimed at automatic
understanding of social interactions through analysis of nonverbal behavior. The core idea …