Acoustic scene classification: a comprehensive survey

B Ding, T Zhang, C Wang, G Liu, J Liang, R Hu… - Expert Systems with …, 2024 - Elsevier
Acoustic scene classification (ASC) has gained significant interest recently due to its diverse
applications. Various audio signal processing and machine learning methods have been …

Deep convolutional neural networks and data augmentation for environmental sound classification

J Salamon, JP Bello - IEEE Signal Processing Letters, 2017 - ieeexplore.ieee.org
The ability of deep convolutional neural networks (CNNs) to learn discriminative
spectro-temporal patterns makes them well suited to environmental sound classification. However …

ESC: Dataset for environmental sound classification

KJ Piczak - Proceedings of the 23rd ACM international conference …, 2015 - dl.acm.org
One of the obstacles in research activities concentrating on environmental sound
classification is the scarcity of suitable and publicly available datasets. This paper tries to …

TUT database for acoustic scene classification and sound event detection

A Mesaros, T Heittola, T Virtanen - 2016 24th European Signal …, 2016 - ieeexplore.ieee.org
We introduce TUT Acoustic Scenes 2016 database for environmental sound research,
consisting of binaural recordings from 15 different acoustic environments. A subset of this …

Metrics for polyphonic sound event detection

A Mesaros, T Heittola, T Virtanen - Applied Sciences, 2016 - mdpi.com
This paper presents and discusses various metrics proposed for evaluation of polyphonic
sound event detection systems used in realistic situations where there are typically multiple …

Detection and classification of acoustic scenes and events

D Stowell, D Giannoulis, E Benetos… - IEEE Transactions …, 2015 - ieeexplore.ieee.org
For intelligent systems to make best use of the audio modality, it is important that they can
recognize not just speech and music, which have been researched as specific tasks, but …

General-purpose tagging of Freesound audio with AudioSet labels: Task description, dataset, and baseline

E Fonseca, M Plakal, F Font, DPW Ellis… - arXiv preprint arXiv …, 2018 - arxiv.org
This paper describes Task 2 of the DCASE 2018 Challenge, titled "General-purpose audio
tagging of Freesound content with AudioSet labels". This task was hosted on the Kaggle …

Sound event detection of weakly labelled data with CNN-Transformer and automatic threshold optimization

Q Kong, Y Xu, W Wang… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Sound event detection (SED) is a task to detect sound events in an audio recording. One
challenge of the SED task is that many datasets such as the Detection and Classification of …

Histogram of gradients of time–frequency representations for audio scene classification

A Rakotomamonjy, G Gasso - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
Presents our entry to the Detection and Classification of Acoustic Scenes challenge. The
approach we propose for classifying acoustic scenes is based on transforming the audio …

Acoustic Scene Classification Using Parallel Combination of LSTM and CNN

SH Bae, IK Choi, NS Kim - DCASE, 2016 - dcase.community
Deep neural networks (DNNs) have recently achieved great success in various learning
tasks, and have also been used for classification of environmental sounds. While DNNs are …