Environmental sound classification with convolutional neural networks

KJ Piczak - 2015 IEEE 25th international workshop on machine …, 2015 - ieeexplore.ieee.org
This paper evaluates the potential of convolutional neural networks in classifying short audio
clips of environmental sounds. A deep model consisting of 2 convolutional layers with max …

Augmented Hearing of Auditory Safety Cues for Construction Workers: A Systematic Literature Review

K Dang, K Elelu, T Le, C Le - Sensors, 2022 - mdpi.com
Safety-critical sounds at job sites play an essential role in construction safety, but hearing
capability is often declined due to the use of hearing protection and the complicated nature …

Environmental sound classification with dilated convolutions

Y Chen, Q Guo, X Liang, J Wang, Y Qian - Applied Acoustics, 2019 - Elsevier
In sound information retrieval (SIR) area, environmental sound classification (ESC) emerges
as a new issue, which aims at classifying environments by analysing the complex features …

Enhancing micro-video understanding by harnessing external sounds

L Nie, X Wang, J Zhang, X He, H Zhang… - Proceedings of the 25th …, 2017 - dl.acm.org
Different from traditional long videos, micro-videos are much shorter and usually recorded at
a specific place with mobile devices. To better understand the semantics of a micro-video …

Audio-based multimedia event detection using deep recurrent neural networks

Y Wang, L Neves, F Metze - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
Multimedia event detection (MED) is the task of detecting given events (e.g. birthday party,
making a sandwich) in a large collection of video clips. While visual features and automatic …

Gun identification from gunshot audios for secure public places using transformer learning

R Nijhawan, SA Ansari, S Kumar, F Alassery… - Scientific reports, 2022 - nature.com
Increased mass shootings and terrorist activities severely impact society mentally and
physically. Development of real-time and cost-effective automated weapon detection …

A network of deep neural networks for distant speech recognition

M Ravanelli, P Brakel, M Omologo… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Despite the remarkable progress recently made in distant speech recognition, state-of-the-
art technology still suffers from a lack of robustness, especially when adverse acoustic …

Batch-normalized joint training for DNN-based distant speech recognition

M Ravanelli, P Brakel, M Omologo… - 2016 IEEE Spoken …, 2016 - ieeexplore.ieee.org
Improving distant speech recognition is a crucial step towards flexible human-machine
interfaces. Current technology, however, still exhibits a lack of robustness, especially when …

Hierarchical learning for DNN-based acoustic scene classification

Y Xu, Q Huang, W Wang, MD Plumbley - arXiv preprint arXiv:1607.03682, 2016 - arxiv.org
In this paper, we present a deep neural network (DNN)-based acoustic scene classification
framework. Two hierarchical learning methods are proposed to improve the DNN baseline …

A first attempt at polyphonic sound event detection using connectionist temporal classification

Y Wang, F Metze - … conference on acoustics, speech and signal …, 2017 - ieeexplore.ieee.org
Sound event detection is the task of detecting the type, starting time, and ending time of
sound events in audio streams. Recently, recurrent neural networks (RNNs) have become …