Environmental sound classification with convolutional neural networks
KJ Piczak - 2015 IEEE 25th international workshop on machine …, 2015 - ieeexplore.ieee.org
This paper evaluates the potential of convolutional neural networks in classifying short audio
clips of environmental sounds. A deep model consisting of 2 convolutional layers with max …
clips of environmental sounds. A deep model consisting of 2 convolutional layers with max …
Augmented Hearing of Auditory Safety Cues for Construction Workers: A Systematic Literature Review
Safety-critical sounds at job sites play an essential role in construction safety, but hearing
capability is often declined due to the use of hearing protection and the complicated nature …
capability is often declined due to the use of hearing protection and the complicated nature …
Environmental sound classification with dilated convolutions
In sound information retrieval (SIR) area, environmental sound classification (ESC) emerges
as a new issue, which aims at classifying environments by analysing the complex features …
as a new issue, which aims at classifying environments by analysing the complex features …
Enhancing micro-video understanding by harnessing external sounds
Different from traditional long videos, micro-videos are much shorter and usually recorded at
a specific place with mobile devices. To better understand the semantics of a micro-video …
a specific place with mobile devices. To better understand the semantics of a micro-video …
Audio-based multimedia event detection using deep recurrent neural networks
Multimedia event detection (MED) is the task of detecting given events (eg birthday party,
making a sandwich) in a large collection of video clips. While visual features and automatic …
making a sandwich) in a large collection of video clips. While visual features and automatic …
Gun identification from gunshot audios for secure public places using transformer learning
Increased mass shootings and terrorist activities severely impact society mentally and
physically. Development of real-time and cost-effective automated weapon detection …
physically. Development of real-time and cost-effective automated weapon detection …
A network of deep neural networks for distant speech recognition
Despite the remarkable progress recently made in distant speech recognition, state-of-the-
art technology still suffers from a lack of robustness, especially when adverse acoustic …
art technology still suffers from a lack of robustness, especially when adverse acoustic …
Batch-normalized joint training for DNN-based distant speech recognition
Improving distant speech recognition is a crucial step towards flexible human-machine
interfaces. Current technology, however, still exhibits a lack of robustness, especially when …
interfaces. Current technology, however, still exhibits a lack of robustness, especially when …
Hierarchical learning for DNN-based acoustic scene classification
In this paper, we present a deep neural network (DNN)-based acoustic scene classification
framework. Two hierarchical learning methods are proposed to improve the DNN baseline …
framework. Two hierarchical learning methods are proposed to improve the DNN baseline …
A first attempt at polyphonic sound event detection using connectionist temporal classification
Sound event detection is the task of detecting the type, starting time, and ending time of
sound events in audio streams. Recently, recurrent neural networks (RNNs) have become …
sound events in audio streams. Recently, recurrent neural networks (RNNs) have become …