Deep learning on multi sensor data for counter UAV applications—A systematic review
Usage of Unmanned Aerial Vehicles (UAVs) is growing rapidly in a wide range of consumer
applications, as they prove to be both autonomous and flexible in a variety of environments …
applications, as they prove to be both autonomous and flexible in a variety of environments …
Automated audio captioning: An overview of recent progress and new challenges
Automated audio captioning is a cross-modal translation task that aims to generate natural
language descriptions for given audio clips. This task has received increasing attention with …
language descriptions for given audio clips. This task has received increasing attention with …
Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research
The advancement of audio-language (AL) multimodal learning tasks has been significant in
recent years, yet the limited size of existing audio-language datasets poses challenges for …
recent years, yet the limited size of existing audio-language datasets poses challenges for …
DCASE 2017 challenge setup: Tasks, datasets and baseline system
DCASE 2017 Challenge consists of four tasks: acoustic scene classification, detection of
rare sound events, sound event detection in real-life audio, and large-scale weakly …
rare sound events, sound event detection in real-life audio, and large-scale weakly …
Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge
Public evaluation campaigns and datasets promote active development in target research
areas, allowing direct comparison of algorithms. The second edition of the challenge on …
areas, allowing direct comparison of algorithms. The second edition of the challenge on …
Large-scale weakly supervised audio classification using gated convolutional neural network
In this paper, we present a gated convolutional neural network and a temporal attention-
based localization method for audio classification, which won the 1st place in the large-scale …
based localization method for audio classification, which won the 1st place in the large-scale …
Unsupervised learning of semantic audio representations
Even in the absence of any explicit semantic annotation, vast collections of audio recordings
provide valuable information for learning the categorical structure of sounds. We consider …
provide valuable information for learning the categorical structure of sounds. We consider …
Audio set classification with attention model: A probabilistic perspective
This paper investigates the Audio Set classification. Audio Set is a large scale weakly
labelled dataset (WLD) of audio clips. In WLD only the presence of a label is known, without …
labelled dataset (WLD) of audio clips. In WLD only the presence of a label is known, without …
Environmental audio scene and sound event recognition for autonomous surveillance: A survey and comparative studies
Monitoring of human and social activities is becoming increasingly pervasive in our living
environment for public security and safety applications. The recognition of suspicious events …
environment for public security and safety applications. The recognition of suspicious events …
Convolutional gated recurrent neural network incorporating spatial features for audio tagging
Environmental audio tagging is a newly proposed task to predict the presence or absence of
a specific audio event in a chunk. Deep neural network (DNN) based methods have been …
a specific audio event in a chunk. Deep neural network (DNN) based methods have been …