Deep learning on multi sensor data for counter UAV applications—A systematic review

S Samaras, E Diamantidou, D Ataloglou, N Sakellariou… - Sensors, 2019 - mdpi.com
Usage of Unmanned Aerial Vehicles (UAVs) is growing rapidly in a wide range of consumer
applications, as they prove to be both autonomous and flexible in a variety of environments …

Automated audio captioning: An overview of recent progress and new challenges

X Mei, X Liu, MD Plumbley, W Wang - … journal on audio, speech, and music …, 2022 - Springer
Automated audio captioning is a cross-modal translation task that aims to generate natural
language descriptions for given audio clips. This task has received increasing attention with …

Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research

X Mei, C Meng, H Liu, Q Kong, T Ko… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
The advancement of audio-language (AL) multimodal learning tasks has been significant in
recent years, yet the limited size of existing audio-language datasets poses challenges for …

DCASE 2017 challenge setup: Tasks, datasets and baseline system

A Mesaros, T Heittola, A Diment, B Elizalde… - … 2017-workshop on …, 2017 - inria.hal.science
DCASE 2017 Challenge consists of four tasks: acoustic scene classification, detection of
rare sound events, sound event detection in real-life audio, and large-scale weakly …

Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge

A Mesaros, T Heittola, E Benetos… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org
Public evaluation campaigns and datasets promote active development in target research
areas, allowing direct comparison of algorithms. The second edition of the challenge on …

Large-scale weakly supervised audio classification using gated convolutional neural network

Y Xu, Q Kong, W Wang… - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
In this paper, we present a gated convolutional neural network and a temporal attention-
based localization method for audio classification, which won the 1st place in the large-scale …

Unsupervised learning of semantic audio representations

A Jansen, M Plakal, R Pandya… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
Even in the absence of any explicit semantic annotation, vast collections of audio recordings
provide valuable information for learning the categorical structure of sounds. We consider …

Audio set classification with attention model: A probabilistic perspective

Q Kong, Y Xu, W Wang… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
This paper investigates the Audio Set classification. Audio Set is a large scale weakly
labelled dataset (WLD) of audio clips. In WLD only the presence of a label is known, without …

Environmental audio scene and sound event recognition for autonomous surveillance: A survey and comparative studies

S Chandrakala, SL Jayalakshmi - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Monitoring of human and social activities is becoming increasingly pervasive in our living
environment for public security and safety applications. The recognition of suspicious events …

Convolutional gated recurrent neural network incorporating spatial features for audio tagging

Y Xu, Q Kong, Q Huang, W Wang… - … Joint Conference on …, 2017 - ieeexplore.ieee.org
Environmental audio tagging is a newly proposed task to predict the presence or absence of
a specific audio event in a chunk. Deep neural network (DNN) based methods have been …