The internet of audio things: State of the art, vision, and challenges

L Turchet, G Fazekas, M Lagrange… - IEEE internet of …, 2020‏ - ieeexplore.ieee.org
The Internet of Audio Things (IoAuT) is an emerging research field positioned at the
intersection of the Internet of Things, sound and music computing, artificial intelligence, and …

Fsd50k: an open dataset of human-labeled sound events

E Fonseca, X Favory, J Pons, F Font… - IEEE/ACM Transactions …, 2021‏ - ieeexplore.ieee.org
Most existing datasets for sound event recognition (SER) are relatively small and/or domain-
specific, with the exception of AudioSet, based on over 2 M tracks from YouTube videos and …

Panns: Large-scale pretrained audio neural networks for audio pattern recognition

Q Kong, Y Cao, T Iqbal, Y Wang… - … on Audio, Speech …, 2020‏ - ieeexplore.ieee.org
Audio pattern recognition is an important research topic in the machine learning area, and
includes several tasks such as audio tagging, acoustic scene classification, music …

A survey of audio enhancement algorithms for music, speech, bioacoustics, biomedical, industrial, and environmental sounds by image U-Net

S Gul, MS Khan - IEEE Access, 2023‏ - ieeexplore.ieee.org
The recent surge in the use of Deep Neural Networks (DNNs) has also made its mark in the
field of Audio Enhancement (AE), providing much better quality than the classical methods …

Strong labeling of sound events using crowdsourced weak labels and annotator competence estimation

I Martín-Morató, A Mesaros - IEEE/ACM transactions on audio …, 2023‏ - ieeexplore.ieee.org
Crowdsourcing is a popular tool for collecting large amounts of annotated data, but the
specific format of the strong labels necessary for sound event detection is not easily …

Cough against covid: Evidence of covid-19 signature in cough sounds

P Bagad, A Dalmia, J Doshi, A Nagrani… - arxiv preprint arxiv …, 2020‏ - arxiv.org
Testing capacity for COVID-19 remains a challenge globally due to the lack of adequate
supplies, trained personnel, and sample-processing equipment. These problems are even …

Polyphonic sound event detection and localization using a two-stage strategy

Y Cao, Q Kong, T Iqbal, F An, W Wang… - arxiv preprint arxiv …, 2019‏ - arxiv.org
Sound event detection (SED) and localization refer to recognizing sound events and
estimating their spatial and temporal locations. Using neural networks has become the …

Learning sound event classifiers from web audio with noisy labels

E Fonseca, M Plakal, DPW Ellis, F Font… - ICASSP 2019-2019 …, 2019‏ - ieeexplore.ieee.org
As sound event classification moves towards larger datasets, issues of label noise become
inevitable. Web sites can supply large volumes of user-contributed audio and metadata, but …

Attribution in scale and space

S Xu, S Venugopalan… - Proceedings of the …, 2020‏ - openaccess.thecvf.com
We study the attribution problem for deep networks applied to perception tasks. For vision
tasks, attribution techniques attribute the prediction of a network to the pixels of the input …

Sound event detection and time–frequency segmentation from weakly labelled data

Q Kong, Y Xu, I Sobieraj, W Wang… - … /ACM Transactions on …, 2019‏ - ieeexplore.ieee.org
Sound event detection (SED) aims to detect when and recognize what sound events happen
in an audio clip. Many supervised SED algorithms rely on strongly labelled data that …