A survey on deep learning based forest environment sound classification at the edge

D Meedeniya, I Ariyarathne, M Bandara… - ACM Computing …, 2023 - dl.acm.org
Forest ecosystems are of paramount importance to the sustainable existence of life on earth.
Unique natural and artificial phenomena pose severe threats to the perseverance of such …

FSD50K: an open dataset of human-labeled sound events

E Fonseca, X Favory, J Pons, F Font… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Most existing datasets for sound event recognition (SER) are relatively small and/or domain-
specific, with the exception of AudioSet, based on over 2 M tracks from YouTube videos and …

HTS-AT: A hierarchical token-semantic audio transformer for sound classification and detection

K Chen, X Du, B Zhu, Z Ma… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Audio classification is an important task of mapping audio samples into their corresponding
labels. Recently, the transformer model with self-attention mechanisms has been adopted in …

A comprehensive review of polyphonic sound event detection

TK Chan, CS Chin - IEEE Access, 2020 - ieeexplore.ieee.org
One of the most amazing functions of the human auditory system is the ability to detect all
kinds of sound events in the environment. With the technologies and hardware advances …

What's all the fuss about free universal sound separation data?

S Wisdom, H Erdogan, DPW Ellis… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
We introduce the Free Universal Sound Separation (FUSS) dataset, a new corpus for
experiments in separating mixtures of an unknown number of sounds from an open domain …

Zero-shot audio source separation through query-based learning from weakly-labeled data

K Chen, X Du, B Zhu, Z Ma, T Berg-Kirkpatrick… - Proceedings of the …, 2022 - ojs.aaai.org
Deep learning techniques for separating audio into different sound sources face several
challenges. Standard architectures require training separate models for different types of …

Sound event detection by consistency training and pseudo-labeling with feature-pyramid convolutional recurrent neural networks

CY Koh, YS Chen, YW Liu… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Due to the high cost of large-scale strong labeling, sound event detection (SED) using only
weakly-labeled and unlabeled data has drawn increasing attention in recent years. To …

Audio lottery: Speech recognition made ultra-lightweight, noise-robust, and transferable

S Ding, T Chen, Z Wang - International Conference on Learning …, 2022 - par.nsf.gov
Lightweight speech recognition models have seen explosive demands owing to a growing
amount of speech-interactive features on mobile devices. Since designing such systems …