A survey on deep learning based forest environment sound classification at the edge

D Meedeniya, I Ariyarathne, M Bandara… - ACM Computing …, 2023 - dl.acm.org
Forest ecosystems are of paramount importance to the sustainable existence of life on earth.
Unique natural and artificial phenomena pose severe threats to the perseverance of such …

Pengi: An audio language model for audio tasks

S Deshmukh, B Elizalde, R Singh… - Advances in Neural …, 2023 - proceedings.neurips.cc
In the domain of audio processing, Transfer Learning has facilitated the rise of Self-
Supervised Learning and Zero-Shot Learning techniques. These approaches have led to …

The internet of audio things: State of the art, vision, and challenges

L Turchet, G Fazekas, M Lagrange… - IEEE Internet of …, 2020 - ieeexplore.ieee.org
The Internet of Audio Things (IoAuT) is an emerging research field positioned at the
intersection of the Internet of Things, sound and music computing, artificial intelligence, and …

SoundStream: An end-to-end neural audio codec

N Zeghidour, A Luebs, A Omran… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
We present SoundStream, a novel neural audio codec that can efficiently compress speech,
music and general audio at bitrates normally targeted by speech-tailored codecs …

FSD50K: An open dataset of human-labeled sound events

E Fonseca, X Favory, J Pons, F Font… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Most existing datasets for sound event recognition (SER) are relatively small and/or domain-
specific, with the exception of AudioSet, based on over 2 M tracks from YouTube videos and …

MIMII Dataset: Sound dataset for malfunctioning industrial machine investigation and inspection

H Purohit, R Tanabe, K Ichige, T Endo… - arXiv preprint arXiv …, 2019 - arxiv.org
Factory machinery is prone to failure or breakdown, resulting in significant expenses for
companies. Hence, there is a rising interest in machine monitoring using different sensors …

AudioCaps: Generating captions for audios in the wild

CD Kim, B Kim, H Lee, G Kim - … of the 2019 Conference of the …, 2019 - aclanthology.org
We explore the problem of Audio Captioning: generating natural language description for
any kind of audio in the wild, which has been surprisingly unexplored in previous research …

Students' perception towards behavioral intention of audio and video teaching styles: An acceptance study

RS Al-Maroof, NMN Alahbabi, I Akour… - … Journal of Data and …, 2022 - zuscholars.zu.ac.ae
Recently audio and video material has been used significantly in various online platforms.
The audio-video materials enhance the teaching and learning process by facilitating the …

Unsupervised sound separation using mixture invariant training

S Wisdom, E Tzinis, H Erdogan… - Advances in neural …, 2020 - proceedings.neurips.cc
In recent years, rapid progress has been made on the problem of single-channel sound
separation using supervised training of deep neural networks. In such supervised …

ToyADMOS: A dataset of miniature-machine operating sounds for anomalous sound detection

Y Koizumi, S Saito, H Uematsu… - 2019 IEEE Workshop …, 2019 - ieeexplore.ieee.org
This paper introduces a new dataset called "ToyADMOS" designed for anomaly detection in
machine operating sounds (ADMOS). To the best of our knowledge, no large-scale datasets …