Supervised speech separation based on deep learning: An overview

DL Wang, J Chen - IEEE/ACM transactions on audio, speech …, 2018 - ieeexplore.ieee.org
Speech separation is the task of separating target speech from background interference.
Traditionally, speech separation is studied as a signal processing problem. A more recent …

Deep spoken keyword spotting: An overview

I López-Espejo, ZH Tan, JHL Hansen, J Jensen - IEEE Access, 2021 - ieeexplore.ieee.org
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …

Convolutional recurrent neural networks for small-footprint keyword spotting

SO Arik, M Kliegl, R Child, J Hestness… - arxiv preprint arxiv …, 2017 - arxiv.org
Keyword spotting (KWS) constitutes a major component of human-technology interfaces.
Maximizing the detection accuracy at a low false alarm (FA) rate, while minimizing the …

LEAF: A learnable frontend for audio classification

N Zeghidour, O Teboul, FDC Quitry… - arxiv preprint arxiv …, 2021 - arxiv.org
Mel-filterbanks are fixed, engineered audio features which emulate human perception and
have been used through the history of audio understanding up to today. However, their …

Acoustic scene classification: A comprehensive survey

B Ding, T Zhang, C Wang, G Liu, J Liang, R Hu… - Expert Systems with …, 2024 - Elsevier
Acoustic scene classification (ASC) has gained significant interest recently due to its diverse
applications. Various audio signal processing and machine learning methods have been …

A convolutional neural network for automated detection of humpback whale song in a diverse, long-term passive acoustic dataset

AN Allen, M Harvey, L Harrell, A Jansen… - Frontiers in Marine …, 2021 - frontiersin.org
Passive acoustic monitoring is a well-established tool for researching the occurrence,
movements, and ecology of a wide variety of marine mammal species. Advances in …

Attention guided learnable time-domain filterbanks for speech depression detection

W Yang, J Liu, P Cao, R Zhu, Y Wang, JK Liu, F Wang… - Neural Networks, 2023 - Elsevier
Depression, as a global mental health problem, is lacking effective screening methods that
can help with early detection and treatment. This paper aims to facilitate the large-scale …

A review of deep learning based methods for acoustic scene classification

J Abeßer - Applied Sciences, 2020 - mdpi.com
The number of publications on acoustic scene classification (ASC) in environmental audio
recordings has constantly increased over the last few years. This was mainly stimulated by …

Improving bird classification with unsupervised sound separation

T Denton, S Wisdom, JR Hershey - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
This paper addresses the problem of species classification in bird song recordings. The
massive amount of available field recordings of birds presents an opportunity to use …