Towards relatable explainable AI with the perceptual process

W Zhang, BY Lim - Proceedings of the 2022 CHI Conference on Human …, 2022 - dl.acm.org
Machine learning models need to provide contrastive explanations, since people often seek
to understand why a puzzling prediction occurred instead of some expected outcome …

An interpretable deep learning model for automatic sound classification

P Zinemanas, M Rocamora, M Miron, F Font, X Serra - Electronics, 2021 - mdpi.com
Deep learning models have improved cutting-edge technologies in many research areas,
but their black-box structure makes it difficult to understand their inner workings and the …

[HTML][HTML] Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture

FJB Sanchez, MR Hossain, NB English… - Scientific Reports, 2021 - ncbi.nlm.nih.gov
The use of autonomous recordings of animal sounds to detect species is a popular
conservation tool, constantly improving in fidelity as audio hardware and software evolves …

Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture

FJ Bravo Sanchez, MR Hossain, NB English… - Scientific Reports, 2021 - nature.com
The use of autonomous recordings of animal sounds to detect species is a popular
conservation tool, constantly improving in fidelity as audio hardware and software evolves …

Filterbank design for end-to-end speech separation

M Pariente, S Cornell, A Deleforge… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Single-channel speech separation has recently made great progress thanks to learned
filterbanks as used in ConvTasNet. In parallel, parameterized filterbanks have been …

Towards modeling raw speech in gender identification of children using sincNet over ERB scale

K Radha, M Bansal - International Journal of Speech Technology, 2023 - Springer
This article reveals the results of age-dependent gender identification from raw speech
using the recently developed non-native children's English speech corpus. Convolutional …

E2E-SINCNET: Toward fully end-to-end speech recognition

T Parcollet, M Morchid, G Linares - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Modern end-to-end (E2E) Automatic Speech Recognition (ASR) systems rely on Deep
Neural Networks (DNN) that are mostly trained on handcrafted and pre-computed acoustic …

Cgcnn: Complex gabor convolutional neural network on raw speech

PG Noé, T Parcollet, M Morchid - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Convolutional Neural Networks (CNN) have been used in Automatic Speech Recognition
(ASR) to learn representations directly from the raw signal instead of hand-crafted acoustic …

[PDF][PDF] Dysarthric Speech Recognition From Raw Waveform with Parametric CNNs.

Z Yue, E Loweimi, H Christensen, J Barker… - …, 2022 - isca-archive.org
Raw waveform acoustic modelling has recently received increasing attention. Compared
with the task-blind hand-crafted features which may discard useful information …

[HTML][HTML] Passive sonar automated target classifier for shallow waters using end-to-end learnable deep convolutional LSTMs

S Kamal, CS Chandran, MH Supriya - Engineering Science and …, 2021 - Elsevier
Automated target recognition systems are increasingly employed in sonar systems to reduce
manning and associated challenges. Although passive acoustic target recognition is an …