A survey of deep network techniques all classifiers can adopt

A Ghods, DJ Cook - Data mining and knowledge discovery, 2021 - Springer
Deep neural networks (DNNs) have introduced novel and useful tools to the machine
learning community. Other types of classifiers can potentially make use of these tools as well …

Multimodal and multilingual embeddings for large-scale speech mining

PA Duquenne, H Gong… - Advances in Neural …, 2021 - proceedings.neurips.cc
We present an approach to encode a speech signal into a fixed-size representation which
minimizes the cosine loss with the existing massively multilingual LASER text embedding …

Do infants really learn phonetic categories?

NH Feldman, S Goldwater, E Dupoux, T Schatz - Open Mind, 2021 - direct.mit.edu
Early changes in infants' ability to perceive native and nonnative speech sound contrasts are
typically attributed to their developing knowledge of phonetic categories. We critically …

DP-Parse: Finding word boundaries from raw speech with an instance lexicon

R Algayres, T Ricoul, J Karadayi… - Transactions of the …, 2022 - direct.mit.edu
Finding word boundaries in continuous speech is challenging as there is little or no
equivalent of a 'space' delimiter between words. Popular Bayesian non-parametric models …

Feature learning for efficient ASR-free keyword spotting in low-resource languages

E van der Westhuizen, H Kamper, R Menon… - Computer Speech & …, 2022 - Elsevier
We consider feature learning for a computationally efficient method of keyword spotting that
can be applied in severely under-resourced settings. The objective is to support …

Siamese capsule network for end-to-end speaker recognition in the wild

A Hajavi, A Etemad - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org
We propose an end-to-end deep model for speaker verification in the wild. Our model uses
thin-ResNet for extracting speaker embeddings from utterances and a Siamese capsule …

Improved acoustic word embeddings for zero-resource languages using multilingual transfer

H Kamper, Y Matusevych… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Acoustic word embeddings are fixed-dimensional representations of variable-length speech
segments. Such embeddings can form the basis for speech search, indexing and discovery …