Self-supervised language learning from raw audio: Lessons from the zero resource speech challenge

E Dunbar, N Hamilakis… - IEEE Journal of Selected …, 2022 - ieeexplore.ieee.org
Recent progress in self-supervised or unsupervised machine learning has opened the
possibility of building a full speech processing system from raw audio without using any …

A brief overview of unsupervised neural speech representation learning

L Borgholt, JD Havtorn, J Edin, L Maaløe… - arxiv preprint arxiv …, 2022 - arxiv.org
Unsupervised representation learning for speech processing has matured greatly in the last
few years. Work in computer vision and natural language processing has paved the way, but …

Autoregressive predictive coding: A comprehensive study

GP Yang, SL Yeh, YA Chung, J Glass… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
We review autoregressive predictive coding (APC), an approach to learn speech
representation by predicting a future frame given the past frames. We present three different …

Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

S Cuervo, A Lancucki, R Marxer… - Advances in …, 2022 - proceedings.neurips.cc
The success of deep learning comes from its ability to capture the hierarchical structure of
data by learning high-level representations defined in terms of low-level ones. In this paper …

Deep audio embeddings for vocalisation clustering

P Best, S Paris, H Glotin, R Marxer - Plos one, 2023 - journals.plos.org
The study of non-human animals' communication systems generally relies on the
transcription of vocal sequences using a finite set of discrete units. This set is referred to as a …

Advancing RUL prediction in mechanical systems: A hybrid deep learning approach utilizing non-full lifecycle data

T Lin, L Song, L Cui, H Wang - Advanced Engineering Informatics, 2024 - Elsevier
This paper addresses the significant challenge of predicting the Remaining Useful Life
(RUL) of mechanical equipment, a critical aspect of predictive maintenance and reliability …

Why human prejudice is so persistent: A predictive coding analysis

TW Hung - Social Epistemology, 2023 - Taylor & Francis
Although the relationship between prejudice and predictive coding has attracted more
attention recently, many important issues remain to be investigated, such as why prejudice is …

[PDF][PDF] Segmental speechclip: Utilizing pretrained image-text models for audio-visual learning

S Bhati, J Villalba, L Moro-Velazquez, T Thebaud… - …, 2023 - researchgate.net
Visually grounded models learn from paired images and their spoken captions. Recently,
there have been attempts to utilize the visually grounded models trained from images and …

Beyond orthography: Automatic recovery of short vowels and dialectal sounds in arabic

YE Kheir, H Mubarak, A Ali, SA Chowdhury - arxiv preprint arxiv …, 2024 - arxiv.org
This paper presents a novel Dialectal Sound and Vowelization Recovery framework,
designed to recognize borrowed and dialectal sounds within phonologically diverse and …

Slowness Regularized Contrastive Predictive Coding for Acoustic Unit Discovery

S Bhati, J Villalba, P Żelasko… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Self-supervised methods such as Contrastive predictive Coding (CPC) have greatly
improved the quality of the unsupervised representations. These representations …