An embedded segmental k-means model for unsupervised segmentation and clustering of speech

H Kamper, K Livescu… - 2017 IEEE automatic …, 2017 - ieeexplore.ieee.org
Unsupervised segmentation and clustering of unlabelled speech are core problems in zero-
resource speech processing. Most approaches lie at methodological extremes: some use …

Bayesian subspace hidden Markov model for acoustic unit discovery

L Ondel, HK Vydana, L Burget, J Černocký - ar** speech technologies for low-resource languages has become a very active
research field over the last decade. Among others, Bayesian models have shown some …

A hierarchical subspace model for language-attuned acoustic unit discovery

B Yusuf, L Ondel, L Burget, J Černocký… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
In this work, we propose a hierarchical subspace model for acoustic unit discovery. In this
approach, we frame the task as one of learning embeddings on a low-dimensional phonetic …

A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery

W van der Merwe, H Kamper, J Preez - arxiv preprint arxiv:2206.11706, 2022 - arxiv.org
Latent Dirichlet allocation (LDA) is widely used for unsupervised topic modelling on sets of
documents. No temporal information is used in the model. However, there is often a …

The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks

S Feng, O Scharenborg - IEEE Open Journal of Signal …, 2021 - ieeexplore.ieee.org
This study addresses unsupervised subword modeling, ie, learning acoustic feature
representations that can distinguish between subword units of a language. We propose a …

[PDF][PDF] Discovering acoustic units from speech: A Bayesian approach

AFL Ondel - Ph. D. thesis, 2021 - researchgate.net
From an early age, infants show an innate ability to infer linguistic structures from the speech
signal long before they learn to read and write. In contrast, modern speech recognition …

[PDF][PDF] Unsupervised Phonetic and Word Level Discovery for Speech to Speech Translation for Unwritten Languages.

S Hillis, AP Kumar, AW Black - INTERSPEECH, 2019 - festvox.org
We experiment with unsupervised methods for deriving and clustering symbolic
representations of speech, working towards speech-to-speech translation for languages …

Bayesian Subspace HMM for the Zerospeech 2020 Challenge

B Yusuf, L Ondel - arxiv preprint arxiv:2005.09282, 2020 - arxiv.org
In this paper we describe our submission to the Zerospeech 2020 challenge, where the
participants are required to discover latent representations from unannotated speech, and to …

[PDF][PDF] Lucas Ondel

AL Burget, AM Hanneman, J Černocký - lucasondel.github.io
Lucas Ondel – Ph. D. candidate Page 1 Lucas Ondel Ph. D. candidate +420 777 234 742
lucas.ondel@gmail.com Education 2012 2020 Ph.D. in Automatic Speech Recognition, Brno …