Non-Māori-speaking New Zealanders have a Māori proto-lexicon

Y Oh, S Todd, C Beckner, J Hay, J King, J Needle - Scientific reports, 2020 - nature.com
We investigate implicit vocabulary learning by adults who are exposed to a language in their
ambient environment. Most New Zealanders do not speak Māori, yet are exposed to it …

Automatic transcription challenges for Inuktitut, a low-resource polysynthetic language

V Gupta, G Boulianne - … of the Twelfth Language Resources and …, 2020 - aclanthology.org
We introduce the first attempt at automatic speech recognition (ASR) in Inuktitut, as a
representative for polysynthetic, low-resource languages, like many of the 900 Indigenous …

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

B Yusuf, J Černocký, M Saraçlar - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
Conventional keyword search systems operate on automatic speech recognition (ASR)
outputs, which causes them to have a complex indexing and search pipeline. This has led to …

Constructing sub-word units for spoken term detection

C Van Heerden, D Karakos… - … , Speech and Signal …, 2017 - ieeexplore.ieee.org
Spoken term detection, especially of out-of-vocabulary (OOV) keywords, benefits from the
use of sub-word systems. We experiment with different language-independent approaches …

Joint learning of distance metric and query model for posteriorgram-based keyword search

B Gündoğdu, B Yusuf, M Saraçlar - IEEE Journal of Selected …, 2017 - ieeexplore.ieee.org
In this paper, we propose a novel approach to keyword search (KWS) in low-resource
languages, which provides an alternative method for retrieving the terms of interest …

Spoken term detection and relevance score estimation using dot-product of pronunciation embeddings

J Švec, L Šmídl, JV Psutka, A Pražák - arxiv preprint arxiv:2210.11895, 2022 - arxiv.org
The paper describes a novel approach to Spoken Term Detection (STD) in large spoken
archives using deep LSTM networks. The work is based on the previous approach of using …

Generative RNNs for OOV keyword search

B Gundogdu, B Yusuf, M Saraclar - IEEE Signal Processing …, 2018 - ieeexplore.ieee.org
The modeling of text queries as sequences of embeddings for conducting similarity
matching based search within speech features has been recently shown to improve keyword …

Semantically expanded spoken term detection

Z Kozhirbayev, Z Yessenbayev - IEEE Access, 2024 - ieeexplore.ieee.org
Spoken term detection (STD) is effectively implemented using fundamental techniques such
as automatic speech recognition (ASR) and information retrieval. Through these methods …

Deep LSTM spoken term detection using Wav2Vec 2.0 recognizer

J Švec, J Lehečka, L Šmídl - arxiv preprint arxiv:2210.11885, 2022 - arxiv.org
In recent years, the standard hybrid DNN-HMM speech recognizers are outperformed by the
end-to-end speech recognition systems. One of the very promising approaches is the …

Fast and accurate OOV decoder on high-level features

Y Khokhlov, N Tomashenko, I Medennikov… - arxiv preprint arxiv …, 2017 - arxiv.org
This work proposes a novel approach to out-of-vocabulary (OOV) keyword search (KWS)
task. The proposed approach is based on using high-level features from an automatic …