Query-by-example keyword spotting using long short-term memory networks

G Chen, C Parada, TN Sainath - 2015 IEEE international …, 2015 - ieeexplore.ieee.org
We present a novel approach to query-by-example keyword spotting (KWS) using a long
short-term memory (LSTM) recurrent neural network-based feature extractor. In our …

Neural network based end-to-end query by example spoken term detection

D Ram, L Miculicich, H Bourlard - IEEE/ACM Transactions on …, 2020 - ieeexplore.ieee.org
This article focuses on the problem of query by example spoken term detection (QbE-STD)
in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time …

Donut: Ctc-based query-by-example keyword spotting

L Lugosch, S Myer, VS Tomar - arxiv preprint arxiv:1811.10736, 2018 - arxiv.org
Keyword spotting--or wakeword detection--is an essential feature for hands-free operation of
modern voice-controlled devices. With such devices becoming ubiquitous, users might want …

Multilingual spoken term detection: a review

G Deekshitha, L Mary - International Journal of Speech Technology, 2020 - Springer
In modern multilingual societies, there is a demand for multilingual Automatic Speech
Recognition (ASR) and Spoken Term Detection (STD). Multilingual Spoken Term Detection …

Learning acoustic word embeddings with temporal context for query-by-example speech search

Y Yuan, CC Leung, L **e, H Chen, B Ma… - arxiv preprint arxiv …, 2018 - arxiv.org
We propose to learn acoustic word embeddings with temporal context for query-by-example
(QbE) speech search. The temporal context includes the leading and trailing word …

A lightweight architecture for query-by-example keyword spotting on low-power iot devices

M Li - IEEE Transactions on Consumer Electronics, 2022 - ieeexplore.ieee.org
Keyword spotting (KWS) is a task to recognize a keyword or a particular command in a
continuous audio stream, which can be effectively applied to a voice trigger system that …

CRNN-CTC based mandarin keywords spotting

H Yan, Q He, W **e - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Deep learning based approaches have greatly improved the performance of spoken
keyword spotting (KWS). However, KWS of different languages should have their own …

Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages

N San, M Bartelds, M Browne, L Clifford… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
Pre-trained speech representations like wav2vec 2.0 are a powerful tool for automatic
speech recognition (ASR). Yet many endangered languages lack sufficient data for pre …

A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network

KS Rao - Computer Speech & Language, 2022 - Elsevier
In this paper, a novel approach to unsupervised pattern discovery for speech signals is
proposed. Recently, we introduced an image processing method (IPM) that extracts the …

Template-matching for text-dependent speaker verification

S Dey, P Motlicek, S Madikeri, M Ferras - Speech communication, 2017 - Elsevier
In the last decade, i-vector and Joint Factor Analysis (JFA) approaches to speaker modeling
have become ubiquitous in the area of automatic speaker recognition. Both of these …