Query-by-example keyword spotting using long short-term memory networks
We present a novel approach to query-by-example keyword spotting (KWS) using a long
short-term memory (LSTM) recurrent neural network-based feature extractor. In our …
short-term memory (LSTM) recurrent neural network-based feature extractor. In our …
Neural network based end-to-end query by example spoken term detection
This article focuses on the problem of query by example spoken term detection (QbE-STD)
in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time …
in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time …
Donut: Ctc-based query-by-example keyword spotting
Keyword spotting--or wakeword detection--is an essential feature for hands-free operation of
modern voice-controlled devices. With such devices becoming ubiquitous, users might want …
modern voice-controlled devices. With such devices becoming ubiquitous, users might want …
Multilingual spoken term detection: a review
In modern multilingual societies, there is a demand for multilingual Automatic Speech
Recognition (ASR) and Spoken Term Detection (STD). Multilingual Spoken Term Detection …
Recognition (ASR) and Spoken Term Detection (STD). Multilingual Spoken Term Detection …
Learning acoustic word embeddings with temporal context for query-by-example speech search
We propose to learn acoustic word embeddings with temporal context for query-by-example
(QbE) speech search. The temporal context includes the leading and trailing word …
(QbE) speech search. The temporal context includes the leading and trailing word …
A lightweight architecture for query-by-example keyword spotting on low-power iot devices
M Li - IEEE Transactions on Consumer Electronics, 2022 - ieeexplore.ieee.org
Keyword spotting (KWS) is a task to recognize a keyword or a particular command in a
continuous audio stream, which can be effectively applied to a voice trigger system that …
continuous audio stream, which can be effectively applied to a voice trigger system that …
CRNN-CTC based mandarin keywords spotting
Deep learning based approaches have greatly improved the performance of spoken
keyword spotting (KWS). However, KWS of different languages should have their own …
keyword spotting (KWS). However, KWS of different languages should have their own …
Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages
Pre-trained speech representations like wav2vec 2.0 are a powerful tool for automatic
speech recognition (ASR). Yet many endangered languages lack sufficient data for pre …
speech recognition (ASR). Yet many endangered languages lack sufficient data for pre …
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network
KS Rao - Computer Speech & Language, 2022 - Elsevier
In this paper, a novel approach to unsupervised pattern discovery for speech signals is
proposed. Recently, we introduced an image processing method (IPM) that extracts the …
proposed. Recently, we introduced an image processing method (IPM) that extracts the …
Template-matching for text-dependent speaker verification
In the last decade, i-vector and Joint Factor Analysis (JFA) approaches to speaker modeling
have become ubiquitous in the area of automatic speaker recognition. Both of these …
have become ubiquitous in the area of automatic speaker recognition. Both of these …