Advanced data exploitation in speech analysis: An overview

Z Zhang, N Cummins, B Schuller - IEEE Signal Processing …, 2017 - ieeexplore.ieee.org
With recent advances in machine-learning techniques for automatic speech analysis (ASA),
the computerized extraction of information from speech signals, there is a greater need for …

Spoken content retrieval—beyond cascading speech recognition with text retrieval

L Lee, J Glass, H Lee, C Chan - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
Spoken content retrieval refers to directly indexing and retrieving spoken content based on
the audio rather than text descriptions. This potentially eliminates the requirement of …

High-performance query-by-example spoken term detection on the SWS 2013 evaluation

LJ Rodriguez-Fuentes, A Varona… - … , Speech and Signal …, 2014 - ieeexplore.ieee.org
In the last years, the task of Query-by-Example Spoken Term Detection (QbE-STD), which
aims to find occurrences of a spoken query in a set of audio documents, has gained the …

Acoustic segment modeling with spectral clustering methods

H Wang, T Lee, CC Leung, B Ma… - IEEE/ACM Transactions …, 2015 - ieeexplore.ieee.org
This paper presents a study of spectral clustering-based approaches to acoustic segment
modeling (ASM). ASM aims at finding the underlying phoneme-like speech units and …

The spoken web search task at MediaEval 2012

F Metze, X Anguera, E Barnard… - … on Acoustics, Speech …, 2013 - ieeexplore.ieee.org
In this paper, we describe the “Spoken Web Search” Task, which was held as part of the
2012 MediaEval benchmark evaluation campaign. The purpose of this task was to perform …

Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection

H Chen, CC Leung, L Xie, B Ma, H Li - Interspeech, 2016 - researchgate.net
We propose a framework which ports Dirichlet process Gaussian mixture model (DPGMM) based
labels to deep neural network (DNN). The DNN trained using the unsupervised labels is …

Fast query-by-example speech search using attention-based deep binary embeddings

Y Yuan, L Xie, CC Leung, H Chen… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
State-of-the-art query-by-example (QbE) speech search approaches usually use recurrent
neural network (RNN) based acoustic word embeddings (AWEs) to represent variable …

Language independent search in MediaEval's Spoken Web Search task

F Metze, X Anguera, E Barnard, M Davel… - Computer Speech & …, 2014 - Elsevier
In this paper, we describe several approaches to language-independent spoken term
detection and compare their performance on a common task, namely “Spoken Web Search” …

On the calibration and fusion of heterogeneous spoken term detection systems

A Abad, LJ Rodriguez-Fuentes, M Penagarikano… - Interspeech, 2013 - isca-archive.org
The combination of several heterogeneous systems is known to provide remarkable
performance improvements in verification and detection tasks. In Spoken Term Detection …

An efficient TF-IDF based query by example spoken term detection

A Singh, V Arora, YPP Chen - 2024 IEEE Conference on …, 2024 - ieeexplore.ieee.org
In the present research, we tackle the problem of query by example spoken term detection
(QbE-STD) in the zero-resource scenario. State-of-the-art methods typically use dynamic …