Deep spoken keyword spotting: An overview

I López-Espejo, ZH Tan, JHL Hansen, J Jensen - IEEE Access, 2021‏ - ieeexplore.ieee.org
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …

End-to-end ASR-free keyword search from speech

K Audhkhasi, A Rosenberg, A Sethy… - IEEE Journal of …, 2017‏ - ieeexplore.ieee.org
Conventional keyword search (KWS) systems for speech databases match the input text
query to the set of word hypotheses generated by an automatic speech recognition (ASR) …

Method and system for efficient spoken term detection using confusion networks

BED Kingsbury, HK Kuo, L Mangu, H Soltau - US Patent 9,196,243, 2015‏ - Google Patents
US9196243B2 - Method and system for efficient spoken term detection using confusion
networks - Google Patents US9196243B2 - Method and system for efficient spoken term …

Recent developments in spoken term detection: a survey

A Mandal, KR Prasanna Kumar, P Mitra - International Journal of Speech …, 2014‏ - Springer
Spoken term detection (STD) provides an efficient means for content based indexing of
speech. However, achieving high detection performance, faster speed, detecting ot-of …

Attention-based end-to-end models for small-footprint keyword spotting

C Shan, J Zhang, Y Wang, L **e - arxiv preprint arxiv:1803.10916, 2018‏ - arxiv.org
In this paper, we propose an attention-based end-to-end neural approach for small-footprint
keyword spotting (KWS), which aims to simplify the pipelines of building a production-quality …

Spoken content retrieval—beyond cascading speech recognition with text retrieval

L Lee, J Glass, H Lee, C Chan - IEEE/ACM Transactions on …, 2015‏ - ieeexplore.ieee.org
Spoken content retrieval refers to directly indexing and retrieving spoken content based on
the audio rather than text descriptions. This potentially eliminates the requirement of …

Conference segmentation based on conversational dynamics

RJ Cartwright, K Li, X Sun - US Patent 10,522,151, 2019‏ - Google Patents
2017-08-02 Assigned to DOLBY LABORATORIES LICENSING CORPORATION
reassignment DOLBY LABORATORIES LICENSING CORPORATION ASSIGNMENT OF …

Using proxies for OOV keywords in the keyword search task

G Chen, O Yilmaz, J Trmal, D Povey… - 2013 IEEE Workshop …, 2013‏ - ieeexplore.ieee.org
We propose a simple but effective weighted finite state transducer (WFST) based framework
for handling out-of-vocabulary (OOV) keywords in a speech search task. State-of-the-art …

Crowd counting based on multiscale spatial guided perception aggregation network

Z Chen, S Zhang, X Zheng, X Zhao… - IEEE Transactions on …, 2023‏ - ieeexplore.ieee.org
Crowd counting has received extensive attention in the field of computer vision, and
methods based on deep convolutional neural networks (CNNs) have made great progress …

Scheduling playback of audio in a virtual acoustic space

X Sun, RJ Cartwright, MP Hollier, M Eckert - US Patent 10,334,384, 2019‏ - Google Patents
2017-09-12 Assigned to DOLBY LABORATORIES LICENSING CORPORATION
reassignment DOLBY LABORATORIES LICENSING CORPORATION ASSIGNMENT OF …