DCCRN-KWS: An audio bias based model for noise robust small-footprint keyword spotting

S Lv, X Wang, S Sun, L Ma, L Xie - arXiv preprint arXiv:2305.12331, 2023 - arxiv.org
Real-world complex acoustic environments, especially the ones with a low signal-to-noise
ratio (SNR), will bring tremendous challenges to a keyword spotting (KWS) system. Inspired …

Speed-robust keyword spotting via soft self-attention on multi-scale features

C Ding, J Li, M Zong, B Li - 2022 IEEE Spoken Language …, 2023 - ieeexplore.ieee.org
In this work, we focus on the robustness of keyword spotting (KWS) at various speech
speeds. First, to enable small-footprint KWS, we graft a depthwise separable convolution …

The 2020 personalized voice trigger challenge: Open database, evaluation metrics and the baseline systems

Y Jia, X Wang, X Qin, Y Zhang, X Wang… - arXiv preprint arXiv …, 2021 - arxiv.org
The 2020 Personalized Voice Trigger Challenge (PVTC2020) addresses two different
research problems in a unified setup: joint wake-up word detection with speaker verification on …

Auto-KWS 2021 Challenge: Task, datasets, and baselines

J Wang, Y He, C Zhao, Q Shao, WW Tu, T Ko… - arXiv preprint arXiv …, 2021 - arxiv.org
Auto-KWS 2021 challenge calls for automated machine learning (AutoML) solutions to
automate the process of applying machine learning to a customized keyword spotting task …

OPC-KWS: Optimizing Keyword Spotting with Path Retrieval Decoding and Contrastive Learning

J Li, X Liu, X Zhang - 2024 IEEE 14th International Symposium …, 2024 - ieeexplore.ieee.org
As voice interaction capabilities with smart devices advance and the demand for
personalized wake words increases, customized keyword spotting (KWS) has become …

Improving voice trigger detection with metric learning

P Nayak, T Higuchi, A Gupta, S Ranjan, S Shum… - arXiv preprint arXiv …, 2022 - arxiv.org
Voice trigger detection is an important task, which enables activating a voice assistant when
a target user speaks a keyword phrase. A detector is typically trained on speech data …

An integrated framework for two-pass personalized voice trigger

D Liao, J Li, Y Zhi, S Li, Q Hong, L Li - arXiv preprint arXiv:2106.15950, 2021 - arxiv.org
In this paper, we present the XMUSPEECH system for Task 1 of 2020 Personalized Voice
Trigger Challenge (PVTC2020). Task 1 is a joint wake-up word detection with speaker …

[PDF] Auto-KWS 2021 Challenge: Task, Datasets, and Baselines

LX Lee - academia.edu
Auto-KWS 2021 challenge calls for automated machine learning (AutoML) solutions to
automate the process of applying machine learning to a customized keyword spotting task …

[CITATION] Personalized User-Defined Keyword Spotting in Household Environments: A Text-Audio Multi-Modality Approach

Z Ai, Z Chen, X Li, S Xu - Available at SSRN 4733124