Enhancing voice wake-up for dysarthria: Mandarin dysarthria speech corpus release and customized system design

M Gao, H Chen, J Du, X Xu, H Guo, H Bu… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Smart home technology has gained widespread adoption, facilitating effortless control of
devices through voice commands. However, individuals with dysarthria, a motor speech …

Efficient personal voice activity detection with wake word reference speech

B Zeng, M Cheng, Y Tian, H Liu… - ICASSP 2024-2024 IEEE …, 2024‏ - ieeexplore.ieee.org
Personal voice activity detection (PVAD) is gradually used in speech assistants. Traditional
PVAD schemes extract the target speaker's embedding from existing query reference …

Robust wake word spotting with frame-level cross-modal attention based audio-visual conformer

H Wang, M Cheng, Q Fu, M Li - ICASSP 2024-2024 IEEE …, 2024‏ - ieeexplore.ieee.org
In recent years, neural network-based Wake Word Spotting achieves good performance on
clean audio samples but struggles in noisy environments. Audio-Visual Wake Word Spotting …

Exploring Semi-Supervised, Subcategory Classification and Subwords Alignment for Visual Wake Word Spotting

S **ong, L Dai - … on Multimedia and Expo Workshops (ICMEW), 2024‏ - ieeexplore.ieee.org
In this paper, we describe our approaches in Chat-scenario Chinese Lipreading (ChatCLR)
Challenge task 1, which mainly explores semi-supervised, subcategory classification and …

Lightweight Audio-Visual Wake Word Spotting with Diverse Acoustic Knowledge Distillation

KW Li, H Chen, J Du, HS Zhou… - … on Circuits and …, 2025‏ - ieeexplore.ieee.org
Audio-Visual Wake Word Spotting (AVWWS) aims to accurately detect user-defined
keywords by leveraging the complementary nature of different modalities in challenging …

Summary of Low-Resource Dysarthria Wake-Up Word Spotting Challenge

M Gao, H Chen, J Du, X Xu, H Guo… - 2024 IEEE Spoken …, 2024‏ - ieeexplore.ieee.org
In recent years, the rapid advancement and widespread adoption of speech technology
have made smart home systems a common feature in many households. However …

Optimizing Dysarthria Wake-Up Word Spotting: an End-to-End Approach For SLT 2024 LRDWWS Challenge

S Liu, Y Kong, P Guo, W Zhuang, P Gao… - 2024 IEEE Spoken …, 2024‏ - ieeexplore.ieee.org
Speech has emerged as a widely embraced user interface across diverse applications.
However, for individuals with dysarthria, the inherent variability in their speech poses …

Summary on the Chat-Scenario Chinese Lipreading (ChatCLR) Challenge

CY Zhang, H Chen, J Du, SM Siniscalchi… - … on Multimedia and …, 2024‏ - ieeexplore.ieee.org
Lipreading which infers spoken content based solely on visual information such as lip
movements is crucial in multi-modal research medicine and human-computer interaction …

The Whu Wake Word Lipreading System for the 2024 Chat-Scenario Chinese Lipreading Challenge

H Wang, C Li, F Su, J Liu, H Suo… - 2024 IEEE International …, 2024‏ - ieeexplore.ieee.org
The paper describes the Wake Word Lipreading system developed by the WHU team for the
ChatCLR Challenge 2024. Although Lipreading and Wake Word Spotting have seen …

Audio-Visual Wake-up Word Spotting Under Noisy and Multi-person Scenarios

C Li, F Su, J Liu - International Conference on Pattern Recognition, 2024‏ - Springer
The existing audio-visual wake-up word spotting (AVWWS) methods assume that the audio
signal has been aligned with the lip movement video signal of a specific speaker in noisy …