An overview of deep-learning-based audio-visual speech enhancement and separation

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

[HTML][HTML] Brain-machine interfaces: from basic science to neuroprostheses and neurorehabilitation

MA Lebedev, MAL Nicolelis - Physiological reviews, 2017 - journals.physiology.org
Brain-machine interfaces (BMIs) combine methods, approaches, and concepts derived from
neurophysiology, computer science, and engineering in an effort to establish real-time …

Decoding lip language using triboelectric sensors with deep learning

Y Lu, H Tian, J Cheng, F Zhu, B Liu, S Wei, L Ji… - Nature …, 2022 - nature.com
Lip language is an effective method of voice-off communication in daily life for people with
vocal cord lesions and laryngeal and lingual injuries without occupying the hands …

Visual speech recognition for multiple languages in the wild

P Ma, S Petridis, M Pantic - Nature Machine Intelligence, 2022 - nature.com
Visual speech recognition (VSR) aims to recognize the content of speech based on lip
movements, without relying on the audio stream. Advances in deep learning and the …

Force-induced ion generation in zwitterionic hydrogels for a sensitive silent-speech sensor

S Xu, JX Yu, H Guo, S Tian, Y Long, J Yang… - Nature …, 2023 - nature.com
Human-sensitive mechanosensation depends on ionic currents controlled by skin
mechanoreceptors. Inspired by the sensory behavior of skin, we investigate zwitterionic …

Inferring imagined speech using EEG signals: a new approach using Riemannian manifold features

CH Nguyen, GK Karavas… - Journal of neural …, 2017 - iopscience.iop.org
Objective. In this paper, we investigate the suitability of imagined speech for brain–computer
interface (BCI) applications. Approach. A novel method based on covariance matrix …

Ultrasensitive textile strain sensors redefine wearable silent speech interfaces with high machine learning efficiency

C Tang, M Xu, W Yi, Z Zhang, E Occhipinti… - npj Flexible …, 2024 - nature.com
This work introduces a silent speech interface (SSI), proposing a few-layer graphene (FLG)
strain sensing mechanism based on thorough cracks and AI-based self-adaptation …

Thinking out loud, an open-access EEG-based BCI dataset for inner speech recognition

N Nieto, V Peterson, HL Rufiner, JE Kamienkowski… - Scientific data, 2022 - nature.com
Surface electroencephalography is a standard and noninvasive way to measure electrical
brain activity. Recent advances in artificial intelligence led to significant improvements in the …

Biosignal-based spoken communication: A survey

T Schultz, M Wand, T Hueber… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org
Speech is a complex process involving a wide range of biosignals, including but not limited
to acoustics. These biosignals-stemming from the articulators, the articulator muscle …

Lipreading with long short-term memory

M Wand, J Koutník… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Lipreading, ie speech recognition from visual-only recordings of a speaker's face, can be
achieved with a processing pipeline based solely on neural networks, yielding significantly …