Enrichment of oesophageal speech: Voice conversion with duration–matched synthetic speech as target
Pathological speech such as Oesophageal Speech (OS) is difficult to understand due to the
presence of undesired artefacts and lack of normal healthy speech characteristics. Modern …
presence of undesired artefacts and lack of normal healthy speech characteristics. Modern …
Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals
R Maskeliūnas, R Damaševičius, A Kulikajevas… - Cancers, 2023 - mdpi.com
Simple Summary This paper introduces a new method for cleaning impaired speech by
combining Pareto-optimized deep learning with Non-negative Matrix Factorization (NMF) …
combining Pareto-optimized deep learning with Non-negative Matrix Factorization (NMF) …
Alaryngeal Speech Enhancement for Noisy Environments Using a Pareto Denoising Gated LSTM
R Maskeliūnas, R Damaševičius, A Kulikajevas… - Journal of Voice, 2024 - Elsevier
Loss of the larynx significantly alters natural voice production, requiring alternative
communication modalities and rehabilitation methods to restore speech intelligibility and …
communication modalities and rehabilitation methods to restore speech intelligibility and …
Generative models for improved naturalness, intelligibility, and voicing of whispered speech
This work adapts two recent architectures of generative models and evaluates their
effectiveness for the conversion of whispered speech to normal speech. We incorporate the …
effectiveness for the conversion of whispered speech to normal speech. We incorporate the …
[PDF][PDF] Investigating Speech Reconstruction for Laryngectomees for Silent Speech Interfaces.
Silent speech interfaces (SSIs) are devices that convert nonaudio bio-signals to speech,
which hold the potential of recovering quality speech for laryngectomees (people who have …
which hold the potential of recovering quality speech for laryngectomees (people who have …
Mandarin electrolaryngeal speech voice conversion using cross-domain features
HH Chen, YL Chien, MC Yen, SW Tsai, Y Tsao… - arxiv preprint arxiv …, 2023 - arxiv.org
Patients who have had their entire larynx removed, including the vocal folds, owing to throat
cancer may experience difficulties in speaking. In such cases, electrolarynx devices are …
cancer may experience difficulties in speaking. In such cases, electrolarynx devices are …
Quality of Experience: Comparison of Synthesized Speech Naturalness Between Apple's Siri and Google Translate Referring to Thai Language
T Daengsi… - … Conference on Computer …, 2021 - ieeexplore.ieee.org
This paper presents application of a speech quality measurement method to evaluate the
naturalness of synthesized speech from Text-To-Speech synthesis (TTS) which is an …
naturalness of synthesized speech from Text-To-Speech synthesis (TTS) which is an …
[HTML][HTML] Assessment of Self-Supervised Denoising Methods for Esophageal Speech Enhancement
Esophageal speech (ES) is a pathological voice that is often difficult to understand.
Moreover, acquiring recordings of a patient's voice before a laryngectomy proves …
Moreover, acquiring recordings of a patient's voice before a laryngectomy proves …
Research on Tone Enhancement of Mandarin Pitch Controllable Electrolaryngeal Speech Based on Deep Learning
J Zhou, L Wang, F Li, S Zhang, T Liu… - 2024 46th Annual …, 2024 - ieeexplore.ieee.org
The deep learning-based electrolaryngeal (EL) voice conversion methods have achieved
good results in non-tonal languages. However, the effectiveness in tonal languages, such as …
good results in non-tonal languages. However, the effectiveness in tonal languages, such as …
Time alignment using lip images for frame-based electrolaryngeal voice conversion
YS Liou, WC Huang, MC Yen, SW Tsai… - 2021 Asia-Pacific …, 2021 - ieeexplore.ieee.org
Voice conversion (VC) is an effective approach to electrolaryngeal (EL) speech
enhancement, a task that aims to improve the quality of the artificial voice from an …
enhancement, a task that aims to improve the quality of the artificial voice from an …