Enrichment of oesophageal speech: Voice conversion with duration–matched synthetic speech as target

S Raman, X Sarasola, E Navas, I Hernaez - Applied Sciences, 2021 - mdpi.com
Pathological speech such as Oesophageal Speech (OS) is difficult to understand due to the
presence of undesired artefacts and lack of normal healthy speech characteristics. Modern …

Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals

R Maskeliūnas, R Damaševičius, A Kulikajevas… - Cancers, 2023 - mdpi.com
Simple Summary This paper introduces a new method for cleaning impaired speech by
combining Pareto-optimized deep learning with Non-negative Matrix Factorization (NMF) …

Alaryngeal Speech Enhancement for Noisy Environments Using a Pareto Denoising Gated LSTM

R Maskeliūnas, R Damaševičius, A Kulikajevas… - Journal of Voice, 2024 - Elsevier
Loss of the larynx significantly alters natural voice production, requiring alternative
communication modalities and rehabilitation methods to restore speech intelligibility and …

Generative models for improved naturalness, intelligibility, and voicing of whispered speech

D Wagner, SP Bayerl, HAC Maruri… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
This work adapts two recent architectures of generative models and evaluates their
effectiveness for the conversion of whispered speech to normal speech. We incorporate the …

[PDF][PDF] Investigating Speech Reconstruction for Laryngectomees for Silent Speech Interfaces.

B Cao, N Sebkhi, A Bhavsar, OT Inan, R Samlan… - Interspeech, 2021 - researchgate.net
Silent speech interfaces (SSIs) are devices that convert nonaudio bio-signals to speech,
which hold the potential of recovering quality speech for laryngectomees (people who have …

Mandarin electrolaryngeal speech voice conversion using cross-domain features

HH Chen, YL Chien, MC Yen, SW Tsai, Y Tsao… - arxiv preprint arxiv …, 2023 - arxiv.org
Patients who have had their entire larynx removed, including the vocal folds, owing to throat
cancer may experience difficulties in speaking. In such cases, electrolarynx devices are …

Quality of Experience: Comparison of Synthesized Speech Naturalness Between Apple's Siri and Google Translate Referring to Thai Language

T Daengsi… - … Conference on Computer …, 2021 - ieeexplore.ieee.org
This paper presents application of a speech quality measurement method to evaluate the
naturalness of synthesized speech from Text-To-Speech synthesis (TTS) which is an …

[HTML][HTML] Assessment of Self-Supervised Denoising Methods for Esophageal Speech Enhancement

M Amarjouf, EH Ibn Elhaj, M Chami, K Ezzine… - Applied Sciences, 2024 - mdpi.com
Esophageal speech (ES) is a pathological voice that is often difficult to understand.
Moreover, acquiring recordings of a patient's voice before a laryngectomy proves …

Research on Tone Enhancement of Mandarin Pitch Controllable Electrolaryngeal Speech Based on Deep Learning

J Zhou, L Wang, F Li, S Zhang, T Liu… - 2024 46th Annual …, 2024 - ieeexplore.ieee.org
The deep learning-based electrolaryngeal (EL) voice conversion methods have achieved
good results in non-tonal languages. However, the effectiveness in tonal languages, such as …

Time alignment using lip images for frame-based electrolaryngeal voice conversion

YS Liou, WC Huang, MC Yen, SW Tsai… - 2021 Asia-Pacific …, 2021 - ieeexplore.ieee.org
Voice conversion (VC) is an effective approach to electrolaryngeal (EL) speech
enhancement, a task that aims to improve the quality of the artificial voice from an …