Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning

S Ding, G Zhao, R Gutierrez-Osuna - Computer Speech & Language, 2022 - Elsevier
Foreign accent conversion (FAC) aims to create a new voice that has the voice identity of a
given second-language (L2) speaker but with a native (L1) accent. Previous FAC …

Converting foreign accent speech without a reference

G Zhao, S Ding… - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
Foreign accent conversion (FAC) is the problem of generating a synthetic voice that has the
voice identity of a second-language (L2) learner and the pronunciation patterns of a native …

Speech production real‐time MRI at 0.55 T

Y Lim, P Kumar, KS Nayak - Magnetic Resonance in Medicine, 2024 - Wiley Online Library
Purpose: To demonstrate speech‐production real‐time MRI (RT‐MRI) using a contemporary
0.55 T system, and to identify opportunities for improved performance compared with …

An improved air tissue boundary segmentation technique for real time magnetic resonance imaging video using SegNet

CA Valliappan, A Kumar, R Mannem… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
This paper presents an improved methodology for the segmentation of the Air-Tissue
boundaries (ATBs) in the upper airway of the human vocal tract using Real-Time Magnetic …

Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video Using Semantic Segmentation with Fully Convolutional Networks

CA Valliappan, R Mannem, PK Ghosh - InterSpeech, 2018 - researchgate.net
In this paper, we propose a new technique for the segmentation of the Air-Tissue
Boundaries (ATBs) in the vocal tract from the real-time magnetic resonance imaging (rtMRI) …

Speaker dependent articulatory-to-acoustic mapping using real-time MRI of the vocal tract

TG Csapó - …
… mapping is a technique to predict speech using various
articulatory acquisition techniques (e.g. ultrasound tongue imaging, lip video). Real-time MRI …

Reconstructing speech from real-time articulatory MRI using neural vocoders

Y Yu, AH Shandiz, L Tóth - 2021 29th European Signal …, 2021 - ieeexplore.ieee.org
Several approaches exist for the recording of articulatory movements, such as
electromagnetic and permanent magnetic articulography, ultrasound tongue imaging and …

Vocal tract contour tracking in rtMRI using deep temporal regression network

S Asadiabadi, E Erzin - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org
Recent advances in real-time Magnetic Resonance Imaging (rtMRI) provide an invaluable
tool to study speech articulation. In this paper, we present an effective deep learning …

Air-tissue boundary segmentation in real time magnetic resonance imaging video using a convolutional encoder-decoder network

R Mannem, PK Ghosh - ICASSP 2019-2019 IEEE International …, 2019 - ieeexplore.ieee.org
In this paper, we propose a convolutional encoder-decoder network (CEDN) based
approach for upper and lower Air-Tissue Boundary (ATB) segmentation within vocal tract in …

Real-time MRI video synthesis from time aligned phonemes with sequence-to-sequence networks

S Udupa, PK Ghosh - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Real-Time Magnetic resonance imaging (rtMRI) of the midsagittal plane of the mouth is of
interest for speech production research. In this work, we focus on estimating utterance level …