[HTML][HTML] Neural representations for modeling variation in speech

M Bartelds, W de Vries, F Sanal, C Richter… - Journal of …, 2022 - Elsevier
Variation in speech is often quantified by comparing phonetic transcriptions of the same
utterance. However, manually transcribing speech is time-consuming and error prone. As an …

[HTML][HTML] How pronunciation distance impacts word recognition in children and adults

T Bent, RF Holt, KJ Van Engen, IA Jamsek… - The Journal of the …, 2021 - pubs.aip.org
Although unfamiliar accents can pose word identification challenges for children and adults,
few studies have directly compared perception of multiple nonnative and regional accents or …

[HTML][HTML] Deep learning assessment of syllable affiliation of intervocalic consonants

Z Liu, Y Xu - The Journal of the Acoustical Society of America, 2023 - pubs.aip.org
In English, a sentence like “He made out our intentions.” could be misperceived as “He may
doubt our intentions.” because the coda/d/sounds like it has become the onset of the next …

Relative dynamic time war** comparison for pronunciation errors

C Richter, J Guðnason - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
We propose using a dynamic time war** (DTW) difference-to-sum ratio to classify speech
as either matching or diverging from a linguistic standard. This measure effectively …

Using acoustic distance and acoustic absement to quantify lexical competition

MC Kelley, BV Tucker - The Journal of the Acoustical Society of …, 2022 - pubs.aip.org
Using phonological neighborhood density has been a common method to quantify lexical
competition. It is useful and convenient but has shortcomings that are worth reconsidering …

Determining optimal talker variability for nonnative speech training: A systematic review and Bayesian network meta-analysis

X Zhang, B Cheng, Y Zou, Y Zhang - Journal of Speech, Language …, 2025 - pubs.asha.org
Purpose: This meta-analysis study aimed to determine the optimal level of talker variability in
training to maximize second-language speech learning. Method: We conducted a systematic …

Open methods: decolonizing (or not) research methods in linguistics

D Villarreal, L Collister - 2024 - d-scholarship.pitt.edu
Open Methods are resources that pertain to at least one stage in the linguistics research
process and are available free of charge to all who can find them (eg, Boersma & Weenink …

Quantifying language variation acoustically with few resources

M Bartelds, M Wieling - arxiv preprint arxiv:2205.02694, 2022 - arxiv.org
Deep acoustic models represent linguistic information based on massive amounts of data.
Unfortunately, for regional languages and dialects such resources are mostly not available …

The Mason-Alberta Phonetic Segmenter: a forced alignment system based on deep neural networks and interpolation

MC Kelley, SJ Perry, BV Tucker - Phonetica, 2024 - degruyter.com
Given an orthographic transcription, forced alignment systems automatically determine
boundaries between segments in speech, facilitating the use of large corpora. In the present …

[HTML][HTML] Relating pronunciation distance metrics to intelligibility across English accents

T Bent, M Henry, RF Holt, H Lind-Combs - Journal of Phonetics, 2024 - Elsevier
Unfamiliar accents can cause word recognition challenges, particularly in noisy
environments, but few studies have incorporated quantitative pronunciation distance metrics …