Human and computer recognition of regional accents and ethnic groups from British English speech

A Hanani, MJ Russell, MJ Carey - Computer Speech & Language, 2013 - Elsevier
The paralinguistic information in a speech signal includes clues to the geographical and
social background of the speaker. This paper is concerned with automatic extraction of this …

Automatic accent identification as an analytical tool for accent robust automatic speech recognition

M Najafian, M Russell - Speech Communication, 2020 - Elsevier
We present a novel study of relationships between automatic accent identification (AID) and
accent-robust automatic speech recognition (ASR), using i-vector based AID and deep …

Exploiting convolutional neural networks for phonotactic based dialect identification

M Najafian, S Khurana, S Shan, A Ali… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
In this paper, we investigate different approaches for Dialect Identification (DID) in Arabic
broadcast speech. Dialects differ in their inventory of phonological segments. This paper …

[PDF][PDF] Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems.

M Najafian, S Safavi, P Weber, MJ Russell - Odyssey, 2016 - odyssey2016.org
The para-linguistic information in a speech signal includes clues to the geographical and
social background of the speaker. This paper is concerned with recognition of the 14 …

[PDF][PDF] The effect of listener accent background on accent perception and comprehension

A Ikeno, JHL Hansen - EURASIP Journal on Audio, Speech, and Music …, 2007 - Springer
Variability of speaker accent is a challenge for effective human communication as well as
speech technology including automatic speech recognition and accent identification. The …

Unsupervised model selection for recognition of regional accented speech

M Najafian, A DeMarco, S Cox… - Interspeech …, 2014 - research.birmingham.ac.uk
This paper is concerned with automatic speech recognition (ASR) for accented speech.
Given a small amount of speech from a new speaker, is it better to apply speaker adaptation …

Mel-weighted single frequency filtering spectrogram for dialect identification

R Kethireddy, SR Kadiri, P Alku… - IEEE Access, 2020 - ieeexplore.ieee.org
In this study, we propose Mel-weighted single frequency filtering (SFF) spectrograms for
dialect identification. The spectrum derived using SFF has high spectral resolution for …

Linguistic-acoustic similarity based accent shift for accent recognition

Q Shao, J Yan, J Kang, P Guo, X Shi, P Hu… - arxiv preprint arxiv …, 2022 - arxiv.org
General accent recognition (AR) models tend to directly extract low-level information from
spectrums, which always significantly overfit on speakers or channels. Considering accent …

[PDF][PDF] A Review on Grapheme-to-Phoneme Modelling Techniques to Transcribe Pronunciation Variants for Under-Resourced Language.

E Irie, SS Juan, S Saee - Pertanika Journal of Science & …, 2023 - journals-jd.upm.edu.my
ABSTRACT A pronunciation dictionary (PD) is one of the components in an Automatic
Speech Recognition (ASR) system, a system that is used to convert speech to text. The …

Automatic phonetic transcription of large speech corpora

C Van Bael, L Boves, H van den Heuvel… - Computer Speech & …, 2007 - Elsevier
Most large speech corpora are delivered with a lexicon that contains a canonical
transcription of every word in the orthographic transcription. Such a lexicon can be used for …