Mel-frequency cepstral coefficient features based on standard deviation and principal component analysis for language identification systems
Spoken language identification (LID) is the process of determining and classifying natural
language from a given content and dataset. Data must be processed to extract useful …
A review of social background profiling of speakers from speech accents
Social background profiling of speakers is heavily used in areas, such as, speech forensics,
and tuning speech recognition for accuracy improvement. This article provides a survey of …
Native language identification in very short utterances using bidirectional long short-term memory network
Native language identification (NLI) is the task of identifying the first language of a user
based on their speech or written text in a second language. In this paper, we propose the …
Deep neural architectures for dialect classification with single frequency filtering and zero-time windowing feature representations
The goal of this study is to investigate advanced signal processing approaches [single
frequency filtering (SFF) and zero-time windowing (ZTW)] with modern deep neural networks …
Exploring end-to-end attention-based neural networks for native language identification
Automatic identification of speakers' native language (L1) based on their speech in a second
language (L2) is a challenging research problem that can aid several spoken language …
Accent classification from an emotional speech in clean and noisy environments
KS Rao - Multimedia Tools and Applications, 2023 - Springer
The performance of speech emotion recognition systems (SER) suffers when emotional
speech is spoken in different accents. One possible solution to such a problem is to identify …
Estimating Social Background Profiling of Indian Speakers by Acoustic Speech Features: SPEECH ACCENT CLASSIFICATION BY ACOUSTIC ANALYSIS
Social background profiling of speakers refers to estimating the geographical origin of
speakers by their speech features. Methods for accent profiling that use linguistic features …
Native language identification from raw waveforms using deep convolutional neural networks with attentive pooling
Automatic detection of an individual's native language (L1) based on speech data from their
second language (L2) can be useful for informing a variety of speech applications such as …
Combining textual and speech features in the NLI task using state-of-the-art machine learning techniques
We summarize the involvement of our CEMI team in the "NLI Shared Task 2017", which
deals with both textual and speech input data. We submitted the results achieved by using …
End-to-End Native Language Identification Using a Modified Vision Transformer (ViT) from L2 English Speech
Native language identification involves identifying the mother tongue of a person from an
audio recording of their speech in a second language. Improving native language …