Language modeling for code-mixing: The role of linguistic theory based synthetic data
Training language models for Code-mixed (CM) language is known to be a difficult problem
because of lack of data compounded by the increased confusability due to the presence of …
Modeling code-switch languages using bilingual parallel corpus
Abstract Language modeling is the technique to estimate the probability of a sequence of
words. A bilingual language model is expected to model the sequential dependency for …
Code-switched language models using dual RNNs and same-source pretraining
This work focuses on building language models (LMs) for code-switched text. We propose
two techniques that significantly improve these LMs: 1) A novel recurrent neural network unit …
Curriculum design for code-switching: Experiments with language identification and language modeling with deep neural networks
Curriculum learning strategies are known to improve the accuracy, robustness and
convergence rate for various language learning tasks using deep architectures (Bengio et …
Dual language models for code switched speech recognition
In this work, we present a simple and elegant approach to language modeling for bilingual
code-switched text. Since code-switching is a blend of two or more different languages, a …
Improving N-gram language modeling for code-switching speech recognition
Code-switching language modeling is challenging due to statistics of each individual
language, as well as statistics of cross-lingual language are insufficient. To compensate for …
An improved framework for recognizing highly imbalanced bilingual code-switched lectures with cross-language acoustic modeling and frame-level language …
CF Yeh, LS Lee - IEEE/ACM Transactions on Audio, Speech …, 2015 - ieeexplore.ieee.org
This paper considers the recognition of a widely observed type of bilingual code-switched
speech: the speaker speaks primarily the host language (usually his native language), but …