End-to-end slot alignment and recognition for cross-lingual NLU

W Xu, B Haider, S Mansour - arXiv preprint arXiv:2004.14353, 2020 - arxiv.org
Natural language understanding (NLU) in the context of goal-oriented dialog systems
typically includes intent classification and slot labeling tasks. Existing methods to expand an …

Learning multilingual named entity recognition from Wikipedia

J Nothman, N Ringland, W Radford, T Murphy… - Artificial Intelligence, 2013 - Elsevier
We automatically create enormous, free and multilingual silver-standard training annotations
for named entity recognition (NER) by exploiting the text and structure of Wikipedia. Most NER …

Entity projection via machine translation for cross-lingual NER

A Jain, B Paranjape, ZC Lipton - arXiv preprint arXiv:1909.05356, 2019 - arxiv.org
Although over 100 languages are supported by strong off-the-shelf machine translation
systems, only a subset of them possess large annotated corpora for named entity …

Geo‐parsing messages from microtext

J Gelernter, N Mushegian - Transactions in GIS, 2011 - Wiley Online Library
Widespread use of social media during crises has become commonplace, as shown by the
volume of messages during the Haiti earthquake of 2010 and Japan tsunami of 2011 …

Naamapadam: A large-scale named entity annotated data for Indic languages

A Mhaske, H Kedia, S Doddapaneni… - arXiv preprint arXiv …, 2022 - arxiv.org
We present Naamapadam, the largest publicly available Named Entity Recognition (NER)
dataset for the 11 major Indian languages from two language families. The dataset contains …

Multilingual code-switching for zero-shot cross-lingual intent prediction and slot filling

J Krishnan, A Anastasopoulos, H Purohit… - arXiv preprint arXiv …, 2021 - arxiv.org
Predicting user intent and detecting the corresponding slots from text are two key problems
in Natural Language Understanding (NLU). In the context of zero-shot learning, this task is …

Cross-lingual metaphor detection using common semantic features

Y Tsvetkov, E Mukomel… - Proceedings of the First …, 2013 - aclanthology.org
We present the CSF (Common Semantic Features) method for metaphor detection. This
method has two distinguishing characteristics: it is cross-lingual and it does not rely on the …

Cross-lingual text classification of transliterated Hindi and Malayalam

J Krishnan, A Anastasopoulos… - … Conference on Big …, 2022 - ieeexplore.ieee.org
Transliteration is very common on social media, but transliterated text is not adequately
handled by modern neural models for various NLP tasks. In this work, we combine data …

Building a multilingual named entity-annotated corpus using annotation projection

M Ehrmann, M Turchi… - Proceedings of the …, 2011 - aclanthology.org
As developers of a highly multilingual named entity recognition (NER) system, we face an
evaluation resource bottleneck problem: we need evaluation data in many languages, the …

Developments of Swahili resources for an automatic speech recognition system.

H Gelas, L Besacier, F Pellegrino - SLTU, 2012 - isca-archive.org
This article describes our efforts to provide ASR resources for Swahili, a Bantu language
spoken in a wide area of East Africa. We start with an introduction on the language situation …