A survey of cross-lingual word embedding models

S Ruder, I Vulić, A Søgaard - Journal of Artificial Intelligence Research, 2019 - jair.org
Cross-lingual representations of words enable us to reason about word meaning in
multilingual contexts and are a key facilitator of cross-lingual transfer when developing …

Universal dependencies v1: A multilingual treebank collection

J Nivre, MC De Marneffe, F Ginter… - Proceedings of the …, 2016 - aclanthology.org
Cross-linguistically consistent annotation is necessary for sound comparative evaluation
and cross-lingual learning experiments. It is also useful for multilingual system development …

Retrofitting word vectors to semantic lexicons

M Faruqui, J Dodge, SK Jauhar, C Dyer, E Hovy… - arXiv preprint arXiv …, 2014 - arxiv.org
Vector space word representations are learned from distributional information of words in
large corpora. Although such statistics are semantically informative, they disregard the …

The social impact of natural language processing

D Hovy, SL Spruit - Proceedings of the 54th Annual Meeting of the …, 2016 - aclanthology.org
Medical sciences have long since established an ethics code for experiments, to minimize
the risk of harm to subjects. Natural language processing (NLP) used to involve mostly …

JW300: A wide-coverage parallel corpus for low-resource languages

Ž Agić, I Vulić - 2019 - repository.cam.ac.uk
Viable cross-lingual transfer critically depends on the availability of parallel texts. Shortage
of such resources imposes a development and evaluation bottleneck in multilingual …

Massively multilingual transfer for NER

A Rahimi, Y Li, T Cohn - arXiv preprint arXiv:1902.00193, 2019 - arxiv.org
In cross-lingual transfer, NLP models over one or more source languages are applied to a
low-resource target language. While most prior work has used a single source model or a …

Universal dependency annotation for multilingual parsing

R McDonald, J Nivre… - Proceedings of the …, 2013 - aclanthology.org
We present a new collection of treebanks with homogeneous syntactic dependency
annotation for six languages: German, English, Swedish, Spanish, French and Korean. To …

A universal part-of-speech tagset

S Petrov, D Das, R McDonald - arXiv preprint arXiv:1104.2086, 2011 - arxiv.org
To facilitate future research in unsupervised induction of syntactic structure and to
standardize best-practices, we propose a tagset that consists of twelve universal part-of …

Neural cross-lingual named entity recognition with minimal resources

J Xie, Z Yang, G Neubig, NA Smith… - arXiv preprint arXiv …, 2018 - arxiv.org
For languages with no annotated resources, unsupervised transfer of natural language
processing models such as named-entity recognition (NER) from resource-rich languages …

An autoencoder approach to learning bilingual word representations

S Chandar AP, S Lauly, H Larochelle… - Advances in neural …, 2014 - proceedings.neurips.cc
Cross-language learning allows us to use training data from one language to build models
for a different language. Many approaches to bilingual learning require that we have word …