Xtreme: A massively multilingual multi-task benchmark for evaluating cross-lingual generalisation

J Hu, S Ruder, A Siddhant, G Neubig… - International …, 2020 - proceedings.mlr.press
Much recent progress in applications of machine learning models to NLP has been driven
by benchmarks that evaluate models across a wide variety of tasks. However, these broad …

State-of-the-art generalisation research in NLP: a taxonomy and review

D Hupkes, M Giulianelli, V Dankers, M Artetxe… - arxiv preprint arxiv …, 2022 - arxiv.org
The ability to generalise well is one of the primary desiderata of natural language
processing (NLP). Yet, what'good generalisation'entails and how it should be evaluated is …

A model of two tales: Dual transfer learning framework for improved long-tail item recommendation

Y Zhang, DZ Cheng, T Yao, X Yi, L Hong… - Proceedings of the web …, 2021 - dl.acm.org
Highly skewed long-tail item distribution is very common in recommendation systems. It
significantly hurts model performance on tail items. To improve tail-item recommendation …

A call for more rigor in unsupervised cross-lingual learning

M Artetxe, S Ruder, D Yogatama, G Labaka… - arxiv preprint arxiv …, 2020 - arxiv.org
We review motivations, definition, approaches, and methodology for unsupervised cross-
lingual learning and call for a more rigorous position in each of them. An existing rationale …

Are all good word vector spaces isomorphic?

I Vulić, S Ruder, A Søgaard - arxiv preprint arxiv:2004.04070, 2020 - arxiv.org
Existing algorithms for aligning cross-lingual word vector spaces assume that vector spaces
are approximately isomorphic. As a result, they perform poorly or fail completely on non …

Out-of-distribution generalization in natural language processing: Past, present, and future

L Yang, Y Song, X Ren, C Lyu, Y Wang… - Proceedings of the …, 2023 - aclanthology.org
Abstract Machine learning (ML) systems in natural language processing (NLP) face
significant challenges in generalizing to out-of-distribution (OOD) data, where the test …

Fast contextual adaptation with neural associative memory for on-device personalized speech recognition

T Munkhdalai, KC Sim, A Chandorkar… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Fast contextual adaptation has shown to be effective in improving Automatic Speech
Recognition (ASR) of rare words and when combined with an on-device personalized …

Biased user history synthesis for personalized long-tail item recommendation

K Balasubramanian, A Alshabanah… - Proceedings of the 18th …, 2024 - dl.acm.org
Recommendation systems connect users to items and create value chains in the internet
economy. Recommendation systems learn from past user-item interaction histories. As such …

From SPMRL to NMRL: What did we learn (and unlearn) in a decade of parsing morphologically-rich languages (MRLs)?

R Tsarfaty, D Bareket, S Klein, A Seker - arxiv preprint arxiv:2005.01330, 2020 - arxiv.org
It has been exactly a decade since the first establishment of SPMRL, a research initiative
unifying multiple research efforts to address the peculiar challenges of Statistical Parsing for …

Lost in evaluation: Misleading benchmarks for bilingual dictionary induction

Y Kementchedjhieva, M Hartmann… - arxiv preprint arxiv …, 2019 - arxiv.org
The task of bilingual dictionary induction (BDI) is commonly used for intrinsic evaluation of
cross-lingual word embeddings. The largest dataset for BDI was generated automatically, so …