Discovering language model behaviors with model-written evaluations

E Perez, S Ringer, K Lukošiūtė, K Nguyen… - arXiv preprint arXiv …, 2022 - arxiv.org
As language models (LMs) scale, they develop many novel behaviors, good and bad,
exacerbating the need to evaluate how they behave. Prior work creates evaluations with …

Multilingual translation with extensible multilingual pretraining and finetuning

Y Tang, C Tran, X Li, PJ Chen, N Goyal… - arXiv preprint arXiv …, 2020 - arxiv.org
Recent work demonstrates the potential of multilingual pretraining of creating one model that
can be used for various tasks in different languages. Previous work in multilingual …

Contrastive learning for many-to-many multilingual neural machine translation

X Pan, M Wang, L Wu, L Li - arXiv preprint arXiv:2105.09501, 2021 - arxiv.org
Existing multilingual machine translation approaches mainly focus on English-centric
directions, while the non-English directions still lag behind. In this work, we aim to build a …

CiCo: Domain-aware sign language retrieval via cross-lingual contrastive learning

Y Cheng, F Wei, J Bao, D Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
This work focuses on sign language retrieval--a recently proposed task for sign language
understanding. Sign language retrieval consists of two sub-tasks: text-to-sign-video (T2V) …

MTOP: A comprehensive multilingual task-oriented semantic parsing benchmark

H Li, A Arora, S Chen, A Gupta, S Gupta… - arXiv preprint arXiv …, 2020 - arxiv.org
Scaling semantic parsing models for task-oriented dialog systems to new languages is often
expensive and time-consuming due to the lack of available datasets. Available datasets …

Facebook AI WMT21 news translation task submission

C Tran, S Bhosale, J Cross, P Koehn, S Edunov… - arXiv preprint arXiv …, 2021 - arxiv.org
We describe Facebook's multilingual model submission to the WMT2021 shared task on
news translation. We participate in 14 language directions: English to and from Czech …

ERNIE-M: Enhanced multilingual representation by aligning cross-lingual semantics with monolingual corpora

X Ouyang, S Wang, C Pang, Y Sun, H Tian… - arXiv preprint arXiv …, 2020 - arxiv.org
Recent studies have demonstrated that pre-trained cross-lingual models achieve impressive
performance in downstream cross-lingual tasks. This improvement benefits from learning a …

One question answering model for many languages with cross-lingual dense passage retrieval

A Asai, X Yu, J Kasai… - Advances in Neural …, 2021 - proceedings.neurips.cc
We present Cross-lingual Open-Retrieval Answer Generation (CORA), the first
unified many-to-many question answering (QA) model that can answer questions across …

End-to-end speech translation via cross-modal progressive training

R Ye, M Wang, L Li - arXiv preprint arXiv:2104.10380, 2021 - arxiv.org
End-to-end speech translation models have become a new trend in research due to their
potential of reducing error propagation. However, these models still suffer from the …

Zero-shot cross-lingual transfer of neural machine translation with multilingual pretrained encoders

G Chen, S Ma, Y Chen, L Dong, D Zhang, J Pan… - arXiv preprint arXiv …, 2021 - arxiv.org
Previous work mainly focuses on improving cross-lingual transfer for NLU tasks with a
multilingual pretrained encoder (MPE), or improving the performance on supervised …