Neural machine translation for low-resource languages: A survey

S Ranathunga, ESA Lee, M Prifti Skenduli… - ACM Computing …, 2023 - dl.acm.org
Neural Machine Translation (NMT) has seen tremendous growth in less than ten years and has already entered a mature phase. While considered the most widely …

Text embeddings by weakly-supervised contrastive pre-training

L Wang, N Yang, X Huang, B Jiao, L Yang… - arXiv preprint arXiv …, 2022 - arxiv.org
This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a
wide range of tasks. The model is trained in a contrastive manner with weak supervision …
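
For concreteness, here is a minimal sketch of the kind of in-batch-negative contrastive (InfoNCE) objective such embedding models are trained with; the function names, tensor shapes, temperature value, and use of PyTorch are illustrative assumptions, not the paper's exact recipe.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(query_emb, passage_emb, temperature=0.05):
    """In-batch-negative contrastive loss over L2-normalized embeddings.

    query_emb, passage_emb: (batch, dim) tensors; row i of each forms a positive pair.
    """
    q = F.normalize(query_emb, dim=-1)
    p = F.normalize(passage_emb, dim=-1)
    logits = q @ p.T / temperature                     # (batch, batch) similarity matrix
    labels = torch.arange(q.size(0), device=q.device)  # positives sit on the diagonal
    return F.cross_entropy(logits, labels)
```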

MTEB: Massive text embedding benchmark

N Muennighoff, N Tazi, L Magne, N Reimers - arXiv preprint arXiv …, 2022 - arxiv.org
Text embeddings are commonly evaluated on a small set of datasets from a single task, not covering their possible applications to other tasks. It is unclear whether state-of-the-art …
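
A hedged sketch of evaluating a model against MTEB with the `mteb` Python package; the task name, model identifier, and output folder below are illustrative choices, and the package's current API may differ.

```python
from mteb import MTEB
from sentence_transformers import SentenceTransformer

# Any model exposing an .encode() method can be evaluated; the model and
# task names here are illustrative, not a prescribed setup.
model = SentenceTransformer("intfloat/e5-small-v2")
evaluation = MTEB(tasks=["Banking77Classification"])
results = evaluation.run(model, output_folder="results/e5-small-v2")
```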

Hallucinations in large multilingual translation models

NM Guerreiro, DM Alves, J Waldendorf… - Transactions of the …, 2023 - direct.mit.edu
Hallucinated translations can severely undermine trust and raise safety issues when machine translation systems are deployed in the wild. Previous research on the topic focused on …

A survey on multi-modal summarization

A Jangra, S Mukherjee, A Jatowt, S Saha… - ACM Computing …, 2023 - dl.acm.org
Technology has advanced to the point where it is convenient for people to share their opinions across an abundance of platforms. These platforms have a provision for …

Large dual encoders are generalizable retrievers

J Ni, C Qu, J Lu, Z Dai, GH Ábrego, J Ma… - arXiv preprint arXiv …, 2021 - arxiv.org
It has been shown that dual encoders trained on one domain often fail to generalize to other
domains for retrieval tasks. One widespread belief is that the bottleneck layer of a dual …
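
A minimal sketch of dual-encoder retrieval as the snippet describes it: queries and documents are embedded independently and scored by a dot product, so document vectors can be pre-computed and indexed. The checkpoint name and example texts are assumptions for illustration.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

# Illustrative dual-encoder retrieval; the GTR checkpoint name is an assumption.
model = SentenceTransformer("sentence-transformers/gtr-t5-base")
docs = ["A survey of NMT for low-resource languages.",
        "A benchmark for text embedding models."]
doc_emb = model.encode(docs, normalize_embeddings=True)      # pre-computable document index
query_emb = model.encode(["low-resource machine translation"],
                         normalize_embeddings=True)
scores = query_emb @ doc_emb.T                               # cosine similarity via dot product
print(docs[int(np.argmax(scores))])                          # best-matching document
```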

Sentence-t5: Scalable sentence encoders from pre-trained text-to-text models

J Ni, GH Ábrego, N Constant, J Ma, KB Hall… - arXiv preprint arXiv …, 2021 - arxiv.org
We provide the first exploration of sentence embeddings from text-to-text transformers (T5).
Sentence embeddings are broadly useful for language processing tasks. While T5 achieves …
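
One simple way to turn a text-to-text model into a sentence encoder, and one of the strategies this line of work examines, is to mean-pool the T5 encoder's token representations; the checkpoint and helper function below are illustrative assumptions, not the paper's released models.

```python
import torch
from transformers import AutoTokenizer, T5EncoderModel

tokenizer = AutoTokenizer.from_pretrained("t5-base")   # illustrative checkpoint
encoder = T5EncoderModel.from_pretrained("t5-base")

def embed(sentences):
    batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = encoder(**batch).last_hidden_state        # (batch, seq_len, dim)
    mask = batch["attention_mask"].unsqueeze(-1)           # zero out padding positions
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)    # mean-pool over real tokens
```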

Fleurs: Few-shot learning evaluation of universal representations of speech

A Conneau, M Ma, S Khanuja, Y Zhang… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
We introduce FLEURS, the Few-shot Learning Evaluation of Universal Representations of
Speech benchmark. FLEURS is an n-way parallel speech dataset in 102 languages built on …
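
A short sketch of loading one language configuration of FLEURS, assuming the `google/fleurs` dataset id on the Hugging Face Hub with per-language configs such as `en_us`; both identifiers should be checked against the dataset card.

```python
from datasets import load_dataset

# Assumes the "google/fleurs" Hub dataset with per-language configs such as "en_us".
fleurs_en = load_dataset("google/fleurs", "en_us", split="validation")
print(len(fleurs_en), fleurs_en.column_names)  # inspect available fields before use
```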

Parameter-efficient transfer learning with diff pruning

D Guo, AM Rush, Y Kim - arXiv preprint arXiv:2012.07463, 2020 - arxiv.org
While task-specific finetuning of pretrained networks has led to significant empirical
advances in NLP, the large size of networks makes finetuning difficult to deploy in multi-task …
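
The core idea, stripped down: keep the pretrained weights frozen and learn a task-specific difference vector that is pushed toward sparsity, so only the small diff has to be stored per task. The sketch below uses a plain L1 penalty as a stand-in for the paper's relaxed-L0 formulation and is an illustrative simplification.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DiffPrunedLinear(nn.Module):
    """Frozen pretrained linear layer plus a learnable, sparsity-penalized diff (simplified)."""

    def __init__(self, pretrained: nn.Linear):
        super().__init__()
        self.weight = pretrained.weight.detach()                  # frozen base parameters
        self.bias = pretrained.bias.detach() if pretrained.bias is not None else None
        self.diff = nn.Parameter(torch.zeros_like(self.weight))   # task-specific delta

    def forward(self, x):
        return F.linear(x, self.weight + self.diff, self.bias)

    def sparsity_penalty(self):
        # L1 stand-in for the relaxed-L0 penalty: drives most diff entries to zero.
        return self.diff.abs().sum()
```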

InfoXLM: An information-theoretic framework for cross-lingual language model pre-training

Z Chi, L Dong, F Wei, N Yang, S Singhal… - arXiv preprint arXiv …, 2020 - arxiv.org
In this work, we present an information-theoretic framework that formulates cross-lingual
language model pre-training as maximizing mutual information between multilingual-multi …
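
The mutual-information view the snippet refers to is usually made concrete through a contrastive (InfoNCE) bound; written out under illustrative notation, with f_theta scoring a sentence x against its translation y+ and N-1 in-batch negatives:

```latex
\mathcal{L}_{\text{InfoNCE}}
  = -\,\mathbb{E}\!\left[\log
      \frac{\exp\!\big(f_\theta(x, y^{+})\big)}
           {\sum_{j=1}^{N} \exp\!\big(f_\theta(x, y_j)\big)}\right],
\qquad
I(X;Y) \;\ge\; \log N - \mathcal{L}_{\text{InfoNCE}} .
```

Minimizing this loss therefore maximizes a lower bound on the mutual information between the two cross-lingual views, which is the sense in which pre-training is framed as mutual-information maximization.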