Probing classifiers: Promises, shortcomings, and advances

Y Belinkov - Computational Linguistics, 2022 - direct.mit.edu
Probing classifiers have emerged as one of the prominent methodologies for interpreting
and analyzing deep neural network models of natural language processing. The basic idea …
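
The "basic idea" cut off above is simple to sketch: train a small classifier on frozen model representations to predict some property, and read its accuracy as evidence that the representations encode that property. A minimal illustration follows; the choice of bert-base-uncased, mean pooling, and the toy negation task are assumptions made for this sketch, not the paper's setup.

    import torch
    from transformers import AutoModel, AutoTokenizer
    from sklearn.linear_model import LogisticRegression

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModel.from_pretrained("bert-base-uncased")
    model.eval()

    def embed(sentences):
        # Frozen representations: mean-pooled last-layer hidden states.
        with torch.no_grad():
            batch = tokenizer(sentences, padding=True, return_tensors="pt")
            hidden = model(**batch).last_hidden_state        # (B, T, 768)
            mask = batch["attention_mask"].unsqueeze(-1)     # zero out padding
            return ((hidden * mask).sum(1) / mask.sum(1)).numpy()

    # Toy probing task (hypothetical data): does the sentence contain negation?
    train_x = ["I did not enjoy the film.", "The film was wonderful.",
               "She never replied.", "He answered immediately."]
    train_y = [1, 0, 1, 0]

    probe = LogisticRegression().fit(embed(train_x), train_y)
    print(probe.predict(embed(["They did not come."])))      # expect [1]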

Pre-trained models for natural language processing: A survey

X Qiu, T Sun, Y Xu, Y Shao, N Dai, X Huang - Science China …, 2020 - Springer
Recently, the emergence of pre-trained models (PTMs) has brought natural language
processing (NLP) to a new era. In this survey, we provide a comprehensive review of PTMs …

BLOOM: A 176B-parameter open-access multilingual language model

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023 - inria.hal.science
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …
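
The few-shot behavior referred to here amounts to in-context learning: demonstrations are concatenated ahead of a query, and the model's continuation is read off as the answer. A toy sketch (the sentiment task and templates are invented; any causal LM, BLOOM included, could sit behind it):

    demos = [("I loved this movie!", "positive"),
             ("The plot made no sense.", "negative")]
    query = "A stunning, heartfelt film."

    # Few-shot prompt: task demonstrations followed by the unanswered query.
    prompt = "".join(f"Review: {x}\nSentiment: {y}\n\n" for x, y in demos)
    prompt += f"Review: {query}\nSentiment:"
    print(prompt)  # an LM's next-token continuation is taken as the label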

Locating and editing factual associations in GPT

K Meng, D Bau, A Andonian… - Advances in neural …, 2022 - proceedings.neurips.cc
We analyze the storage and recall of factual associations in autoregressive transformer
language models, finding evidence that these associations correspond to localized, directly …
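
A much-simplified sketch of the causal-tracing idea behind this analysis, under stated assumptions: GPT-2 small stands in for the models studied, the subject span indices, noise scale, and choice of layer 6 are all illustrative. The probe corrupts the subject's token embeddings, then restores one clean MLP activation to see how much of the correct token's probability returns.

    import torch
    from transformers import GPT2LMHeadModel, GPT2Tokenizer

    tok = GPT2Tokenizer.from_pretrained("gpt2")
    model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

    prompt = "The Eiffel Tower is located in the city of"
    ids = tok(prompt, return_tensors="pt")["input_ids"]
    paris = tok(" Paris")["input_ids"][0]        # id of the expected answer

    emb = model.transformer.wte(ids)             # clean token embeddings
    noisy = emb.clone()
    noisy[0, :5] += 0.5 * torch.randn_like(noisy[0, :5])  # corrupt subject span

    layer = model.transformer.h[6].mlp           # layer choice is illustrative
    cache = {}

    def save(mod, inp, out):
        cache["h"] = out                         # remember the clean activation

    def patch(mod, inp, out):
        return cache["h"]                        # restore it during the noisy run

    with torch.no_grad():
        hook = layer.register_forward_hook(save)
        model(inputs_embeds=emb)                 # clean run fills the cache
        hook.remove()
        p_bad = model(inputs_embeds=noisy).logits[0, -1].softmax(-1)[paris]
        hook = layer.register_forward_hook(patch)
        p_fix = model(inputs_embeds=noisy).logits[0, -1].softmax(-1)[paris]
        hook.remove()

    print(f"p(' Paris'): corrupted={p_bad:.4f}  patched={p_fix:.4f}")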

Unlearn what you want to forget: Efficient unlearning for LLMs

J Chen, D Yang - arXiv preprint arXiv:2310.20150, 2023 - arxiv.org
Large language models (LLMs) have achieved significant progress from pre-training on and
memorizing a wide range of textual data; however, this process might suffer from privacy …

Fine-tuning can distort pretrained features and underperform out-of-distribution

A Kumar, A Raghunathan, R Jones, T Ma… - arXiv preprint arXiv …, 2022 - arxiv.org
When transferring a pretrained model to a downstream task, two popular methods are full
fine-tuning (updating all the model parameters) and linear probing (updating only the last …
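
The two transfer strategies being compared can be contrasted in a few lines of PyTorch. The ResNet-18 backbone and 10-class head below are illustrative stand-ins (the paper studies pretrained models under distribution shift); the only difference between the regimes is which parameters stay trainable.

    import torch
    import torch.nn as nn
    from torchvision.models import resnet18

    def build(strategy):
        model = resnet18(weights="IMAGENET1K_V1")         # pretrained backbone
        model.fc = nn.Linear(model.fc.in_features, 10)    # fresh task head
        if strategy == "linear_probe":
            for name, p in model.named_parameters():
                p.requires_grad = name.startswith("fc.")  # train the head only
        return model  # "full_finetune": every parameter stays trainable

    probe = build("linear_probe")
    finetune = build("full_finetune")
    # The optimizer should only see trainable parameters, e.g.:
    opt = torch.optim.SGD((p for p in probe.parameters() if p.requires_grad), lr=1e-3)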

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arXiv preprint arXiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Fast model editing at scale

E Mitchell, C Lin, A Bosselut, C Finn… - arXiv preprint arXiv …, 2021 - arxiv.org
While large pre-trained models have enabled impressive results on a variety of downstream
tasks, the largest existing models still make errors, and even accurate predictions may …

Factual probing is [MASK]: Learning vs. learning to recall

Z Zhong, D Friedman, D Chen - arXiv preprint arXiv:2104.05240, 2021 - arxiv.org
Petroni et al. (2019) demonstrated that it is possible to retrieve world facts from a pre-trained
language model by expressing them as cloze-style prompts and interpret the model's …
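
The cloze-style setup under analysis looks roughly like the following sketch (bert-base-cased and the example fact are assumptions made here; LAMA-style probing reads the correct token's probability or rank as a lower bound on stored knowledge).

    from transformers import pipeline

    fill = pipeline("fill-mask", model="bert-base-cased")
    prompt = "The capital of France is [MASK]."
    for cand in fill(prompt, top_k=3):
        print(f"{cand['token_str']:>10}  p={cand['score']:.3f}")
    # The probability/rank of the correct token ("Paris") is read as a lower
    # bound on the factual knowledge stored in the pretrained weights.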

How can we know what language models know?

Z Jiang, FF Xu, J Araki, G Neubig - Transactions of the Association for …, 2020 - direct.mit.edu
Recent work has presented intriguing results examining the knowledge contained in
language models (LMs) by having the LM fill in the blanks of prompts such as “Obama is a …
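
The paper's remedy for prompt sensitivity, paraphrasing prompts and ensembling their predictions, can be sketched as below; the templates here are invented for illustration, whereas the paper mines and paraphrases them automatically.

    from collections import defaultdict
    from transformers import pipeline

    fill = pipeline("fill-mask", model="bert-base-cased")
    templates = [
        "Obama is a [MASK] by profession.",
        "Obama works as a [MASK].",
        "By profession, Obama is a [MASK].",
    ]

    scores = defaultdict(float)
    for t in templates:
        for cand in fill(t, top_k=5):
            scores[cand["token_str"]] += cand["score"] / len(templates)

    print(max(scores, key=scores.get))   # the ensemble's best guess for the relation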