Google znalac

K Ethayarajh, Y Choi… - … Conference on Machine …, 2022 - proceedings.mlr.press

Estimating the difficulty of a dataset typically involves comparing state-of-the-art models to
humans; the bigger the performance gap, the harder the dataset is said to be. However, this …

Spremi Citiraj Spominje se 228 puta Srodni članci Svih 8 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] frontiersin.org

Schrödinger's tree—On syntax and neural language models

A Kulmizev, J Nivre - Frontiers in Artificial Intelligence, 2022 - frontiersin.org

In the last half-decade, the field of natural language processing (NLP) has undergone two
major transitions: the switch to neural networks as the primary modeling paradigm and the …

Spremi Citiraj Spominje se 25 puta Srodni članci Svih 9 inačica Spremljeno u privremenu memoriju

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Ravel: Evaluating interpretability methods on disentangling language model representations

J Huang, Z Wu, C Potts, M Geva, A Geiger - arxiv preprint arxiv …, 2024 - arxiv.org

Individual neurons participate in the representation of multiple high-level concepts. To what
extent can different interpretability methods successfully disentangle these roles? To help …

Spremi Citiraj Spominje se 21 puta Srodni članci Svih 6 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Receval: Evaluating reasoning chains via correctness and informativeness

A Prasad, S Saha, X Zhou, M Bansal - arxiv preprint arxiv:2304.10703, 2023 - arxiv.org

Multi-step reasoning ability is fundamental to many natural language tasks, yet it is unclear
what constitutes a good reasoning chain and how to evaluate them. Most existing methods …

Spremi Citiraj Spominje se 33 puta Srodni članci Svih 7 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A closer look at how fine-tuning changes BERT

Y Zhou, V Srikumar - arxiv preprint arxiv:2106.14282, 2021 - arxiv.org

Given the prevalence of pre-trained contextualized representations in today's NLP, there
have been many efforts to understand what information they contain, and why they seem to …

Spremi Citiraj Spominje se 69 puta Srodni članci Svih 7 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Probing for the usage of grammatical number

K Lasri, T Pimentel, A Lenci, T Poibeau… - arxiv preprint arxiv …, 2022 - arxiv.org

A central quest of probing is to uncover how pre-trained models encode a linguistic property
within their representations. An encoding, however, might be spurious-ie, the model might …

Spremi Citiraj Spominje se 56 puta Srodni članci Svih 9 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Probing for constituency structure in neural language models

D Arps, Y Samih, L Kallmeyer, H Sajjad - arxiv preprint arxiv:2204.06201, 2022 - arxiv.org

In this paper, we investigate to which extent contextual neural language models (LMs)
implicitly learn syntactic structure. More concretely, we focus on constituent structure as …

Spremi Citiraj Spominje se 30 puta Srodni članci Svih 5 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] ict.ac.cn

[PDF][PDF] 自然语言处理中的探针可解释方法综述

鞠天杰，刘功申，张倬胜，张茹 - 计算机学报, 2024 - cjc.ict.ac.cn

摘要随着大规模预训练模型的广泛应用, 自然语言处理的多个领域(如文本分类和机器翻译)
取得了长足的发展. 然而, 受限于预训练模型的“黑盒” 特性, 其内部的决策模式以及编码的知识 …

Spremi Citiraj Spominje se 4 puta Srodni članci Svih 5 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

When classifying grammatical role, BERT doesn't care about word order... except when it matters

I Papadimitriou, R Futrell, K Mahowald - arxiv preprint arxiv:2203.06204, 2022 - arxiv.org

Because meaning can often be inferred from lexical semantics alone, word order is often a
redundant cue in natural language. For example, the words chopped, chef, and onion are …

Spremi Citiraj Spominje se 32 puta Srodni članci Svih 6 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Gaussian process probes (GPP) for uncertainty-aware probing

Z Wang, A Ku, J Baldridge… - Advances in neural …, 2023 - proceedings.neurips.cc

Understanding which concepts models can and cannot represent has been fundamental to
many tasks: from effective and responsible use of models to detecting out of distribution …

Spremi Citiraj Spominje se 9 puta Srodni članci Svih 7 inačica Prikaži kao HTML

Stvori obavijest

Citiraj

Napredno pretraživanje

Spremljeno u Moju knjižnicu

Conditional probing: measuring usable information beyond a baseline

Understanding Dataset Difficulty with -Usable Information

Schrödinger's tree—On syntax and neural language models

Ravel: Evaluating interpretability methods on disentangling language model representations

Receval: Evaluating reasoning chains via correctness and informativeness

A closer look at how fine-tuning changes BERT

Probing for the usage of grammatical number

Probing for constituency structure in neural language models

[PDF][PDF] 自然语言处理中的探针可解释方法综述

When classifying grammatical role, BERT doesn't care about word order... except when it matters

Gaussian process probes (GPP) for uncertainty-aware probing