A survey on data augmentation for text classification

M Bayer, MA Kaufhold, C Reuter - ACM Computing Surveys, 2022 - dl.acm.org
Data augmentation, the artificial creation of training data for machine learning by
transformations, is a widely studied research field across machine learning disciplines …
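A minimal sketch of the transformation idea the snippet refers to: a toy synonym-replacement augmenter for text classification data. The synonym table, replacement probability, and function name are illustrative stand-ins, not a method taken from the survey.

```python
# Toy transformation-based text augmentation: randomly swap words for
# synonyms to create extra training examples. The synonym table is a
# stand-in for a real lexical resource such as WordNet.
import random

SYNONYMS = {
    "good": ["great", "fine"],
    "movie": ["film", "picture"],
    "boring": ["dull", "tedious"],
}

def augment(sentence: str, p: float = 0.3, seed: int = 0) -> str:
    """Replace each known word with a random synonym with probability p."""
    rng = random.Random(seed)
    out = []
    for word in sentence.split():
        options = SYNONYMS.get(word.lower())
        if options and rng.random() < p:
            out.append(rng.choice(options))
        else:
            out.append(word)
    return " ".join(out)

print(augment("a good movie that is never boring", p=1.0))
# -> e.g. "a great film that is never dull"
```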

Symbols and grounding in large language models

E Pavlick - Philosophical Transactions of the Royal Society A, 2023 - royalsocietypublishing.org
Large language models (LLMs) are one of the most impressive achievements of artificial
intelligence in recent years. However, their relevance to the study of language more broadly …

Explainability for large language models: A survey

H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai… - ACM Transactions on Intelligent Systems and Technology, 2024 - dl.acm.org
Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …

BLOOM: A 176B-parameter open-access multilingual language model

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023 - inria.hal.science
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …
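A minimal sketch of the few-shot behavior the abstract describes: the model is given a few demonstrations in the prompt and continues the pattern, with no weight updates. It assumes the Hugging Face transformers library and the small bigscience/bloom-560m checkpoint; the toy translation task is illustrative only.

```python
# Few-shot prompting: demonstrations in the prompt, no fine-tuning.
from transformers import pipeline

generator = pipeline("text-generation", model="bigscience/bloom-560m")

prompt = (
    "Translate English to French.\n"
    "English: cheese -> French: fromage\n"
    "English: book -> French: livre\n"
    "English: water -> French:"
)
# The model is expected to continue with "eau", imitating the demonstrations.
print(generator(prompt, max_new_tokens=3)[0]["generated_text"])
```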

Modern language models refute Chomsky's approach to language

ST Piantadosi - From fieldwork to linguistic theory: A tribute to Dan Everett, 2023 - books.google.com
Modern machine learning has subverted and bypassed the theoretical framework of
Chomsky's generative approach to linguistics, including its core claims to particular insights …

Pre-trained models: Past, present and future

X Han, Z Zhang, N Ding, Y Gu, X Liu, Y Huo, J Qiu… - AI Open, 2021 - Elsevier
Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved
great success and become a milestone in the field of artificial intelligence (AI). Owing to …

LasUIE: Unifying information extraction with latent adaptive structure-aware generative language model

H Fei, S Wu, J Li, B Li, F Li, L Qin… - Advances in Neural Information Processing Systems, 2022 - proceedings.neurips.cc
Universally modeling all typical information extraction tasks (UIE) with one generative
language model (GLM) has been shown by recent work to hold great potential, where various IE …

Finding neurons in a haystack: Case studies with sparse probing

W Gurnee, N Nanda, M Pauly, K Harvey… - arXiv preprint arXiv …, 2023 - arxiv.org
Despite rapid adoption and deployment of large language models (LLMs), the internal
computations of these models remain opaque and poorly understood. In this work, we seek …
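A toy sketch of the sparse-probing idea on synthetic data: fit a linear probe whose weights are pushed toward sparsity, then read off the few neurons it selects. L1-regularized logistic regression is used here as a simple proxy for the paper's k-sparse probes, and all activations and dimensions are made up.

```python
# Sparse probing: a sparsity-constrained linear probe on neuron
# activations identifies the small set of neurons encoding a feature.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 512))                 # stand-in activations
y = (X[:, 42] + 0.5 * X[:, 7] > 0).astype(int)   # feature carried by 2 neurons

probe = LogisticRegression(penalty="l1", solver="liblinear", C=0.05)
probe.fit(X, y)

print("selected neurons:", np.flatnonzero(probe.coef_[0]))  # expect ~[7, 42]
```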

Probing classifiers: Promises, shortcomings, and advances

Y Belinkov - Computational Linguistics, 2022 - direct.mit.edu
Probing classifiers have emerged as one of the prominent methodologies for interpreting
and analyzing deep neural network models of natural language processing. The basic idea …
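The basic recipe the survey analyzes, sketched on synthetic data: freeze a model, extract its representations, and train a small classifier to predict some property from them; high probe accuracy is then read as evidence the property is encoded. Random vectors stand in for real hidden states here.

```python
# A linear probing classifier over frozen representations.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
reps = rng.normal(size=(2000, 768))                  # stand-in hidden states
labels = (reps[:, :10].sum(axis=1) > 0).astype(int)  # toy linguistic property

X_tr, X_te, y_tr, y_te = train_test_split(reps, labels, random_state=0)
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("probe accuracy:", probe.score(X_te, y_te))    # high => linearly decodable
```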

Post-hoc interpretability for neural NLP: A survey

A Madsen, S Reddy, S Chandar - ACM Computing Surveys, 2022 - dl.acm.org
Neural networks for NLP are becoming increasingly complex and widespread, and there is
growing concern about whether these models are responsible to use. Explaining models helps to address …