Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in Cognitive …, 2024 - cell.com
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

Recent advances in natural language processing via large pre-trained language models: A survey

B Min, H Ross, E Sulem, APB Veyseh… - ACM Computing …, 2023 - dl.acm.org
Large, pre-trained language models (PLMs) such as BERT and GPT have drastically
changed the Natural Language Processing (NLP) field. For numerous NLP tasks …

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

A Srivastava, A Rastogi, A Rao, AAM Shoeb… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …

Modern language models refute Chomsky's approach to language

ST Piantadosi - From fieldwork to linguistic theory: A tribute to …, 2023 - books.google.com
Modern machine learning has subverted and bypassed the theoretical framework of
Chomsky's generative approach to linguistics, including its core claims to particular insights …

Winoground: Probing vision and language models for visio-linguistic compositionality

T Thrush, R Jiang, M Bartolo, A Singh… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present a novel task and dataset for evaluating the ability of vision and language models
to conduct visio-linguistic compositional reasoning, which we call Winoground. Given two …
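Winoground pairs two images with two captions that use the same words in a different order, and asks whether a model matches each caption to its image. The sketch below is a hedged illustration of that evaluation pattern, using a CLIP checkpoint as a stand-in scorer; the score definitions follow the paper's description of text, image, and group scores, while the checkpoint, function name, and image files are assumptions.

```python
# Hedged sketch: Winoground-style pairing check with CLIP as an example
# image-text scorer. The checkpoint, helper name, and file names are illustrative.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def winoground_scores(images, captions):
    """images: [img_0, img_1]; captions: [cap_0, cap_1]; index i pairs with index i."""
    inputs = processor(text=captions, images=images, return_tensors="pt", padding=True)
    with torch.no_grad():
        sim = model(**inputs).logits_per_image  # shape: (2 images, 2 captions)
    # Text score: for each image, the matching caption outscores the other caption.
    text_ok = sim[0, 0] > sim[0, 1] and sim[1, 1] > sim[1, 0]
    # Image score: for each caption, the matching image outscores the other image.
    image_ok = sim[0, 0] > sim[1, 0] and sim[1, 1] > sim[0, 1]
    return {"text": bool(text_ok), "image": bool(image_ok),
            "group": bool(text_ok and image_ok)}

# Hypothetical image files and captions, purely for illustration.
example = winoground_scores(
    [Image.open("mug_in_grass.jpg"), Image.open("grass_in_mug.jpg")],
    ["there is a mug in some grass", "there is some grass in a mug"],
)
print(example)
```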

A survey of data augmentation approaches for NLP

SY Feng, V Gangal, J Wei, S Chandar… - arXiv preprint arXiv …, 2021 - arxiv.org
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …
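Among the methods such surveys cover are simple token-level augmentations. The following pure-Python sketch implements one of the simplest, a random token swap; it is an illustration of the general idea, not an implementation from the survey.

```python
# Minimal sketch of a token-level augmentation (random swap), one of the simple
# techniques of the kind surveyed; not taken from the paper itself.
import random

def random_swap(sentence: str, n_swaps: int = 2, seed: int = 0) -> str:
    """Return a copy of `sentence` with `n_swaps` random pairs of tokens exchanged."""
    rng = random.Random(seed)
    tokens = sentence.split()
    for _ in range(n_swaps):
        if len(tokens) < 2:
            break
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return " ".join(tokens)

print(random_swap("data augmentation helps low resource NLP tasks"))
```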

The learnability of in-context learning

N Wies, Y Levine, A Shashua - Advances in Neural …, 2023 - proceedings.neurips.cc
In-context learning is a surprising and important phenomenon that emerged when modern
language models were scaled to billions of learned parameters. Without modifying a large …
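In-context learning conditions a frozen model on a handful of input-output demonstrations placed in the prompt, with no weight updates. A minimal sketch of constructing such a few-shot prompt is shown below; the sentiment task and demonstrations are illustrative, and the resulting string would be passed to whatever language model API is available.

```python
# Minimal sketch of in-context (few-shot) prompting: the model's weights are
# untouched; the "learning" happens purely through the prompt's demonstrations.
demonstrations = [
    ("the movie was a delight", "positive"),
    ("the plot made no sense", "negative"),
]
query = "the acting was wonderful"

prompt = "\n".join(f"Review: {x}\nSentiment: {y}" for x, y in demonstrations)
prompt += f"\nReview: {query}\nSentiment:"

print(prompt)
# Feed `prompt` to any large language model's text-completion endpoint;
# the expected continuation is "positive".
```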

What Does BERT Look At? An Analysis of BERT's Attention

K Clark, U Khandelwal, O Levy, CD Manning - arXiv preprint arXiv:1906.04341, 2019 - fq.pkwyx.com
Large pre-trained neural networks such as BERT have had great recent success in NLP,
motivating a growing body of research investigating what aspects of language they are able …
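The paper analyzes what BERT's attention heads attend to. As a rough illustration of the raw material such analyses start from (not the authors' code), the sketch below pulls per-layer, per-head attention maps out of a pre-trained BERT using the transformers library; the layer and head indices inspected are arbitrary.

```python
# Hedged sketch: extracting BERT's attention maps for inspection, roughly the
# kind of data that attention analyses like this one work with.
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased", output_attentions=True)

inputs = tokenizer("The cat sat on the mat.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions: tuple of 12 layers, each (batch, heads, seq_len, seq_len)
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
layer, head = 2, 3  # arbitrary layer/head to inspect
attn = outputs.attentions[layer][0, head]
for i, tok in enumerate(tokens):
    top = attn[i].argmax().item()
    print(f"{tok:>8} attends most to {tokens[top]}")
```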

Masked language modeling and the distributional hypothesis: Order word matters pre-training for little

K Sinha, R Jia, D Hupkes, J Pineau, A Williams… - arXiv preprint arXiv …, 2021 - arxiv.org
A possible explanation for the impressive performance of masked language model (MLM)
pre-training is that such models have learned to represent the syntactic structures prevalent …
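The study asks whether MLM pre-training succeeds because it learns syntax, by testing models pre-trained on word-shuffled text. As a small, hedged illustration of the masked-prediction objective itself and of how shuffling changes the input (not a reproduction of the paper's experiments), the sketch below fills in a masked token with a pre-trained BERT for a natural and a scrambled sentence.

```python
# Hedged sketch of the masked-language-modeling objective: mask a token and ask
# a pre-trained BERT to fill it in. Comparing a natural vs. word-shuffled sentence
# loosely mirrors the order-sensitivity question the paper studies.
import torch
from transformers import BertForMaskedLM, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")

def top_prediction(text_with_mask: str) -> str:
    inputs = tokenizer(text_with_mask, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero()[0, 0]
    return tokenizer.decode([logits[0, mask_pos].argmax().item()])

print(top_prediction("The cat sat on the [MASK]."))   # natural word order
print(top_prediction("on cat the [MASK] the sat ."))  # shuffled word order
```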

Analyzing multi-head self-attention: Specialized heads do the heavy lifting, the rest can be pruned

E Voita, D Talbot, F Moiseev, R Sennrich… - arXiv preprint arXiv …, 2019 - arxiv.org
Multi-head self-attention is a key component of the Transformer, a state-of-the-art
architecture for neural machine translation. In this work we evaluate the contribution made …
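Voita et al. report that a small set of specialized heads carries most of the weight and the remainder can be pruned with little loss. The sketch below illustrates head pruning in a generic Transformer encoder (BERT via transformers) rather than the paper's NMT model; the pruned layer and head indices are arbitrary assumptions.

```python
# Hedged sketch: pruning attention heads from a pre-trained Transformer encoder.
# The paper prunes heads in an NMT model; BERT and the head choice below stand
# in purely for illustration.
from transformers import BertModel

model = BertModel.from_pretrained("bert-base-uncased")
print("heads before:", model.config.num_attention_heads, "per layer")

# {layer_index: [head_indices_to_remove]} -- arbitrary illustrative choice
model.prune_heads({0: [0, 1, 2], 5: [4], 11: [7, 8]})

# Each pruned layer's self-attention now has fewer heads (and fewer parameters).
for layer in (0, 5, 11):
    attn = model.encoder.layer[layer].attention.self
    print(f"layer {layer}: {attn.num_attention_heads} heads remain")
```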