Pre-trained models for natural language processing: A survey

X Qiu, T Sun, Y Xu, Y Shao, N Dai, X Huang - Science China …, 2020 - Springer
Recently, the emergence of pre-trained models (PTMs) has brought natural language
processing (NLP) to a new era. In this survey, we provide a comprehensive review of PTMs …

A survey on data augmentation for text classification

M Bayer, MA Kaufhold, C Reuter - ACM Computing Surveys, 2022 - dl.acm.org
Data augmentation, the artificial creation of training data for machine learning by
transformations, is a widely studied research field across machine learning disciplines …

BLOOM: A 176B-parameter open-access multilingual language model

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023 - inria.hal.science
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …

Language models as knowledge bases?

F Petroni, T Rocktäschel, P Lewis, A Bakhtin… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent progress in pretraining language models on large textual corpora led to a surge of
improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these …

Pre-trained models: Past, present and future

X Han, Z Zhang, N Ding, Y Gu, X Liu, Y Huo, J Qiu… - AI Open, 2021 - Elsevier
Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved
great success and become a milestone in the field of artificial intelligence (AI). Owing to …

A primer in BERTology: What we know about how BERT works

A Rogers, O Kovaleva, A Rumshisky - Transactions of the Association …, 2021 - direct.mit.edu
Transformer-based models have pushed state of the art in many areas of NLP, but our
understanding of what is behind their success is still limited. This paper is the first survey of …

SuperGLUE: A stickier benchmark for general-purpose language understanding systems

A Wang, Y Pruksachatkun, N Nangia… - Advances in neural …, 2019 - proceedings.neurips.cc
In the last year, new models and methods for pretraining and transfer learning have driven
striking performance improvements across a range of language understanding tasks. The …

What Does BERT Look At? An Analysis of BERT's Attention

K Clark - arXiv preprint arXiv:1906.04341, 2019 - fq.pkwyx.com
Large pre-trained neural networks such as BERT have had great recent success in NLP,
motivating a growing body of research investigating what aspects of language they are able …

BERT rediscovers the classical NLP pipeline

I Tenney - arXiv preprint arXiv:1905.05950, 2019 - fq.pkwyx.com
Pre-trained text encoders have rapidly advanced the state of the art on many NLP tasks. We
focus on one such model, BERT, and aim to quantify where linguistic information is captured …

How multilingual is Multilingual BERT?

T Pires - arXiv preprint arXiv:1906.01502, 2019 - fq.pkwyx.com
In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2018) as
a single language model pre-trained from monolingual corpora in 104 languages, is …