- Academic Search

Pangu-$\alpha $: Large-scale autoregressive pretrained Chinese language models with auto-parallel...

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arxiv preprint arxiv …, 2023 - arxiv.org

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

保存引用被引用数: 700 関連記事全 3 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arxiv preprint arxiv …, 2023 - arxiv.org

Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

保存引用被引用数: 482 関連記事全 3 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arxiv preprint arxiv …, 2023 - arxiv.org

Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

保存引用被引用数: 3554 関連記事全 4 バージョン HTMLバージョン

[Free GPT-4]

[PDF] neurips.cc

Scaling data-constrained language models

N Muennighoff, A Rush, B Barak… - Advances in …, 2023 - proceedings.neurips.cc

The current trend of scaling language models involves increasing both parameter count and
training dataset size. Extrapolating this trend suggests that training dataset size may soon be …

保存引用被引用数: 221 関連記事全 7 バージョン HTMLバージョン

[Free GPT-4]

[PDF] hal.science

Bloom: A 176b-parameter open-access multilingual language model

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023 - inria.hal.science

Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …

保存引用被引用数: 1745 関連記事全 16 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Generative language models and automated influence operations: Emerging threats and potential mitigations

JA Goldstein, G Sastry, M Musser, R DiResta… - arxiv preprint arxiv …, 2023 - arxiv.org

Generative language models have improved drastically, and can now produce realistic text
outputs that are difficult to distinguish from human-written content. For malicious actors …

保存引用被引用数: 301 関連記事全 3 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Glm-130b: An open bilingual pre-trained model

A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding… - arxiv preprint arxiv …, 2022 - arxiv.org

We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model
with 130 billion parameters. It is an attempt to open-source a 100B-scale model at least as …

保存引用被引用数: 596 関連記事全 5 バージョン HTMLバージョン

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Pre-trained language models and their applications

H Wang, J Li, H Wu, E Hovy, Y Sun - Engineering, 2023 - Elsevier

Pre-trained language models have achieved striking success in natural language
processing (NLP), leading to a paradigm shift from supervised learning to pre-training …

保存引用被引用数: 274 関連記事全 2 バージョン

[Free GPT-4]

[PDF] jmlr.org

Palm: Scaling language modeling with pathways

A Chowdhery, S Narang, J Devlin, M Bosma… - Journal of Machine …, 2023 - jmlr.org

Large language models have been shown to achieve remarkable performance across a
variety of natural language tasks using few-shot learning, which drastically reduces the …

Codegeex: A pre-trained model for code generation with multilingual evaluations on humaneval-x

Q Zheng, X **a, X Zou, Y Dong, S Wang, Y Xue… - arxiv preprint arxiv …, 2023 - arxiv.org

Large pre-trained code generation models, such as OpenAI Codex, can generate syntax-
and function-correct code, making the coding of programmers more productive and our …

保存引用被引用数: 239 関連記事全 2 バージョン HTMLバージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

Pangu-$\alpha $: Large-scale autoregressive pretrained Chinese language models with auto-parallel...

A comprehensive overview of large language models

Challenges and applications of large language models

A survey of large language models

Scaling data-constrained language models

Bloom: A 176b-parameter open-access multilingual language model

Generative language models and automated influence operations: Emerging threats and potential mitigations

Glm-130b: An open bilingual pre-trained model

[HTML][HTML] Pre-trained language models and their applications

Palm: Scaling language modeling with pathways

Codegeex: A pre-trained model for code generation with multilingual evaluations on humaneval-x