الباحث العلمي من Google

J Chen, Z Liu, X Huang, C Wu, Q Liu, G Jiang, Y Pu… - World Wide Web, 2024‏ - Springer‏

The advent of large language models marks a revolutionary breakthrough in artificial
intelligence. With the unprecedented scale of training and model parameters, the capability …‏

حفظ اقتباس تم اقتباسها في عدد: 137 مقالات ذات صلة الإصدارات الـ 2كلها

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Large language models in law: A survey‏

J Lai, W Gan, J Wu, Z Qi, SY Philip - AI Open, 2024‏ - Elsevier‏

The advent of artificial intelligence (AI) has significantly impacted the traditional judicial
industry. Moreover, recently, with the development of the concept of AI-generated content …‏

حفظ اقتباس تم اقتباسها في عدد: 58 مقالات ذات صلة الإصدارات الـ 3كلها

[Free GPT-4]

[PDF] arxiv.org

A survey of large language models‏

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …‏

حفظ اقتباس تم اقتباسها في عدد: 3598 مقالات ذات صلة الإصدارات الـ 4كلها إصدار HTML‏

[Free GPT-4]

[PDF] mlr.press

Pythia: A suite for analyzing large language models across training and scaling‏

S Biderman, H Schoelkopf… - International …, 2023‏ - proceedings.mlr.press‏

How do large language models (LLMs) develop and evolve over the course of training?
How do these patterns change as models scale? To answer these questions, we introduce …‏

حفظ اقتباس تم اقتباسها في عدد: 982 مقالات ذات صلة الإصدارات الـ 7كلها إصدار HTML‏

[Free GPT-4]

[PDF] arxiv.org

C-pack: Packed resources for general chinese embeddings‏

S **ao, Z Liu, P Zhang, N Muennighoff, D Lian… - Proceedings of the 47th …, 2024‏ - dl.acm.org‏

We introduce C-Pack, a package of resources that significantly advances the field of general
text embeddings for Chinese. C-Pack includes three critical resources. 1) C-MTP is a …‏

حفظ اقتباس تم اقتباسها في عدد: 422 مقالات ذات صلة الإصدارات الـ 2كلها

[Free GPT-4]

[PDF] hal.science

Bloom: A 176b-parameter open-access multilingual language model‏

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023‏ - inria.hal.science‏

Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …‏

حفظ اقتباس تم اقتباسها في عدد: 1747 مقالات ذات صلة الإصدارات الـ 16كلها إصدار HTML‏

[Free GPT-4]

[PDF] arxiv.org

Rwkv: Reinventing rnns for the transformer era‏

B Peng, E Alcaide, Q Anthony, A Albalak… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Transformers have revolutionized almost all natural language processing (NLP) tasks but
suffer from memory and computational complexity that scales quadratically with sequence …‏

حفظ اقتباس تم اقتباسها في عدد: 441 مقالات ذات صلة الإصدارات الـ 9كلها إصدار HTML‏

[Free GPT-4]

[PDF] neurips.cc

Scaling data-constrained language models‏

N Muennighoff, A Rush, B Barak… - Advances in …, 2024‏ - proceedings.neurips.cc‏

The current trend of scaling language models involves increasing both parameter count and
training dataset size. Extrapolating this trend suggests that training dataset size may soon be …‏

حفظ اقتباس تم اقتباسها في عدد: 225 مقالات ذات صلة الإصدارات الـ 7كلها إصدار HTML‏

[Free GPT-4]

[PDF] arxiv.org

Crosslingual generalization through multitask finetuning‏

N Muennighoff, T Wang, L Sutawika, A Roberts… - arxiv preprint arxiv …, 2022‏ - arxiv.org‏

Multitask prompted finetuning (MTF) has been shown to help large language models
generalize to new tasks in a zero-shot setting, but so far explorations of MTF have focused …‏

حفظ اقتباس تم اقتباسها في عدد: 686 مقالات ذات صلة الإصدارات الـ 5كلها إصدار HTML‏

[Free GPT-4]

[PDF] arxiv.org

Glm-130b: An open bilingual pre-trained model‏

A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding… - arxiv preprint arxiv …, 2022‏ - arxiv.org‏

We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model
with 130 billion parameters. It is an attempt to open-source a 100B-scale model at least as …‏

حفظ اقتباس تم اقتباسها في عدد: 603 مقالات ذات صلة الإصدارات الـ 5كلها إصدار HTML‏

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

What language model to train if you have one million gpu hours?

When large language models meet personalization: Perspectives of challenges and opportunities‏

[HTML][HTML] Large language models in law: A survey‏

A survey of large language models‏

Pythia: A suite for analyzing large language models across training and scaling‏

C-pack: Packed resources for general chinese embeddings‏

Bloom: A 176b-parameter open-access multilingual language model‏

Rwkv: Reinventing rnns for the transformer era‏

Scaling data-constrained language models‏

Crosslingual generalization through multitask finetuning‏

Glm-130b: An open bilingual pre-trained model‏