A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv…, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

AMMUS: A survey of transformer-based pretrained models in natural language processing

KS Kalyan, A Rajasekharan, S Sangeetha - arXiv preprint arXiv…, 2021 - arxiv.org
Transformer-based pretrained language models (T-PTLMs) have achieved great success in
almost every NLP task. The evolution of these models started with GPT and BERT. These …

C-Pack: Packed resources for general Chinese embeddings

S Xiao, Z Liu, P Zhang, N Muennighoff, D Lian… - Proceedings of the 47th …, 2024 - dl.acm.org
We introduce C-Pack, a package of resources that significantly advances the field of general
text embeddings for Chinese. C-Pack includes three critical resources. 1) C-MTP is a …

GLM-130B: An open bilingual pre-trained model

A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding… - arXiv preprint arXiv…, 2022 - arxiv.org
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model
with 130 billion parameters. It is an attempt to open-source a 100B-scale model at least as …

The BigScience ROOTS corpus: A 1.6 TB composite multilingual dataset

H Laurençon, L Saulnier, T Wang… - Advances in …, 2022 - proceedings.neurips.cc
As language models grow ever larger, the need for large-scale high-quality text datasets has
never been more pressing, especially in multilingual settings. The BigScience workshop, a 1 …

CogView: Mastering text-to-image generation via transformers

M Ding, Z Yang, W Hong, W Zheng… - Advances in neural …, 2021 - proceedings.neurips.cc
Text-to-Image generation in the general domain has long been an open problem, which
requires both a powerful generative model and cross-modal understanding. We propose …

PPT: Pre-trained prompt tuning for few-shot learning

Y Gu, X Han, Z Liu, M Huang - arXiv preprint arXiv:2109.04332, 2021 - arxiv.org
Prompts for pre-trained language models (PLMs) have shown remarkable performance by
bridging the gap between pre-training tasks and various downstream tasks. Among these …

Continual learning of large language models: A comprehensive survey

H Shi, Z Xu, H Wang, W Qin, W Wang, Y Wang… - arXiv preprint arXiv…, 2024 - arxiv.org
The recent success of large language models (LLMs) trained on static, pre-collected,
general datasets has sparked numerous research directions and applications. One such …

NusaCrowd: Open source initiative for Indonesian NLP resources

S Cahyawijaya, H Lovenia, AF Aji… - Findings of the …, 2023 - aclanthology.org
We present NusaCrowd, a collaborative initiative to collect and unify existing resources for
Indonesian languages, including opening access to previously non-public resources …

The all-seeing project: Towards panoptic visual recognition and understanding of the open world

W Wang, M Shi, Q Li, W Wang, Z Huang, L Xing… - arXiv preprint arXiv…, 2023 - arxiv.org
We present the All-Seeing (AS) project: a large-scale data and model for recognizing and
understanding everything in the open world. Using a scalable data engine that incorporates …