Google Scholar

A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

Cited by 794

Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L Jin - arXiv preprint arXiv:2402.18041, 2024 - arxiv.org

This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

Cited by 138

RWKV: Reinventing RNNs for the transformer era

B Peng, E Alcaide, Q Anthony, A Albalak… - arXiv preprint arXiv …, 2023 - arxiv.org

Transformers have revolutionized almost all natural language processing (NLP) tasks but
suffer from memory and computational complexity that scales quadratically with sequence …

Cited by 484

The RefinedWeb dataset for Falcon LLM: Outperforming curated corpora with web data only

G Penedo, Q Malartic, D Hesslow… - Advances in …, 2023 - proceedings.neurips.cc

Large language models are commonly trained on a mixture of filtered web data and
curated "high-quality" corpora, such as social media conversations, books, or technical …

Cited by 122

OLMo: Accelerating the science of language models

D Groeneveld, I Beltagy, P Walsh, A Bhagia… - arXiv preprint arXiv …, 2024 - arxiv.org

Language models (LMs) have become ubiquitous in both NLP research and in commercial
product offerings. As their commercial importance has surged, the most powerful models …

Cited by 160

Towards building multilingual language models for medicine

P Qiu, C Wu, X Zhang, W Lin, H Wang, Y Zhang… - Nature …, 2024 - nature.com

The development of open-source, multilingual medical language models can benefit a wide,
linguistically diverse audience from different regions. To promote this domain, we present …

Cited by 51

Med-HALT: Medical domain hallucination test for large language models

A Pal, LK Umapathi, M Sankarasubbu - arXiv preprint arXiv:2307.15343, 2023 - arxiv.org

This research paper focuses on the challenges posed by hallucinations in large language
models (LLMs), particularly in the context of the medical domain. Hallucination, wherein …

Cited by 134

Natural language reasoning, a survey

F Yu, H Zhang, P Tiwari, B Wang - ACM Computing Surveys, 2024 - dl.acm.org

This survey article proposes a clearer view of Natural Language Reasoning (NLR) in the
field of Natural Language Processing (NLP), both conceptually and practically …

Cited by 73

QA dataset explosion: A taxonomy of NLP resources for question answering and reading comprehension

A Rogers, M Gardner, I Augenstein - ACM Computing Surveys, 2023 - dl.acm.org

Alongside huge volumes of research on deep learning models in NLP in recent years,
there has been much work on benchmark datasets needed to track modeling progress …

Cited by 234

Neural natural language processing for unstructured data in electronic health records: a review

I Li, J Pan, J Goldwasser, N Verma, WP Wong… - Computer Science …, 2022 - Elsevier

Electronic health records (EHRs), digital collections of patient healthcare events and
observations, are ubiquitous in medicine and critical to healthcare delivery, operations, and …

Cited by 186

HEAD-QA: A healthcare dataset for complex reasoning