- Academic Search

KS Kalyan - Natural Language Processing Journal, 2024 - Elsevier

Large language models (LLMs) are a special class of pretrained language models (PLMs)
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …

Simpan Kutip Dirujuk 255 kali Artikel terkait 5 versi

[Free GPT-4]

[PDF] arxiv.org

Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arxiv preprint arxiv …, 2023 - arxiv.org

Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Simpan Kutip Dirujuk 483 kali Artikel terkait 3 versi Versi HTML

[Free GPT-4]

[PDF] arxiv.org

Phi-3 technical report: A highly capable language model locally on your phone

M Abdin, J Aneja, H Awadalla, A Awadallah… - arxiv preprint arxiv …, 2024 - arxiv.org

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion
tokens, whose overall performance, as measured by both academic benchmarks and …

Simpan Kutip Dirujuk 757 kali Artikel terkait 2 versi Versi HTML

[Free GPT-4]

[PDF] acm.org

Harnessing the power of llms in practice: A survey on chatgpt and beyond

J Yang, H **, R Tang, X Han, Q Feng, H Jiang… - ACM Transactions on …, 2024 - dl.acm.org

This article presents a comprehensive and practical guide for practitioners and end-users
working with Large Language Models (LLMs) in their downstream Natural Language …

Simpan Kutip Dirujuk 766 kali Artikel terkait 6 versi

[Free GPT-4]

[PDF] aaai.org

Zhong**g: Enhancing the chinese medical capabilities of large language model through expert feedback and real-world multi-turn dialogue

S Yang, H Zhao, S Zhu, G Zhou, H Xu, Y Jia… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Abstract Recent advances in Large Language Models (LLMs) have achieved remarkable
breakthroughs in understanding and responding to user intents. However, their performance …

Simpan Kutip Dirujuk 650 kali Artikel terkait 10 versi Versi HTML

[Free GPT-4]

[PDF] arxiv.org

Large language models for information retrieval: A survey

Y Zhu, H Yuan, S Wang, J Liu, W Liu, C Deng… - arxiv preprint arxiv …, 2023 - arxiv.org

As a primary means of information acquisition, information retrieval (IR) systems, such as
search engines, have integrated themselves into our daily lives. These systems also serve …

Simpan Kutip Dirujuk 293 kali Artikel terkait 3 versi Versi HTML

[Free GPT-4]

[PDF] neurips.cc

Large language model as attributed training data generator: A tale of diversity and bias

Y Yu, Y Zhuang, J Zhang, Y Meng… - Advances in …, 2024 - proceedings.neurips.cc

Large language models (LLMs) have been recently leveraged as training data generators
for various natural language processing (NLP) tasks. While previous research has explored …

Simpan Kutip Dirujuk 182 kali Artikel terkait 5 versi Versi HTML

[Free GPT-4]

[PDF] acm.org

A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions

L Huang, W Yu, W Ma, W Zhong, Z Feng… - ACM Transactions on …, 2024 - dl.acm.org

The emergence of large language models (LLMs) has marked a significant breakthrough in
natural language processing (NLP), fueling a paradigm shift in information acquisition …

Simpan Kutip Dirujuk 100 kali Artikel terkait

[Free GPT-4]

[PDF] arxiv.org

Aligning large language models with human: A survey

Y Wang, W Zhong, L Li, F Mi, X Zeng, W Huang… - arxiv preprint arxiv …, 2023 - arxiv.org

Large Language Models (LLMs) trained on extensive textual corpora have emerged as
leading solutions for a broad array of Natural Language Processing (NLP) tasks. Despite …

Simpan Kutip Dirujuk 283 kali Artikel terkait 2 versi Versi HTML

[Free GPT-4]

[PDF] arxiv.org

Textbooks are all you need ii: phi-1.5 technical report

Y Li, S Bubeck, R Eldan, A Del Giorno… - arxiv preprint arxiv …, 2023 - arxiv.org

We continue the investigation into the power of smaller Transformer-based language
models as initiated by\textbf {TinyStories}--a 10 million parameter model that can produce …

Simpan Kutip Dirujuk 400 kali Artikel terkait 2 versi Versi HTML

Buat notifikasi

Kutip

Penelusuran lanjutan

Disimpan ke Koleksi saya

Textbooks are all you need

[HTML][HTML] A survey of GPT-3 family large language models including ChatGPT and GPT-4

Challenges and applications of large language models

Phi-3 technical report: A highly capable language model locally on your phone

Harnessing the power of llms in practice: A survey on chatgpt and beyond

Zhong**g: Enhancing the chinese medical capabilities of large language model through expert feedback and real-world multi-turn dialogue

Large language models for information retrieval: A survey

Large language model as attributed training data generator: A tale of diversity and bias

A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions

Aligning large language models with human: A survey

Textbooks are all you need ii: phi-1.5 technical report