Google 학술 검색

AJ Thirunavukarasu, DSJ Ting, K Elangovan… - Nature medicine, 2023 - nature.com

Large language models (LLMs) can respond to free-text queries without being specifically
trained in the task in question, causing excitement and concern about their use in healthcare …

저장 인용 2039회 인용 관련 학술자료 전체 7개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM transactions on …, 2024 - dl.acm.org

Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

저장 인용 2270회 인용 관련 학술자료 전체 8개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Direct preference optimization: Your language model is secretly a reward model

R Rafailov, A Sharma, E Mitchell… - Advances in …, 2023 - proceedings.neurips.cc

While large-scale unsupervised language models (LMs) learn broad world knowledge and
some reasoning skills, achieving precise control of their behavior is difficult due to the …

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Gemini: a family of highly capable multimodal models

G Team, R Anil, S Borgeaud, JB Alayrac, J Yu… - arxiv preprint arxiv …, 2023 - arxiv.org

This report introduces a new family of multimodal models, Gemini, that exhibit remarkable
capabilities across image, audio, video, and text understanding. The Gemini family consists …

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Llama 2: Open foundation and fine-tuned chat models

H Touvron, L Martin, K Stone, P Albert… - arxiv preprint arxiv …, 2023 - arxiv.org

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large
language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine …

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Visual instruction tuning

H Liu, C Li, Q Wu, YJ Lee - Advances in neural information …, 2023 - proceedings.neurips.cc

Instruction tuning large language models (LLMs) using machine-generated instruction-
following data has been shown to improve zero-shot capabilities on new tasks, but the idea …

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Qlora: Efficient finetuning of quantized llms

T Dettmers, A Pagnoni, A Holtzman… - Advances in neural …, 2023 - proceedings.neurips.cc

We present QLoRA, an efficient finetuning approach that reduces memory usage enough to
finetune a 65B parameter model on a single 48GB GPU while preserving full 16-bit …

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Judging llm-as-a-judge with mt-bench and chatbot arena

L Zheng, WL Chiang, Y Sheng… - Advances in …, 2023 - proceedings.neurips.cc

Evaluating large language model (LLM) based chat assistants is challenging due to their
broad capabilities and the inadequacy of existing benchmarks in measuring human …

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Tree of thoughts: Deliberate problem solving with large language models

S Yao, D Yu, J Zhao, I Shafran… - Advances in neural …, 2023 - proceedings.neurips.cc

Abstract Language models are increasingly being deployed for general problem solving
across a wide range of tasks, but are still confined to token-level, left-to-right decision …

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Qwen technical report

J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng… - arxiv preprint arxiv …, 2023 - arxiv.org

Large language models (LLMs) have revolutionized the field of artificial intelligence,
enabling natural language processing tasks that were previously thought to be exclusive to …

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Llama: Open and efficient foundation language models

Large language models in medicine

A survey on evaluation of large language models

Direct preference optimization: Your language model is secretly a reward model

Gemini: a family of highly capable multimodal models

Llama 2: Open foundation and fine-tuned chat models

Visual instruction tuning

Qlora: Efficient finetuning of quantized llms

Judging llm-as-a-judge with mt-bench and chatbot arena

Tree of thoughts: Deliberate problem solving with large language models

Qwen technical report