Reasoning with large language models, a survey

A Plaat, A Wong, S Verberne, J Broekens… - arXiv preprint arXiv …, 2024 - arxiv.org
Scaling up language models to billions of parameters has opened up possibilities for in-
context learning, allowing instruction tuning and few-shot learning on tasks that the model …

Training large language models to reason in a continuous latent space

S Hao, S Sukhbaatar, DJ Su, X Li, Z Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) are restricted to reason in the "language space", where they
typically express the reasoning process with a chain-of-thought (CoT) to solve a complex …

Dualformer: Controllable fast and slow thinking by learning with randomized reasoning traces

DJ Su, S Sukhbaatar, M Rabbat, Y Tian… - The Thirteenth …, 2024 - openreview.net
In human cognition theory, human thinking is governed by two systems: the fast and intuitive
System 1 and the slower but more deliberative System 2. Recent studies have shown that …

LongWriter: Unleashing 10,000+ word generation from long context LLMs

Y Bai, J Zhang, X Lv, L Zheng, S Zhu, L Hou… - arXiv preprint arXiv …, 2024 - arxiv.org
Current long context large language models (LLMs) can process inputs up to 100,000
tokens, yet struggle to generate outputs exceeding even a modest length of 2,000 words …

Thinking LLMs: General instruction following with thought generation

T Wu, J Lan, W Yuan, J Jiao, J Weston… - arXiv preprint arXiv …, 2024 - arxiv.org
LLMs are typically trained to answer user questions or follow instructions similarly to how
human experts respond. However, in the standard alignment framework they lack the basic …

[PDF] NuminaMath: The largest public dataset in AI4Maths with 860k pairs of competition math problems and solutions

J Li, E Beeching, L Tunstall, B Lipkin… - Hugging Face …, 2024 - faculty.bicmr.pku.edu.cn
Numina is an open AI4Maths initiative dedicated to advancing both artificial and human
intelligence in the field of mathematics. In this paper, we present the NuminaMath dataset, a …

An Overview of Large Language Models for Statisticians

W Ji, W Yuan, E Getzen, K Cho, MI Jordan… - arXiv preprint arXiv …, 2025 - arxiv.org
Large Language Models (LLMs) have emerged as transformative tools in artificial
intelligence (AI), exhibiting remarkable capabilities across diverse tasks such as text …

Disentangling memory and reasoning ability in large language models

M **, W Luo, S Cheng, X Wang, W Hua… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated strong performance in handling
complex tasks requiring both extensive knowledge and reasoning abilities. However, the …

VipAct: Visual-perception enhancement via specialized VLM agent collaboration and tool-use

Z Zhang, R Rossi, T Yu, F Dernoncourt… - arXiv preprint arXiv …, 2024 - arxiv.org
While vision-language models (VLMs) have demonstrated remarkable performance across
various tasks combining textual and visual information, they continue to struggle with fine …

Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking

X Cheng, J Li, WX Zhao, JR Wen - arXiv preprint arXiv:2501.01306, 2025 - arxiv.org
Large language models (LLMs) demonstrate exceptional capabilities, yet still face the
hallucination issue. Typical text generation approaches adopt an auto-regressive generation …