[HTML] From Turing to transformers: A comprehensive review and tutorial on the evolution and applications of generative transformer models

EY Zhang, AD Cheok, Z Pan, J Cai, Y Yan - Sci, 2023 - mdpi.com
In recent years, generative transformers have become increasingly prevalent in the field of
artificial intelligence, especially within the scope of natural language processing. This paper …

[PDF] A survey of large language models

WX Zhao, K Zhou, J Li, T Tang… - arXiv preprint arXiv …, 2023 - paper-notes.zhjwpku.com
Ever since the Turing Test was proposed in the 1950s, humans have explored the mastering
of language intelligence by machine. Language is essentially a complex, intricate system of …

Medusa: Simple LLM inference acceleration framework with multiple decoding heads

T Cai, Y Li, Z Geng, H Peng, JD Lee, D Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) employ auto-regressive decoding that requires sequential
computation, with each step reliant on the previous one's output. This creates a bottleneck …
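The sequential dependency this snippet describes can be illustrated with a minimal sketch; `next_token` below is a toy stand-in for a real LLM forward pass (an assumption for illustration, not Medusa's model or API), showing why each decoding step must wait on the previous one.

```python
# Minimal sketch of the auto-regressive decoding bottleneck.
# Each step consumes the full prefix produced so far, so the
# steps form a chain and cannot run in parallel.

def next_token(prefix):
    # Hypothetical predictor: derives a token from the prefix length.
    return f"tok{len(prefix)}"

def greedy_decode(prompt, max_new_tokens=4):
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(next_token(tokens))  # step t depends on steps 0..t-1
    return tokens

print(greedy_decode(["<s>"]))
```

Medusa's premise is that attaching extra decoding heads lets several of these steps be proposed at once and then verified, relaxing the strict one-token-per-forward-pass chain above.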

[HTML][HTML] Applying large language models and chain-of-thought for automatic scoring

GG Lee, E Latif, X Wu, N Liu, X Zhai - Computers and Education: Artificial …, 2024 - Elsevier
This study investigates the application of large language models (LLMs), specifically GPT-3.5 and GPT-4, with Chain-of-Thought (CoT) in the automatic scoring of student-written …

Evaluating the world model implicit in a generative model

K Vafa, J Chen, A Rambachan… - Advances in …, 2025 - proceedings.neurips.cc
Recent work suggests that large language models may implicitly learn world models. How
should we assess this possibility? We formalize this question for the case where the …

Evaluating large language models on controlled generation tasks

J Sun, Y Tian, W Zhou, N Xu, Q Hu, R Gupta… - arXiv preprint arXiv …, 2023 - arxiv.org
While recent studies have looked into the abilities of large language models in various
benchmark tasks, including question generation, reading comprehension, multilingual and …

A thorough examination of decoding methods in the era of LLMs

C Shi, H Yang, D Cai, Z Zhang, Y Wang, Y Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Decoding methods play an indispensable role in converting language models from next-token predictors into practical task solvers. Prior research on decoding methods, primarily …

From decoding to meta-generation: Inference-time algorithms for large language models

S Welleck, A Bertsch, M Finlayson… - arXiv preprint arXiv …, 2024 - arxiv.org
One of the most striking findings in modern research on large language models (LLMs) is
that scaling up compute during training leads to better results. However, less attention has …

Beyond extractive: advancing abstractive automatic text summarization in norwegian with transformers

JJ Navjord, JMR Korsvik - 2023 - nmbu.brage.unit.no
Automatic summarization is a key area in natural language processing (NLP) and machine
learning which attempts to generate informative summaries of articles and documents …

Uncertainty in natural language generation: From theory to applications

J Baan, N Daheim, E Ilia, D Ulmer, HS Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances of powerful Language Models have allowed Natural Language
Generation (NLG) to emerge as an important technology that can not only perform traditional …