Transformer-based deep learning for predicting protein properties in the life sciences

A Chandra, L Tünnermann, T Löfstedt, R Gratz - eLife, 2023 - elifesciences.org
Recent developments in deep learning, coupled with an increasing number of sequenced
proteins, have led to a breakthrough in life science applications, in particular in protein …

Language model behavior: A comprehensive survey

TA Chang, BK Bergen - Computational Linguistics, 2024 - direct.mit.edu
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …

A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions

L Huang, W Yu, W Ma, W Zhong, Z Feng… - ACM Transactions on …, 2025 - dl.acm.org
The emergence of large language models (LLMs) has marked a significant breakthrough in
natural language processing (NLP), fueling a paradigm shift in information acquisition …

Faith and fate: Limits of transformers on compositionality

N Dziri, X Lu, M Sclar, XL Li, L Jiang… - Advances in …, 2023 - proceedings.neurips.cc
Transformer large language models (LLMs) have sparked admiration for their exceptional
performance on tasks that demand intricate multi-step reasoning. Yet, these models …

Hallucination is inevitable: An innate limitation of large language models

Z Xu, S Jain, M Kankanhalli - arXiv preprint arXiv:2401.11817, 2024 - arxiv.org
Hallucination has been widely recognized to be a significant drawback for large language
models (LLMs). There have been many works that attempt to reduce the extent of …

Towards revealing the mystery behind chain of thought: A theoretical perspective

G Feng, B Zhang, Y Gu, H Ye, D He… - Advances in Neural …, 2023 - proceedings.neurips.cc
Recent studies have discovered that Chain-of-Thought prompting (CoT) can dramatically
improve the performance of Large Language Models (LLMs), particularly when dealing with …

Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

Transformers as statisticians: Provable in-context learning with in-context algorithm selection

Y Bai, F Chen, H Wang, C Xiong… - Advances in Neural …, 2023 - proceedings.neurips.cc
Neural sequence models based on the transformer architecture have demonstrated
remarkable in-context learning (ICL) abilities, where they can perform new tasks …

What can transformers learn in-context? A case study of simple function classes

S Garg, D Tsipras, PS Liang… - Advances in Neural …, 2022 - proceedings.neurips.cc
In-context learning is the ability of a model to condition on a prompt sequence consisting of
in-context examples (input-output pairs corresponding to some task) along with a new query …

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

A Srivastava, A Rastogi, A Rao, AAM Shoeb… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …