Survey of hallucination in natural language generation

Z Ji, N Lee, R Frieske, T Yu, D Su, Y Xu, E Ishii… - ACM computing …, 2023 - dl.acm.org
Natural Language Generation (NLG) has improved exponentially in recent years thanks to
the development of sequence-to-sequence deep learning technologies such as Transformer …

Towards mitigating hallucination in large language models via self-reflection

Z Ji, T Yu, Y Xu, N Lee, E Ishii, P Fung - arxiv preprint arxiv:2310.06271, 2023 - arxiv.org
Large language models (LLMs) have shown promise for generative and knowledge-
intensive tasks including question-answering (QA) tasks. However, the practical deployment …

Exploring and evaluating hallucinations in llm-powered code generation

F Liu, Y Liu, L Shi, H Huang, R Wang, Z Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
The rise of Large Language Models (LLMs) has significantly advanced many applications
on software engineering tasks, particularly in code generation. Despite the promising …

FaithDial: A Faithful Benchmark for Information-Seeking Dialogue

N Dziri, E Kamalloo, S Milton, O Zaiane… - Transactions of the …, 2022 - direct.mit.edu
The goal of information-seeking dialogue is to respond to seeker queries with natural
language utterances that are grounded on knowledge sources. However, dialogue systems …

Elastic weight removal for faithful and abstractive dialogue generation

N Daheim, N Dziri, M Sachan, I Gurevych… - arxiv preprint arxiv …, 2023 - arxiv.org
Ideally, dialogue systems should generate responses that are faithful to the knowledge
contained in relevant documents. However, many models generate hallucinated responses …

Token-budget-aware llm reasoning

T Han, C Fang, S Zhao, S Ma, Z Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
Reasoning is critical for large language models (LLMs) to excel in a wide range of tasks.
While methods like Chain-of-Thought (CoT) reasoning enhance LLM performance by …

Concise thoughts: Impact of output length on llm reasoning and cost

S Nayab, G Rossolini, M Simoni, A Saracino… - arxiv preprint arxiv …, 2024 - arxiv.org
Today's large language models (LLMs) can solve challenging question-answering tasks,
and prompt engineering techniques, such as chain-of-thought (CoT), have gained attention …

Generative models for product attribute extraction

A Blume, N Zalmout, H Ji, X Li - Proceedings of the 2023 …, 2023 - aclanthology.org
Product attribute extraction is an emerging field in information extraction and e-commerce,
with applications including knowledge base construction, product recommendation, and …

Fidelity-enriched contrastive search: reconciling the faithfulness-diversity trade-off in text generation

WL Chen, CK Wu, HH Chen, CC Chen - arxiv preprint arxiv:2310.14981, 2023 - arxiv.org
In this paper, we address the hallucination problem commonly found in natural language
generation tasks. Language models often generate fluent and convincing content but can …

[PDF][PDF] Generative AI Decision-Making Attributes in Complex Health Services: A Rapid Review

N Doreswamy, L Horstmanshof - Cureus, 2025 - cureus.com
Abstract The advent of Generative Artificial Intelligence (Generative AI or GAI) marks a
significant inflection point in AI development. Long viewed as the epitome of reasoning and …