الباحث العلمي من Google

S Ghosh, CKR Evuru, S Kumar, U Tyagi… - The Thirteenth …, 2024‏ - openreview.net‏

Large Vision-Language Models (LVLMs) often produce responses that misalign with factual
information, a phenomenon known as hallucinations. While hallucinations are well-studied …‏

حفظ اقتباس تم اقتباسها في عدد: 2 مقالات ذات صلة الإصدارات الـ 2كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Monotonic paraphrasing improves generalization of language model prompting‏

Q Liu, F Wang, N Xu, T Yan, T Meng… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Performance of large language models (LLMs) may vary with different prompts or
instructions of even the same task. One commonly recognized factor for this phenomenon is …‏

حفظ اقتباس تم اقتباسها في عدد: 5 مقالات ذات صلة الإصدارات الـ 2كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Improving open-ended text generation via adaptive decoding‏

W Zhu, H Hao, Z He, Y Ai, R Wang - arxiv preprint arxiv:2402.18223, 2024‏ - arxiv.org‏

Current language models decode text token by token according to probabilistic distribution,
and determining the appropriate candidates for the next token is crucial to ensure …‏

حفظ اقتباس تم اقتباسها في عدد: 8 مقالات ذات صلة الإصدارات الـ 3كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] ai-plans.com

[PDF][PDF] Future Token Prediction--Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction‏

N Walker - arxiv preprint arxiv:2410.18160, 2024‏ - test.ai-plans.com‏

Causal decoder-only transformer models used for generative language modelling, such as
Generative Pre-trained Transformers (GPT), are trained to predict the next token in a …‏

حفظ اقتباس تم اقتباسها في عدد: 2 مقالات ذات صلة الإصدارات الـ 3كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

" We Need Structured Output": Towards User-centered Constraints on Large Language Model Output‏

MX Liu, F Liu, AJ Fiannaca, T Koo, L Dixon… - Extended Abstracts of …, 2024‏ - dl.acm.org‏

Large language models can produce creative and diverse responses. However, to integrate
them into current developer workflows, it is essential to constrain their outputs to follow …‏

حفظ اقتباس تم اقتباسها في عدد: 22 مقالات ذات صلة الإصدارات الـ 5كلها

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap‏

S Ghosh, CKR Evuru, S Kumar, U Tyagi… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Recent interest in Large Vision-Language Models (LVLMs) for practical applications is
moderated by the significant challenge of hallucination or the inconsistency between the …‏

حفظ اقتباس تم اقتباسها في عدد: 6 مقالات ذات صلة الإصدارات الـ 2كلها إصدار HTML‏

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

Look-back decoding for open-ended text generation

Visual description grounding reduces hallucinations and boosts reasoning in lvlms‏

Monotonic paraphrasing improves generalization of language model prompting‏

Improving open-ended text generation via adaptive decoding‏

[PDF][PDF] Future Token Prediction--Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction‏

" We Need Structured Output": Towards User-centered Constraints on Large Language Model Output‏

VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap‏