A systematic survey and critical review on evaluating large language models: Challenges, limitations, and recommendations
Abstract Large Language Models (LLMs) have recently gained significant attention due to
their remarkable capabilities in performing diverse tasks across various domains. However …
their remarkable capabilities in performing diverse tasks across various domains. However …
Natural Language Generation for Visualizations: State of the Art, Challenges and Future Directions
Natural language and visualization are two complementary modalities of human
communication that play a crucial role in conveying information effectively. While …
communication that play a crucial role in conveying information effectively. While …
Alleviating hallucinations of large language models through induced hallucinations
Y Zhang, L Cui, W Bi, S Shi - ar**_a_Framework_for_Auditing_Large_Language_Models_Using_Human-in-the-Loop/links/65cdc8b6790074549791de40/Develo**-a-Framework-for-Auditing-Large-Language-Models-Using-Human-in-the-Loop.pdf" data-clk="hl=en&sa=T&oi=gga&ct=gga&cd=5&d=10236333257919685027&ei=0VKxZ9DXA4C96rQP29mI6AY" data-clk-atid="o-HRGN3CDo4J" target="_blank">[PDF] researchgate.net
[PDF][PDF] Develo** a framework for auditing large language models using human-in-the-loop
* Work does not relate to position at Amazon. Authors' addresses: Maryam Amirizaniani,
amaryam@ uw. edu, University of Washington, Seattle, WA, USA; Jihan Yao, jihany2@ uw …
amaryam@ uw. edu, University of Washington, Seattle, WA, USA; Jihan Yao, jihany2@ uw …
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
LLMs can generate factually incorrect statements even when provided access to reference
documents. Such errors can be dangerous in high-stakes applications (eg, document …
documents. Such errors can be dangerous in high-stakes applications (eg, document …
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models
With the rise of multimodal applications, instruction data has become critical for training
multimodal language models capable of understanding complex image-based queries …
multimodal language models capable of understanding complex image-based queries …
An Audit on the Perspectives and Challenges of Hallucinations in NLP
We audit how hallucination in large language models (LLMs) is characterized in peer-
reviewed literature, using a critical examination of 103 publications across NLP research …
reviewed literature, using a critical examination of 103 publications across NLP research …
A Comparative Analysis of Text-Based Explainable Recommender Systems
One way to increase trust among users towards recommender systems is to provide the
recommendation along with a textual explanation. In the literature, extraction-based …
recommendation along with a textual explanation. In the literature, extraction-based …