Survey on factuality in large language models: Knowledge, retrieval and domain-specificity
This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As
LLMs find applications across diverse domains, the reliability and accuracy of their outputs …
LLMs find applications across diverse domains, the reliability and accuracy of their outputs …
Combating misinformation in the age of llms: Opportunities and challenges
Misinformation such as fake news and rumors is a serious threat for information ecosystems
and public trust. The emergence of large language models (LLMs) has great potential to …
and public trust. The emergence of large language models (LLMs) has great potential to …
Large language models for information retrieval: A survey
As a primary means of information acquisition, information retrieval (IR) systems, such as
search engines, have integrated themselves into our daily lives. These systems also serve …
search engines, have integrated themselves into our daily lives. These systems also serve …
Knowledge conflicts for llms: A survey
This survey provides an in-depth analysis of knowledge conflicts for large language models
(LLMs), highlighting the complex challenges they encounter when blending contextual and …
(LLMs), highlighting the complex challenges they encounter when blending contextual and …
Generate-then-ground in retrieval-augmented generation for multi-hop question answering
Multi-Hop Question Answering (MHQA) tasks present a significant challenge for large
language models (LLMs) due to the intensive knowledge required. Current solutions, like …
language models (LLMs) due to the intensive knowledge required. Current solutions, like …
Financemath: Knowledge-intensive math reasoning in finance domains
We introduce FinanceMath, a novel benchmark designed to evaluate LLMs' capabilities in
solving knowledge-intensive math reasoning problems. Compared to prior works, this study …
solving knowledge-intensive math reasoning problems. Compared to prior works, this study …
Halueval-wild: Evaluating hallucinations of language models in the wild
Hallucinations pose a significant challenge to the reliability of large language models
(LLMs) in critical domains. Recent benchmarks designed to assess LLM hallucinations …
(LLMs) in critical domains. Recent benchmarks designed to assess LLM hallucinations …
Beyond the turn-based game: Enabling real-time conversations with duplex models
As large language models (LLMs) increasingly permeate daily lives, there is a growing
demand for real-time interactions that mirror human conversations. Traditional turn-based …
demand for real-time interactions that mirror human conversations. Traditional turn-based …
Trustworthy, responsible, and safe ai: A comprehensive architectural framework for ai safety with challenges and mitigations
AI Safety is an emerging area of critical importance to the safe adoption and deployment of
AI systems. With the rapid proliferation of AI and especially with the recent advancement of …
AI systems. With the rapid proliferation of AI and especially with the recent advancement of …
Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts When Knowledge Conflicts?
While auxiliary information has become a key to enhancing Large Language Models
(LLMs), relatively little is known about how LLMs merge these contexts, specifically contexts …
(LLMs), relatively little is known about how LLMs merge these contexts, specifically contexts …