Bartscore: Evaluating generated text as text generation
A wide variety of NLP applications, such as machine translation, summarization, and dialog,
involve text generation. One major challenge for these applications is how to evaluate …
involve text generation. One major challenge for these applications is how to evaluate …
Progressive-hint prompting improves reasoning in large language models
The performance of Large Language Models (LLMs) in reasoning tasks depends heavily on
prompt design, with Chain-of-Thought (CoT) and self-consistency being critical methods that …
prompt design, with Chain-of-Thought (CoT) and self-consistency being critical methods that …
Self-rag: Learning to retrieve, generate, and critique through self-reflection
Despite their remarkable capabilities, large language models (LLMs) often produce
responses containing factual inaccuracies due to their sole reliance on the parametric …
responses containing factual inaccuracies due to their sole reliance on the parametric …
Retrieval augmentation reduces hallucination in conversation
Despite showing increasingly human-like conversational abilities, state-of-the-art dialogue
models often suffer from factual incorrectness and hallucination of knowledge (Roller et al …
models often suffer from factual incorrectness and hallucination of knowledge (Roller et al …
Dense text retrieval based on pretrained language models: A survey
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …
required to return relevant information resources to user's queries in natural language. From …
RocketQA: An optimized training approach to dense passage retrieval for open-domain question answering
In open-domain question answering, dense passage retrieval has become a new paradigm
to retrieve relevant passages for finding answers. Typically, the dual-encoder architecture is …
to retrieve relevant passages for finding answers. Typically, the dual-encoder architecture is …