Who validates the validators? Aligning LLM-assisted evaluation of LLM outputs with human preferences
Due to the cumbersome nature of human evaluation and limitations of code-based
evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in …
Artificial intelligence co-piloted auditing
This paper proposes the concept of artificial intelligence co-piloted auditing, emphasizing
the collaborative potential of auditors and foundation models in the auditing domain. The …
How do data analysts respond to AI assistance? A Wizard-of-Oz study
Data analysis is challenging as analysts must navigate nuanced decisions that may yield
divergent conclusions. AI assistants have the potential to support analysts in planning their …
Improving steering and verification in AI-assisted data analysis with interactive task decomposition
LLM-powered tools like ChatGPT Data Analysis have the potential to help users tackle the
challenging task of data analysis programming, which requires expertise in data processing …
Beyond the chat: Executable and verifiable text-editing with LLMs
Conversational interfaces powered by Large Language Models (LLMs) have recently
become a popular way to obtain feedback during document editing. However, standard chat …
A framework for exploring the consequences of AI-mediated enterprise knowledge access and identifying risks to workers
Organisations generate vast amounts of information, which has resulted in a long-term
research effort into knowledge access systems for enterprise settings. Recent developments …
Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models
Sensemaking in unfamiliar domains can be challenging, demanding considerable user
effort to compare different options with respect to various criteria. Prior research and our …
Large Language Models Cannot Explain Themselves
A Sarkar - arXiv preprint arXiv:2405.04382, 2024 - arxiv.org
Large language models can be prompted to produce text. They can also be prompted to
produce "explanations" of their output. But these are not really explanations, because they …
Data Analysis in the Era of Generative AI
This paper explores the potential of AI-powered tools to reshape data analysis, focusing on
design considerations and challenges. We explore how the emergence of large language …