Explainable generative ai (genxai): A survey, conceptualization, and research agenda

J Schneider - Artificial Intelligence Review, 2024 - Springer
Generative AI (GenAI) represents a shift from AI's ability to “recognize” to its ability to
“generate” solutions for a wide range of tasks. As generated solutions and applications grow …

Explainability for large language models: A survey

H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …

Rethinking interpretability in the era of large language models

C Singh, JP Inala, M Galley, R Caruana… - arxiv preprint arxiv …, 2024 - arxiv.org
Interpretable machine learning has exploded as an area of interest over the last decade,
sparked by the rise of increasingly large datasets and deep neural networks …

Large scale foundation models for intelligent manufacturing applications: a survey

H Zhang, SD Semujju, Z Wang, X Lv, K Xu… - Journal of Intelligent …, 2025 - Springer
Although the applications of artificial intelligence especially deep learning have greatly
improved various aspects of intelligent manufacturing, they still face challenges for broader …

Rigorously assessing natural language explanations of neurons

J Huang, A Geiger, K D'Oosterlinck, Z Wu… - arxiv preprint arxiv …, 2023 - arxiv.org
Natural language is an appealing medium for explaining how large language models
process and store information, but evaluating the faithfulness of such explanations is …

Xai meets llms: A survey of the relation between explainable ai and large language models

E Cambria, L Malandri, F Mercorio, N Nobani… - arxiv preprint arxiv …, 2024 - arxiv.org
In this survey, we address the key challenges in Large Language Models (LLM) research,
focusing on the importance of interpretability. Driven by increasing interest from AI and …

Self-verification improves few-shot clinical information extraction

Z Gero, C Singh, H Cheng, T Naumann… - arxiv preprint arxiv …, 2023 - arxiv.org
Extracting patient information from unstructured text is a critical task in health decision-
support and clinical research. Large language models (LLMs) have shown the potential to …

An interdisciplinary outlook on large language models for scientific research

J Boyko, J Cohen, N Fox, MH Veiga, JI Li, J Liu… - arxiv preprint arxiv …, 2023 - arxiv.org
In this paper, we describe the capabilities and constraints of Large Language Models
(LLMs) within disparate academic disciplines, aiming to delineate their strengths and …

Tell your model where to attend: Post-hoc attention steering for llms

Q Zhang, C Singh, L Liu, X Liu, B Yu, J Gao… - arxiv preprint arxiv …, 2023 - arxiv.org
In human-written articles, we often leverage the subtleties of text style, such as bold and
italics, to guide the attention of readers. These textual emphases are vital for the readers to …

From language modeling to instruction following: Understanding the behavior shift in llms after instruction tuning

X Wu, W Yao, J Chen, X Pan, X Wang, N Liu… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have achieved remarkable success, demonstrating
powerful instruction-following capabilities across diverse tasks. Instruction fine-tuning is …