Grounding and evaluation for large language models: Practical challenges and lessons learned (survey)

K Kenthapadi, M Sameki, A Taly - Proceedings of the 30th ACM SIGKDD …, 2024 - dl.acm.org
With the ongoing rapid adoption of Artificial Intelligence (AI)-based systems in high-stakes
domains, ensuring the trustworthiness, safety, and observability of these systems has …

Infecting Generative AI With Viruses

D Noever, F McKee - arxiv preprint arxiv:2501.05542, 2025 - arxiv.org
This study demonstrates a novel approach to testing the security boundaries of Vision-Large
Language Model (VLM/LLM) using the EICAR test file embedded within JPEG images. We …

Measurement challenges in AI catastrophic risk governance and safety frameworks

A Kasirzadeh - arxiv preprint arxiv:2410.00608, 2024 - arxiv.org
Safety frameworks represent a significant development in AI governance: they are the first
type of publicly shared catastrophic risk management framework developed by major AI …