Seguir
Edoardo Debenedetti
Edoardo Debenedetti
PhD student @ ETH Zürich
Dirección de correo verificada de inf.ethz.ch - Página principal
Título
Citado por
Citado por
Año
Robustbench: a standardized adversarial robustness benchmark
F Croce*, M Andriushchenko*, V Sehwag*, E Debenedetti*, N Flammarion, ...
NeurIPS 2021 Datasets and Benchmark Track, 2021
7832021
Jailbreakbench: An open robustness benchmark for jailbreaking large language models
P Chao*, E Debenedetti*, A Robey*, M Andriushchenko*, F Croce, ...
NeurIPS 2024 Datasets and Benchmark Track, 2024
972024
A light recipe to train robust vision transformers
E Debenedetti, V Sehwag, P Mittal
IEEE SaTML 2023, 225-253, 2023
662023
Privacy side channels in machine learning systems
E Debenedetti, G Severi, N Carlini, CA Choquette-Choo, M Jagielski, ...
33rd USENIX Security Symposium (USENIX Security 24), 6861-6848, 2024
322024
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
E Debenedetti, J Zhang, M Balunovic, L Beurer-Kellner, M Fischer, ...
NeurIPS 2024 Datasets and Benchmark Track, 2024
19*2024
AI Risk Management Should Incorporate Both Safety and Security
X Qi, Y Huang, Y Zeng, E Debenedetti, J Geiping, L He, K Huang, ...
arXiv preprint arXiv:2405.19524, 2024
132024
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
E Debenedetti*, J Rando*, D Paleka*, SF Florin, D Albastroiu, N Cohen, ...
NeurIPS 2024 Datasets and Benchmark Track (Spotlight), 2024
92024
Evading black-box classifiers without breaking eggs
E Debenedetti, N Carlini, F Tramèr
IEEE SaTML 2024 (Distinguished Paper Award Runner-up), 408-424, 2024
82024
Scaling compute is not all you need for adversarial robustness
E Debenedetti, Z Wan, M Andriushchenko, V Sehwag, K Bhardwaj, ...
ICLR 2024 Workshop on Reliable and Responsible Foundation Models, 2023
82023
Adversarial search engine optimization for large language models
F Nestaas, E Debenedetti, F Tramèr
ICLR 2025, 2024
52024
Exploring Memorization and Copyright Violation in Frontier LLMs: A Study of the New York Times v. OpenAI 2023 Lawsuit
J Freeman, C Rippe, E Debenedetti, M Andriushchenko
NeurIPS 2024 Safe Generative AI Workshop, 0
1*
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
M Aerni, J Rando, E Debenedetti, N Carlini, D Ippolito, F Tramèr
ICLR 2025, 2024
2024
AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses
N Carlini, E Debenedetti, J Rando, M Nasr, F Tramèr
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–13