Theo dõi
Edoardo Debenedetti
Edoardo Debenedetti
PhD student @ ETH Zürich
Email được xác minh tại inf.ethz.ch - Trang chủ
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
Robustbench: a standardized adversarial robustness benchmark
F Croce*, M Andriushchenko*, V Sehwag*, E Debenedetti*, N Flammarion, ...
NeurIPS 2021 Datasets and Benchmark Track, 2021
7942021
Jailbreakbench: An open robustness benchmark for jailbreaking large language models
P Chao*, E Debenedetti*, A Robey*, M Andriushchenko*, F Croce, ...
NeurIPS 2024 Datasets and Benchmark Track, 2024
1152024
A light recipe to train robust vision transformers
E Debenedetti, V Sehwag, P Mittal
IEEE SaTML 2023, 225-253, 2023
672023
Privacy side channels in machine learning systems
E Debenedetti, G Severi, N Carlini, CA Choquette-Choo, M Jagielski, ...
33rd USENIX Security Symposium (USENIX Security 24), 6861-6848, 2024
362024
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
E Debenedetti, J Zhang, M Balunovic, L Beurer-Kellner, M Fischer, ...
NeurIPS 2024 Datasets and Benchmark Track, 2024
24*2024
Ai risk management should incorporate both safety and security
X Qi, Y Huang, Y Zeng, E Debenedetti, J Geiping, L He, K Huang, ...
arXiv preprint arXiv:2405.19524, 2024
152024
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
E Debenedetti*, J Rando*, D Paleka*, SF Florin, D Albastroiu, N Cohen, ...
NeurIPS 2024 Datasets and Benchmark Track (Spotlight), 2024
122024
Evading black-box classifiers without breaking eggs
E Debenedetti, N Carlini, F Tramèr
IEEE SaTML 2024 (Distinguished Paper Award Runner-up), 408-424, 2024
82024
Scaling compute is not all you need for adversarial robustness
E Debenedetti, Z Wan, M Andriushchenko, V Sehwag, K Bhardwaj, ...
ICLR 2024 Workshop on Reliable and Responsible Foundation Models, 2023
82023
Adversarial search engine optimization for large language models
F Nestaas, E Debenedetti, F Tramèr
ICLR 2025, 2024
62024
Exploring Memorization and Copyright Violation in Frontier LLMs: A Study of the New York Times v. OpenAI 2023 Lawsuit
J Freeman, C Rippe, E Debenedetti, M Andriushchenko
NeurIPS 2024 Safe Generative AI Workshop, 2024
12024
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
M Aerni, J Rando, E Debenedetti, N Carlini, D Ippolito, F Tramèr
ICLR 2025, 2024
2024
AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses
N Carlini, E Debenedetti, J Rando, M Nasr, F Tramèr
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–13