Segui
Lewis Hammond
Titolo
Citata da
Citata da
Anno
Foundational Challenges in Assuring Alignment and Safety of Large Language Models
U Anwar, A Saparov, J Rando, D Paleka, M Turpin, P Hase, ES Lubana, ...
Transactions on Machine Learning Research, 2024
136*2024
Multi-Agent Reinforcement Learning with Temporal Logic Specifications
L Hammond, A Abate, J Gutierrez, M Wooldridge
International Conference on Autonomous Agents and Multi-Agent Systems, 583-592, 2021
522021
Rational Verification: Game-Theoretic Verification of Multi-Agent Systems
A Abate, J Gutierrez, L Hammond, P Harrenstein, M Kwiatkowska, M Najib, ...
Applied Intelligence 51 (9), 6569-6584, 2021
332021
Open Problems in Technical AI Governance
A Reuel, B Bucknall, S Casper, T Fist, L Soder, O Aarne, L Hammond, ...
arXiv preprint arXiv:2407.14981, 2024
27*2024
Lexicographic Multi-Objective Reinforcement Learning
J Skalse, L Hammond, C Griffin, A Abate
International Joint Conference on Artificial Intelligence, 3430-3436, 2022
272022
Welfare Diplomacy: Benchmarking Language Model Cooperation
G Mukobi, H Erlebach, N Lauffer, L Hammond, A Chan, J Clifton
Socially Responsible Language Modelling Research Workshop at NeurIPS, 2023
232023
Visibility into AI Agents
A Chan, C Ezell, M Kaufmann, K Wei, L Hammond, H Bradley, E Bluemke, ...
ACM Conference on Fairness, Accountability, and Transparency, 958-973, 2024
222024
Reasoning about Causality in Games
L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge
Artificial Intelligence 320, 103919, 2023
202023
Rational Verification for Probabilistic Systems
J Gutierrez, L Hammond, AW Lin, M Najib, M Wooldridge
International Conference on Principles of Knowledge Representation and …, 2021
152021
Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice
L Hammond, J Fox, T Everitt, A Abate, M Wooldridge
International Conference on Autonomous Agents and Multi-Agent Systems, 574-582, 2021
132021
Learning Tractable Probabilistic Models for Moral Responsibility and Blame
L Hammond, V Belle
Data Mining and Knowledge Discovery 35 (2), 621–659, 2021
12*2021
Secret Collusion among AI Agents: Multi-Agent Deception via Steganography
SR Motwani, M Baranchuk, M Strohmeier, V Bolina, P Torr, L Hammond, ...
Neural Information Processing Systems, 2024
10*2024
Bounded Robustness in Reinforcement Learning via Lexicographic Objectives
DJ Ornia, L Romao, L Hammond, M Mazo Jr, A Abate
Learning for Dynamics & Control Conference, 954-967, 2024
5*2024
IDs for AI Systems
A Chan, N Kolt, P Wills, U Anwar, CS de Witt, N Rajkumar, L Hammond, ...
Regulatable ML Workshop at NeurIPS, 2024
42024
On Imperfect Recall in Multi-Agent Influence Diagrams
J Fox, M MacDermott, L Hammond, P Harrenstein, A Abate, M Wooldridge
Conference on Theoretical Aspects of Rationality and Knowledge, 201–22, 2023
42023
Game Theory with Simulation in the Presence of Unpredictable Randomisation
V Kovarik, N Sauerberg, L Hammond, V Conitzer
arXiv preprint arXiv:2410.14311, 2024
22024
All’s Well That Ends Well: Avoiding Side Effects with Distance-Impact Penalties
C Griffin, J Skalse, L Hammond, A Abate
ML Safety Workshop at NeurIPS, 2022
22022
Cooperation and Control in Delegation Games
O Sourbut, L Hammond, H Wood
International Joint Conference on Artificial Intelligence, 229-237, 2024
12024
Neural Interactive Proofs
L Hammond, S Adam-Day
arXiv preprint arXiv:2412.08897, 2024
2024
Melting Pot Contest: Charting the Future of Generalized Cooperative Intelligence
R Trivedi, A Khan, J Clifton, L Hammond, EA Duéñez-Guzmán, ...
Neural Information Processing Systems (Datasets and Benchmarks Track), 2024
2024
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20