Segui
Ryan Carey
Ryan Carey
Email verificata su philosophy.ox.ac.uk - Home page
Titolo
Citata da
Citata da
Anno
Agent Incentives: A Causal Perspective
T Everitt, R Carey, E Langlois, PA Ortega, S Legg
AAAI, 2021
592021
Path-Specific Objectives for Safer Agent Incentives
S Farquhar, R Carey, T Everitt
AAAI, 2022
292022
Incorrigibility in the CIRL Framework
R Carey
Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 30-35, 2018
28*2018
Why Fair Labels Can Yield Unfair Predictions: Graphical Conditions for Introduced Unfairness
C Ashurst, R Carey, S Chiappa, T Everitt
AAAI, 2022
202022
Reasoning about Causality in Games
L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge
AI Journal, 2023
192023
Human Control: Definitions and Algorithms
R Carey, T Everitt
UAI, 2023
172023
Predicting human deliberative judgments with machine learning
O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ...
Technical report, University of Oxford, 2018
162018
The Incentives that Shape Behaviour
R Carey, E Langlois, T Everitt, S Legg
Safe AI AAAI Workshop, 2020
142020
Interpreting AI Compute Trends
R Carey
AI Impacts Blog, 2018
102018
The Effective Altruism Handbook
R Carey
The Centre for Effective Altruism, 2015
82015
PyCID: A Python Library for Causal Influence Diagrams
J Fox, T Everitt, R Carey, E Langlois, A Abate, M Wooldridge
SciPy, 2021
72021
A Complete Criterion for Value of Information in Soluble Influence Diagrams
C van Merwijk, R Carey, T Everitt
AAAI, 2022
62022
How useful is quantilization for mitigating specification-gaming?
R Carey
SafeML Workshop at International Conference on Learning Representations, 2019
32019
Reasoning about causality in games (abstract reprint)
L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge
Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22697 …, 2024
12024
Toward a complete criterion for value of information in insoluble decision problems
S Lee, R Carey, R Evans
Transactions of Machine Learning Research, 2024
2024
(When) Is Truth-telling Favored in AI Debate?
V Kovařík, R Carey
SafeAI AAAI Workshop, 2019
2019
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–16