Ryan Carey

Citata da

	Tutte	Dal 2020
Citazioni	237	219
Indice H	9	8
i10-index	9	8

2017201820192020202120222023202420254 4 9 21 24 26 51 86 11

Accesso pubblico

Visualizza tutto

4 articoli

0 articoli

Disponibili

Non disponibili

In base ai mandati di finanziamento

Coautori

Tom EverittStaff Research Scientist at Google DeepMindEmail verificata su google.com

Segui

Ryan Carey

University of Oxford

Email verificata su philosophy.ox.ac.uk - Home page

AI Safety Causality Incentives


Titolo Ordina per citazioni Ordina per anno Ordina per titolo	Citata da Citata da	Anno
Agent Incentives: A Causal Perspective T Everitt, R Carey, E Langlois, PA Ortega, S Legg AAAI, 2021	59	2021
Path-Specific Objectives for Safer Agent Incentives S Farquhar, R Carey, T Everitt AAAI, 2022	29	2022
Incorrigibility in the CIRL Framework R Carey Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 30-35, 2018	28*	2018
Why Fair Labels Can Yield Unfair Predictions: Graphical Conditions for Introduced Unfairness C Ashurst, R Carey, S Chiappa, T Everitt AAAI, 2022	20	2022
Reasoning about Causality in Games L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge AI Journal, 2023	19	2023
Human Control: Definitions and Algorithms R Carey, T Everitt UAI, 2023	17	2023
Predicting human deliberative judgments with machine learning O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ... Technical report, University of Oxford, 2018	16	2018
The Incentives that Shape Behaviour R Carey, E Langlois, T Everitt, S Legg Safe AI AAAI Workshop, 2020	14	2020
Interpreting AI Compute Trends R Carey AI Impacts Blog, 2018	10	2018
The Effective Altruism Handbook R Carey The Centre for Effective Altruism, 2015	8	2015
PyCID: A Python Library for Causal Influence Diagrams J Fox, T Everitt, R Carey, E Langlois, A Abate, M Wooldridge SciPy, 2021	7	2021
A Complete Criterion for Value of Information in Soluble Influence Diagrams C van Merwijk, R Carey, T Everitt AAAI, 2022	6	2022
How useful is quantilization for mitigating specification-gaming? R Carey SafeML Workshop at International Conference on Learning Representations, 2019	3	2019
Reasoning about causality in games (abstract reprint) L Hammond, J Fox, T Everitt, R Carey, A Abate, M Wooldridge Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22697 …, 2024	1	2024
Toward a complete criterion for value of information in insoluble decision problems S Lee, R Carey, R Evans Transactions of Machine Learning Research, 2024		2024
(When) Is Truth-telling Favored in AI Debate? V Kovařík, R Carey SafeAI AAAI Workshop, 2019		2019

Il sistema al momento non può eseguire l'operazione. Riprova più tardi.

Articoli 1–16

Citazioni per anno

Citazioni duplicate

Citazioni unite

Aggiungi coautoriCoautori

Segui

Citata da

Coautori