Takip et
Iason Gabriel
Iason Gabriel
Senior Staff Research Scientist, Google DeepMind
google.com üzerinde doğrulanmış e-posta adresine sahip
Başlık
Alıntı yapanlar
Alıntı yapanlar
Yıl
Scaling language models: Methods, analysis & insights from training gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
11512021
Ethical and social risks of harm from language models
L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ...
arXiv preprint arXiv:2112.04359, 2021
10722021
Artificial intelligence, values, and alignment
I Gabriel
Minds and machines 30 (3), 411-437, 2020
7952020
Taxonomy of risks posed by language models
L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ...
Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022
6162022
Improving alignment of dialogue agents via targeted human judgements
A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ...
arXiv preprint arXiv:2209.14375, 2022
4862022
Power to the people? Opportunities and challenges for participatory AI
A Birhane, W Isaac, V Prabhakaran, M Diaz, MC Elish, I Gabriel, ...
Proceedings of the 2nd ACM Conference on Equity and Access in Algorithms …, 2022
2612022
Alignment of language agents
Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving
arXiv preprint arXiv:2103.14659, 2021
1672021
Effective altruism and its critics
I Gabriel
Journal of Applied Philosophy 34 (4), 457-473, 2017
1662017
Model evaluation for extreme risks
T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ...
arXiv preprint arXiv:2305.15324, 2023
1572023
Sociotechnical safety evaluation of generative ai systems
L Weidinger, M Rauh, N Marchal, A Manzini, LA Hendricks, ...
arXiv preprint arXiv:2310.11986, 2023
1412023
In conversation with artificial intelligence: aligning language models with human values
A Kasirzadeh, I Gabriel
Philosophy & Technology 36 (2), 27, 2023
1332023
Toward a theory of justice for artificial intelligence
I Gabriel
Daedalus 151 (2), 218-231, 2022
902022
A human rights-based approach to responsible AI
V Prabhakaran, M Mitchell, T Gebru, I Gabriel
arXiv preprint arXiv:2210.02667, 2022
65*2022
The Challenge of Value Alignment
I Gabriel, V Ghazavi
The Oxford Handbook of Digital Ethics, 2022
64*2022
The ethics of advanced ai assistants
I Gabriel, A Manzini, G Keeling, LA Hendricks, V Rieser, H Iqbal, ...
arXiv preprint arXiv:2404.16244, 2024
522024
Characteristics of harmful text: Towards rigorous benchmarking of language models
M Rauh, J Mellor, J Uesato, PS Huang, J Welbl, L Weidinger, S Dathathri, ...
Advances in Neural Information Processing Systems 35, 24720-24739, 2022
482022
Using the Veil of Ignorance to align AI systems with principles of justice
L Weidinger, KR McKee, R Everett, S Huang, TO Zhu, MJ Chadwick, ...
Proceedings of the National Academy of Sciences 120 (18), e2213709120, 2023
362023
Beyond privacy trade-offs with structured transparency
A Trask, E Bluemke, T Collins, BGE Drexler, CG Cuervas-Mons, I Gabriel, ...
arXiv preprint arXiv:2012.08347, 2020
322020
STELA: a community-centred approach to norm elicitation for AI alignment
S Bergman, N Marchal, J Mellor, S Mohamed, I Gabriel, W Isaac
Scientific Reports 14 (1), 6616, 2024
232024
Permissible secrets
H Lazenby, I Gabriel
The Philosophical Quarterly 68 (271), 265-285, 2018
222018
Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.
Makaleler 1–20