Markus Anderljung

Cited by

	All	Since 2020
Citations	1666	1658
H-index	17	17
i10-index	26	26

Citations per year
Year	2020	2021	2022	2023	2024	2025
Citations	32	86	162	354	859	154

Public access

2 articles available, 0 articles not available (based on funding mandates)

Co-authors

Allan Dafoe	Principal Scientist (Director), Google DeepMind	Verified email at google.com
Jonas Schuett	Senior Research Fellow, Centre for the Governance of AI, Oxford, UK	Verified email at governance.ai
Jade Leung	Researcher, Centre for the Governance of AI, Future of Humanity Institute, University of Oxford	Verified email at governance.ai
Ben Garfinkel	Director, Centre for the Governance of AI; Research Fellow, University of Oxford	Verified email at philosophy.ox.ac.uk
Shahar Avin	University of Cambridge	Verified email at cam.ac.uk
Lennart Heim	Centre for the Governance of AI	Verified email at governance.ai
Noemi Dreksler	Centre for the Governance of AI	Verified email at governance.ai
Emma Bluemke	University of Oxford	Verified email at dtc.ox.ac.uk
Divya Siddarth	Microsoft Research	Verified email at microsoft.com
Baobao Zhang	Syracuse University	Verified email at syr.edu
Michael C. Horowitz	Professor of Political Science, University of Pennsylvania	Verified email at sas.upenn.edu
Robert Trager	University of Oxford	Verified email at bsg.ox.ac.uk
Jess Whittlestone	Senior Research Associate, Centre for the Study of Existential Risk, University of Cambridge	Verified email at cam.ac.uk
Haydn Belfield	University of Cambridge, Centre for the Study of Existential Risk	Verified email at cam.ac.uk
Toby Shevlane	Google DeepMind	Verified email at google.com
Anton Korinek	University of Virginia, Brookings, NBER, and CEPR	Verified email at virginia.edu
Sara Hooker	Head of Cohere For AI	Verified email at cohere.com
Carina Prunkl	Ethics Institute, Utrecht University	Verified email at uu.nl
Jan Leike	OpenAI	Verified email at openai.com
Remco Zwetsloot	Executive Director, Horizon Institute for Public Service	Verified email at georgetown.edu

Markus Anderljung

Centre for the Governance of AI

Verified email at governance.ai

AI governance, AI policy, AI forecasting


Title	Cited by	Year
Toward trustworthy AI development: mechanisms for supporting verifiable claims. M Brundage, S Avin, J Wang, H Belfield, G Krueger, G Hadfield, H Khlaaf, ... arXiv preprint arXiv:2004.07213, 2020	466	2020
Model evaluation for extreme risks. T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ... arXiv preprint arXiv:2305.15324, 2023	173	2023
Frontier AI regulation: Managing emerging risks to public safety. M Anderljung, J Barnhart, A Korinek, J Leung, C O'Keefe, J Whittlestone, ... arXiv preprint arXiv:2307.03718, 2023	148	2023
Foundational challenges in assuring alignment and safety of large language models. U Anwar, A Saparov, J Rando, D Paleka, M Turpin, P Hase, ES Lubana, ... arXiv preprint arXiv:2404.09932, 2024	136	2024
Ethics and governance of artificial intelligence: Evidence from a survey of machine learning researchers. B Zhang, M Anderljung, L Kahn, N Dreksler, MC Horowitz, A Dafoe. Journal of Artificial Intelligence Research 71, 591–666, 2021	88	2021
The Brussels effect and artificial intelligence: How EU regulation will impact the global AI market. C Siegmann, M Anderljung. arXiv preprint arXiv:2208.12645, 2022	72	2022
Institutionalizing ethics in AI through broader impact requirements. CEA Prunkl, C Ashurst, M Anderljung, H Webb, J Leike, A Dafoe. Nature Machine Intelligence 3 (2), 104-110, 2021	72	2021
Filling gaps in trustworthy development of AI. NZ Shahar Avin, Haydn Belfield, Miles Brundage, Gretchen Krueger, Jasmine ... Science 374 (6573), pp. 1327-1329, 2021	53	2021
Computing power and the governance of artificial intelligence. G Sastry, L Heim, H Belfield, M Anderljung, M Brundage, J Hazell, ... arXiv preprint arXiv:2402.08797, 2024	50	2024
Towards best practices in AGI safety and governance: A survey of expert opinion. J Schuett, N Dreksler, M Anderljung, D McCaffary, L Heim, E Bluemke, ... arXiv preprint arXiv:2305.07153, 2023	50	2023
Open-sourcing highly capable foundation models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives. E Seger, N Dreksler, R Moulange, E Dardaman, J Schuett, K Wei, ... arXiv preprint arXiv:2311.09227, 2023	45	2023
Protecting society from AI misuse: when are restrictions on capabilities warranted? M Anderljung, J Hazell, M von Knebel. AI & SOCIETY, 1-17, 2024	38	2024
Forecasting AI progress: Evidence from a survey of machine learning researchers. B Zhang, N Dreksler, M Anderljung, L Kahn, C Giattino, A Dafoe, ... arXiv preprint arXiv:2206.04132, 2022	32	2022
Open problems in technical ai governance. A Reuel, B Bucknall, S Casper, T Fist, L Soder, O Aarne, L Hammond, ... arXiv preprint arXiv:2407.14981, 2024	28	2024
Towards publicly accountable frontier LLMs: Building an external scrutiny ecosystem under the ASPIRE framework. M Anderljung, ET Smith, J O'Brien, L Soder, B Bucknall, E Bluemke, ... arXiv preprint arXiv:2311.14711, 2023	28	2023
Visibility into AI agents. A Chan, C Ezell, M Kaufmann, K Wei, L Hammond, H Bradley, E Bluemke, ... Proceedings of the 2024 ACM Conference on Fairness, Accountability, and …, 2024	23	2024
Social and governance implications of improved data efficiency. AD Tucker, M Anderljung, A Dafoe. Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 378-384, 2020	19	2020
Model evaluation for extreme risks, 2023. T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ... URL https://arxiv.org/abs/2305.15324	17
Responsible reporting for frontier AI development. N Kolt, M Anderljung, J Barnhart, A Brass, K Esvelt, GK Hadfield, L Heim, ... Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society 7, 768-783, 2024	15	2024
A guide to writing the NeurIPS impact statement. C Ashurst, M Anderljung, C Prunkl, J Leike, Y Gal, T Shevlane, A Dafoe. Centre for the Governance of AI. URL: https://perma.cc/B5R8-2B9V, 2020	14	2020
