Jackson Kernion - Google Scholar

Cited by

	All	Since 2020
Citations	4020	4018
h-index	5	5
i10-index	5	5

Citations per year

Year	Citations
2022	50
2023	1014
2024	2652
2025	295

Jackson Kernion

Anthropic

Verified email at anthropic.com

Language models, Philosophy of Mind, Epistemology


Title	Cited by	Year
Training a helpful and harmless assistant with reinforcement learning from human feedback. Y Bai, A Jones, K Ndousse, A Askell, A Chen, N DasSarma, D Drain, ... arXiv preprint arXiv:2204.05862, 2022	1671	2022
Constitutional AI: Harmlessness from AI feedback. Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ... arXiv preprint arXiv:2212.08073, 2022	1294	2022
Red teaming language models to reduce harms: Methods, scaling behaviors, and lessons learned. D Ganguli, L Lovitt, J Kernion, A Askell, Y Bai, S Kadavath, B Mann, ... arXiv preprint arXiv:2209.07858, 2022	479	2022
A general language assistant as a laboratory for alignment. A Askell, Y Bai, A Chen, D Drain, D Ganguli, T Henighan, A Jones, ... arXiv preprint arXiv:2112.00861, 2021	394	2021
Language models (mostly) know what they know. S Kadavath, T Conerly, A Askell, T Henighan, D Drain, E Perez, ... arXiv preprint arXiv:2207.05221, 2022	182	2022
Strange Experience: Why Experience Without Access Makes No Sense. J Kernion, UC Berkeley		2017

Articles 1–6