Lawrence Chan

عدد مرات الاقتباسات

	الكل	قبل 2020
اقتباسات	1021	1017
h-index	11	11
i10-index	11	11

600

300

150

450

20202021202220232024202511 25 48 292 584 52

عدد المنشورات المتاحة للجميع

عرض المجموعة جميعها

7 مقالات

0 مقالة

المقالات البحثية المتاحة للجميع

المقالات البحثية غير المتاحة للجميع

تمّ اختيار المعلومات استنادًا إلى تفويضات التمويل

المؤلفون المشاركون

Neel NandaMechanistic Interpretability Team Lead, Google DeepMindبريد إلكتروني تم التحقق منه على deepmind.com
Anca D DraganAssistant Professor at UC Berkeley // Director, AI Safety and Alignment, Google DeepMindبريد إلكتروني تم التحقق منه على berkeley.edu
Sören MindermannUniversity of Oxford, OATMLبريد إلكتروني تم التحقق منه على cs.ox.ac.uk
Buck ShlegerisCEO, Redwood Researchبريد إلكتروني تم التحقق منه على rdwrs.com
Jacob SteinhardtStanford Universityبريد إلكتروني تم التحقق منه على cs.stanford.edu
Tom LieberumGoogle DeepMindبريد إلكتروني تم التحقق منه على deepmind.com
Bilal ChughtaiIndependentبريد إلكتروني تم التحقق منه على cam.ac.uk
Richard NgoOpenAIبريد إلكتروني تم التحقق منه على openai.com
Lucas Jun Koba SatoModel Evaluation and Threat Research (METR)بريد إلكتروني تم التحقق منه على metr.org
Brian GoodrichModel Evaluation & Threat Researchبريد إلكتروني تم التحقق منه على evals.alignment.org
Elizabeth BarnesMETRبريد إلكتروني تم التحقق منه على metr.org
Megan KinnimentMETRبريد إلكتروني تم التحقق منه على evals.alignment.org
Haoxing DuWindBorne Systemsبريد إلكتروني تم التحقق منه على berkeley.edu
Paul ChristianoNational Institute of Standards and Technologyبريد إلكتروني تم التحقق منه على nist.gov
Nicholas Goldowsky-DillMember of Technical Staff, Redwood Researchبريد إلكتروني تم التحقق منه على rdwrs.com
Adrià Garriga-AlonsoResearch Scientist, FAR AIبريد إلكتروني تم التحقق منه على far.ai
Andrew CritchUC Berkeley, Department of Electrical Engineering and Computer Sciencesبريد إلكتروني تم التحقق منه على eecs.berkeley.edu
Dylan Hadfield-MenellMassachusetts Institute of Technologyبريد إلكتروني تم التحقق منه على csail.mit.edu
Siddhartha SrinivasaProfessor, University of Washingtonبريد إلكتروني تم التحقق منه على cs.washington.edu
Adam ScherlisInterpretability Researcher, EleutherAIبريد إلكتروني تم التحقق منه على scherlis.com

متابعة

Lawrence Chan

PhD Student, UC Berkeley

بريد إلكتروني تم التحقق منه على berkeley.edu

AI Alignment Interpretability Reward Learning


عنوان ترتيب حسب الاقتباسات ترتيب حسب السنة الترتيب حسب العنوان	عدد مرات الاقتباسات عدد مرات الاقتباسات	السنة
Progress measures for grokking via mechanistic interpretability‏ N Nanda, L Chan, T Liberum, J Smith, J Steinhardt‏ ICLR 2023, 2023‏	355	2023
The alignment problem from a deep learning perspective‏ R Ngo, L Chan, S Mindermann‏ ICLR 2024, 2022‏	209*	2022
A toy model of universality: Reverse engineering how networks learn group operations‏ B Chughtai, L Chan, N Nanda‏ ICML 2023, 2023‏	91	2023
Causal Scrubbing: a method for rigorously testing interpretability hypotheses‏ L Chan, A Garriga-Alonso, N Goldowsky-Dill, R Greenblatt, ...‏ https://www.alignmentforum.org/posts/JvZhhzycHu2Yd57RN/causal-scrubbing-a …, 2022‏	64	2022
The assistive multi-armed bandit‏ L Chan, D Hadfield-Menell, S Srinivasa, A Dragan‏ 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI …, 2019‏	55	2019
Adversarial Training for High-Stakes Reliability‏ DM Ziegler, S Nix, L Chan, T Bauman, P Schmidt-Nielsen, T Lin, ...‏ NeurIPS 2022, 2022‏	53	2022
Evaluating Language-Model Agents on Realistic Autonomous Tasks‏ M Kinniment, LJK Sato, H Du, B Goodrich, M Hasin, L Chan, LH Miles, ...‏ https://evals.alignment.org/Evaluating_LMAs_Realistic_Tasks.pdf, 2023‏	38	2023
Benefits of assistance over reward learning‏ R Shah, P Freire, N Alex, R Freedman, D Krasheninnikov, L Chan, ...‏	36	2020
Remote corneal suturing wet lab: microsurgical education during the COVID-19 pandemic‏ ND Pasricha, Z Haq, TR Ahmad, L Chan, TK Redd, GD Seitzman, ...‏ Journal of Cataract & Refractive Surgery 46 (12), 1667-1673, 2020‏	33	2020
Human irrationality: both bad and good for reward inference‏ L Chan, A Critch, A Dragan‏ arXiv preprint arXiv:2111.06956, 2021‏	25	2021
Optimal cost design for model predictive control‏ A Jain, L Chan, DS Brown, AD Dragan‏ Learning for Dynamics and Control, 1205-1217, 2021‏	23	2021
The Alignment Problem from a Deep Learning Perspective (2022)‏ R Ngo, L Chan, S Mindermann‏ URL https://arxiv. org/abs/2209.00626.[Accessed: 4th May 2023], 3, 0‏	9
Language models are better than humans at next-token prediction‏ B Shlegeris, F Roger, L Chan, E McLean‏ Transactions of Machine Learning Research (TMLR), 2022‏	8	2022
Risk factors predicting loss to follow-up, medication noncompliance, and poor visual outcomes among patients with infectious keratitis at a public county hospital‏ JB Lopez, L Chan, M Saifee, S Padmanabhan, M Yung, MF Chan‏ Cornea 42 (9), 1069-1073, 2023‏	6	2023
Minor tobacco alkaloids as biomarkers to distinguish combusted tobacco use from Electronic Nicotine Delivery Systems use. two new analytical methods‏ P Jacob, L Chan, P Cheung, K Bello, L Yu, G StHelen, NL Benowitz‏ Frontiers in Chemistry 10, 749089, 2022‏	5	2022
Mathematical models of computation in superposition‏ K Hänni, J Mendel, D Vaintrob, L Chan‏ arXiv preprint arXiv:2408.05451, 2024‏	4	2024
Compact proofs of model performance via mechanistic interpretability‏ J Gross, R Agrawal, T Kwa, E Ong, CH Yip, A Gibson, S Noubir, L Chan‏ The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024‏	2	2024
Spontaneous hyphema in the setting of COVID-19 pneumonia‏ J Chiang, L Chan, JY Stallworth, MF Chan‏ American Journal of Ophthalmology Case Reports 26, 101447, 2022‏	2	2022
Re-bench: Evaluating frontier ai r&d capabilities of language model agents against human experts‏ H Wijk, T Lin, J Becker, S Jawhar, N Parikh, T Broadley, L Chan, M Chen, ...‏ arXiv preprint arXiv:2411.15114, 2024‏	1	2024
Characterization of Polymicrobial and Antibiotic-Resistant Infectious Keratitis in a County Hospital Setting‏ L Chan, JB Lopez, M Saifee, S Padmanabhan, MF Chan, M Yung‏ Cornea open 2 (3), e0016, 2023‏	1	2023

يتعذر على النظام إجراء العملية في الوقت الحالي. عاود المحاولة لاحقًا.

مقالات 1–20

عدد الاقتباسات في العام

اقتباسات مكررة

الاقتباسات المدمجة

إضافة مؤلفين مشاركينالمؤلفون المشاركون

متابعة

عدد مرات الاقتباسات

المؤلفون المشاركون