Lawrence Chan
PhD Student, UC Berkeley
Verified email at berkeley.edu
Title
Cited by
Year
Progress measures for grokking via mechanistic interpretability
N Nanda, L Chan, T Lieberum, J Smith, J Steinhardt
ICLR 2023, 2023
Cited by 372*, 2023
The alignment problem from a deep learning perspective
R Ngo, L Chan, S Mindermann
ICLR 2024, 2022
Cited by 214*, 2022
A toy model of universality: Reverse engineering how networks learn group operations
B Chughtai, L Chan, N Nanda
ICML 2023, 2023
Cited by 91, 2023
Causal Scrubbing: a method for rigorously testing interpretability hypotheses
L Chan, A Garriga-Alonso, N Goldowsky-Dill, R Greenblatt, ...
https://www.alignmentforum.org/posts/JvZhhzycHu2Yd57RN/causal-scrubbing-a …, 2022
Cited by 66, 2022
The assistive multi-armed bandit
L Chan, D Hadfield-Menell, S Srinivasa, A Dragan
2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI …, 2019
Cited by 57, 2019
Adversarial Training for High-Stakes Reliability
DM Ziegler, S Nix, L Chan, T Bauman, P Schmidt-Nielsen, T Lin, ...
NeurIPS 2022, 2022
Cited by 55, 2022
Evaluating Language-Model Agents on Realistic Autonomous Tasks
M Kinniment, LJK Sato, H Du, B Goodrich, M Hasin, L Chan, LH Miles, ...
https://evals.alignment.org/Evaluating_LMAs_Realistic_Tasks.pdf, 2023
Cited by 42, 2023
Benefits of assistance over reward learning
R Shah, P Freire, N Alex, R Freedman, D Krasheninnikov, L Chan, ...
Cited by 36, 2020
Remote corneal suturing wet lab: microsurgical education during the COVID-19 pandemic
ND Pasricha, Z Haq, TR Ahmad, L Chan, TK Redd, GD Seitzman, ...
Journal of Cataract & Refractive Surgery 46 (12), 1667-1673, 2020
Cited by 34, 2020
Human irrationality: both bad and good for reward inference
L Chan, A Critch, A Dragan
arXiv preprint arXiv:2111.06956, 2021
Cited by 27, 2021
Optimal cost design for model predictive control
A Jain, L Chan, DS Brown, AD Dragan
Learning for Dynamics and Control, 1205-1217, 2021
Cited by 24, 2021
The alignment problem from a deep learning perspective, 2024
R Ngo, L Chan, S Mindermann
URL https://arxiv.org/abs/2209.00626
Cited by 11
Language models are better than humans at next-token prediction
B Shlegeris, F Roger, L Chan, E McLean
Transactions of Machine Learning Research (TMLR), 2022
Cited by 9, 2022
Mathematical models of computation in superposition
K Hänni, J Mendel, D Vaintrob, L Chan
arXiv preprint arXiv:2408.05451, 2024
Cited by 6, 2024
Risk factors predicting loss to follow-up, medication noncompliance, and poor visual outcomes among patients with infectious keratitis at a public county hospital
JB Lopez, L Chan, M Saifee, S Padmanabhan, M Yung, MF Chan
Cornea 42 (9), 1069-1073, 2023
Cited by 6, 2023
Compact proofs of model performance via mechanistic interpretability
J Gross, R Agrawal, T Kwa, E Ong, CH Yip, A Gibson, S Noubir, L Chan
arXiv preprint arXiv:2406.11779, 2024
Cited by 4, 2024
Minor tobacco alkaloids as biomarkers to distinguish combusted tobacco use from Electronic Nicotine Delivery Systems use. Two new analytical methods
P Jacob, L Chan, P Cheung, K Bello, L Yu, G StHelen, NL Benowitz
Frontiers in Chemistry 10, 749089, 2022
Cited by 4, 2022
Spontaneous hyphema in the setting of COVID-19 pneumonia
J Chiang, L Chan, JY Stallworth, MF Chan
American Journal of Ophthalmology Case Reports 26, 101447, 2022
Cited by 2, 2022
RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts
H Wijk, T Lin, J Becker, S Jawhar, N Parikh, T Broadley, L Chan, M Chen, ...
arXiv preprint arXiv:2411.15114, 2024
Cited by 1, 2024
Characterization of Polymicrobial and Antibiotic-Resistant Infectious Keratitis in a County Hospital Setting
L Chan, JB Lopez, M Saifee, S Padmanabhan, MF Chan, M Yung
Cornea Open 2 (3), e0016, 2023
Cited by 1, 2023