Inferring the effectiveness of government interventions against COVID-19 J Brauner*, S Mindermann*, M Sharma*, D Johnston, J Salvatier, ... Science 371 (6531), 2021 | 1161 | 2021 |
Understanding the effectiveness of government interventions against the resurgence of COVID-19 in Europe M Sharma*, S Mindermann*, C Rogers-Smith, G Leech, B Snodin, J Ahuja, ... Nature Communications 12 (1), 1-13, 2021 | 251 | 2021 |
Managing extreme AI risks amid rapid progress Y Bengio, G Hinton, A Yao, D Song, P Abbeel, T Darrell, YN Harari, ... Science 384 (6698), 842-845, 2024 | 245* | 2024 |
The alignment problem from a deep learning perspective R Ngo, L Chan, S Mindermann International Conference on Learning Representations, 2024 | 211 | 2024 |
Prioritized training on points that are learnable, worth learning, and not yet learned S Mindermann*, M Razzak*, W Xu, A Kirsch, M Sharma, A Morisot, ... ICML, 2022 | 144 | 2022 |
Occam's razor is insufficient to infer the preferences of irrational agents S Armstrong*, S Mindermann* NeurIPS, 2018 | 136* | 2018 |
Changing composition of SARS-CoV-2 lineages and rise of Delta variant in England S Mishra*, S Mindermann*, M Sharma*, C Whittaker*, TA Mellan, T Wilton, ... EClinicalMedicine - The Lancet 39, 101064, 2021 | 129* | 2021 |
Mask wearing in community settings reduces SARS-CoV-2 transmission G Leech, C Rogers-Smith, JT Monrad, JB Sandbrink, B Snodin, R Zinkov, ... Proceedings of the National Academy of Sciences 119 (23), e2119266119, 2022 | 116* | 2022 |
Is the cure really worse than the disease? The health impacts of lockdowns during COVID-19 G Meyerowitz-Katz, S Bhatt, O Ratmann, JM Brauner, S Flaxman, ... BMJ Global Health 6 (8), e006653, 2021 | 93 | 2021 |
Identifying Causal-Effect Inference Failure with Uncertainty-Aware Models A Jesson*, S Mindermann*, U Shalit, Y Gal NeurIPS, 2020 | 89 | 2020 |
Sleeper agents: Training deceptive LLMs that persist through safety training E Hubinger, C Denison, J Mu, M Lambert, M Tong, M MacDiarmid, ... arXiv preprint arXiv:2401.05566, 2024 | 72 | 2024 |
Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding A Jesson, S Mindermann, Y Gal, U Shalit ICML, 2021 | 66 | 2021 |
Seasonal variation in SARS-CoV-2 transmission in temperate climates: A Bayesian modelling study in 143 European regions T Gavenčiak, JT Monrad, G Leech, M Sharma, S Mindermann, S Bhatt, ... PLoS Computational Biology 18 (8), e1010435, 2022 | 59 | 2022 |
Active Inverse Reward Design S Mindermann*, R Shah*, A Gleave, D Hadfield-Menell arXiv preprint arXiv:1809.03060, 2018 | 58* | 2018 |
How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions L Pacchiardi, AJ Chan, S Mindermann, I Moscovitz, AY Pan, Y Gal, ... ICLR 2024, 2023 | 50 | 2023 |
Effectiveness assessment of non-pharmaceutical interventions: lessons learned from the COVID-19 pandemic A Lison, N Banholzer, M Sharma, S Mindermann, HJT Unwin, S Mishra, ... The Lancet Public Health 8 (4), e311-e317, 2023 | 37 | 2023 |
How Robust are the Estimated Effects of Nonpharmaceutical Interventions against COVID-19? M Sharma*, S Mindermann*, J Brauner*, G Leech, A Stephenson, ... NeurIPS (Spotlight talk), 2020 | 33* | 2020 |
Inferring the effectiveness of government interventions against COVID-19 JM Brauner, S Mindermann, M Sharma, D Johnston, J Salvatier, ... Science, eabd9338, 2020 | 25 | 2020 |
Specific versus general principles for constitutional AI S Kundu, Y Bai, S Kadavath, A Askell, A Callahan, A Chen, A Goldie, ... arXiv preprint arXiv:2310.13798, 2023 | 24 | 2023 |
A dataset of non-pharmaceutical interventions on SARS-CoV-2 in Europe G Altman, J Ahuja, JT Monrad, G Dhaliwal, C Rogers-Smith, G Leech, ... Scientific Data 9 (1), 1-9, 2022 | 14 | 2022 |