The reanimation of pseudoscience in machine learning and its ethical repercussions

M Andrews, A Smart, A Birhane - Patterns, 2024 - cell.com
The present perspective outlines how epistemically baseless and ethically pernicious
paradigms are recycled back into the scientific literature via machine learning (ML) and …

Weak baselines and reporting biases lead to overoptimism in machine learning for fluid-related partial differential equations

N McGreivy, A Hakim - Nature Machine Intelligence, 2024 - nature.com
One of the most promising applications of machine learning in computational physics is to
accelerate the solution of partial differential equations (PDEs). The key objective of machine …

Avoiding common machine learning pitfalls

MA Lones - Patterns, 2024 - cell.com
Mistakes in machine learning practice are commonplace and can result in loss of confidence
in the findings and products of machine learning. This tutorial outlines common mistakes that …

Consent in crisis: The rapid decline of the AI data commons

S Longpre, R Mahari, A Lee, C Lund… - …, 2024 - proceedings.neurips.cc
General-purpose artificial intelligence (AI) systems are built on massive swathes of public
web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge …

How to avoid machine learning pitfalls: a guide for academic researchers

MA Lones - arXiv preprint arXiv:2108.02497, 2021 - arxiv.org
Mistakes in machine learning practice are commonplace, and can result in a loss of
confidence in the findings and products of machine learning. This guide outlines common …

A benchmark dataset for machine learning in ecotoxicology

C Schür, L Gasser, F Perez-Cruz, K Schirmer… - Scientific Data, 2023 - nature.com
The use of machine learning for predicting ecotoxicological outcomes is promising, but
underutilized. The curation of data with informative features requires both expertise in …

How can we make sound replication decisions?

CP Davis-Stober, A Sarafoglou, B Aczel… - Proceedings of the …, 2025 - pnas.org
Replication and the reported crises impacting many fields of research have become a focal
point for the sciences. This has led to reforms in publishing, methodological design and …

The responsible foundation model development cheatsheet: A review of tools & resources

S Longpre, S Biderman, A Albalak… - arXiv preprint arXiv …, 2024 - arxiv.org
Foundation model development attracts a rapidly expanding body of contributors, scientists,
and applications. To help shape responsible development practices, we introduce the …

A review of model evaluation metrics for machine learning in genetics and genomics

C Miller, T Portlock, DM Nyaga… - Frontiers in …, 2024 - frontiersin.org
Machine learning (ML) has shown great promise in genetics and genomics where large and
complex datasets have the potential to provide insight into many aspects of disease risk …