Harnessing artificial intelligence to fill global shortfalls in biodiversity knowledge

LJ Pollock, J Kitzes, S Beery, KM Gaynor… - Nature Reviews …, 2025 - nature.com
Large, well described gaps exist in both what we know and what we need to know to
address the biodiversity crisis. Artificial intelligence (AI) offers new potential for filling these …

[HTML][HTML] Towards a Taxonomy Machine: A Training Set of 5.6 Million Arthropod Images

D Steinke, S Ratnasingham, J Agda, H Ait Boutou… - Data, 2024 - mdpi.com
The taxonomic identification of organisms from images is an active research area within the
machine learning community. Current algorithms are very effective for object recognition and …

Enhancing DNA Foundation Models to Address Masking Inefficiencies

M Safari, PM Arias, SC Lowe, L Kari, AX Chang… - arxiv preprint arxiv …, 2025 - arxiv.org
Masked language modelling (MLM) as a pretraining objective has been widely adopted in
genomic sequence modelling. While pretrained models can successfully serve as encoders …

G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models

M Liu, Z Gao, H Chang, SZ Li, S Shan… - arxiv preprint arxiv …, 2025 - arxiv.org
Discovering the genotype-phenotype relationship is crucial for genetic engineering, which
will facilitate advances in fields such as crop breeding, conservation biology, and …

[PDF][PDF] Loss Functions Robust to the Presence of Label Errors

N Pellegrino, D Szczecina… - … Vision and Imaging …, 2024 - openjournals.uwaterloo.ca
Methods for detecting label errors in training data require models that are robust to label
errors (ie, not fit to erroneously labelled data points). However, acquiring such models often …