When less is more: sketching with minimizers in genomics

M Ndiaye, S Prieto-Baños, LM Fitzgerald… - Genome Biology, 2024 - Springer
The exponential increase in sequencing data calls for conceptual and computational
advances to extract useful biological insights. One such advance, minimizers, allows for …

Towards Foundation Models: Evaluation of Geoscience Artificial Intelligence with Uncertainty

S Myren, N Parikh, R Rael, G Flynn, D Higdon… - arXiv preprint arXiv …, 2025 - arxiv.org
Artificial intelligence (AI) has transformed the geoscience community with deep learning
models (DLMs) that are trained to complete specific tasks within workflows. This success has …

The Impact of Train-Test Leakage on Machine Learning-based Android Malware Detection

G Liu, D Caragea, X Ou, S Roy - arXiv preprint arXiv:2410.19364, 2024 - arxiv.org
When machine learning is used for Android malware detection, an app needs to be
represented in a numerical format for training and testing. We identify a widespread …