A survey on missing data in machine learning

T Emmanuel, T Maupong, D Mpoeleng, T Semong… - Journal of Big …, 2021 - Springer
Abstract Machine learning has been the corner stone in analysing and extracting information
from data and often a problem of missing values is encountered. Missing values occur …

Reforms: Consensus-based recommendations for machine-learning-based science

S Kapoor, EM Cantrell, K Peng, TH Pham, CA Bail… - Science …, 2024 - science.org
Machine learning (ML) methods are proliferating in scientific research. However, the
adoption of these methods has been accompanied by failures of validity, reproducibility, and …

The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification

D Chicco, G Jurman - BioData Mining, 2023 - Springer
Binary classification is a common task for which machine learning and computational
statistics are used, and the area under the receiver operating characteristic curve (ROC …

Want to model a species niche? A step-by-step guideline on correlative ecological niche modelling

N Sillero, S Arenas-Castro, U Enriquez‐Urzelai… - Ecological …, 2021 - Elsevier
The use of correlative ecological niche models has highly increased in the last decade.
Despite all literature and textbooks in this field, few practical guidelines exist on the correct …

Species distribution models rarely predict the biology of real populations

J A. Lee‐Yaw, J L. McCune, S Pironon, S N. Sheth - Ecography, 2022 - Wiley Online Library
Species distribution models (SDMs) are widely used in ecology. In theory, SDMs capture (at
least part of) species' ecological niches and can be used to make inferences about the …

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

D Chicco, G Jurman - BMC genomics, 2020 - Springer
Background To evaluate binary classifications and their confusion matrices, scientific
researchers can employ several statistical rates, accordingly to the goal of the experiment …

[HTML][HTML] Accuracy assessment in convolutional neural network-based deep learning remote sensing studies—Part 1: Literature review

AE Maxwell, TA Warner, LA Guillén - Remote Sensing, 2021 - mdpi.com
Convolutional neural network (CNN)-based deep learning (DL) is a powerful, recently
developed image classification approach. With origins in the computer vision and image …

The global distribution of known and undiscovered ant biodiversity

JM Kass, B Guénard, KL Dudley, CN Jenkins… - Science …, 2022 - science.org
Invertebrates constitute the majority of animal species and are critical for ecosystem
functioning and services. Nonetheless, global invertebrate biodiversity patterns and their …

Top ten hazards to avoid when modeling species distributions: a didactic guide of assumptions, problems, and recommendations

M Soley‐Guardia, DF Alvarado‐Serrano… - Ecography, 2024 - Wiley Online Library
Species distribution models, also known as ecological niche models or habitat suitability
models, have become commonplace for addressing fundamental and applied biodiversity …

Impact of 2019–2020 mega-fires on Australian fauna habitat

M Ward, AIT Tulloch, JQ Radford, BA Williams… - Nature Ecology & …, 2020 - nature.com
Abstract Australia's 2019–2020 mega-fires were exacerbated by drought, anthropogenic
climate change and existing land-use management. Here, using a combination of remotely …