A review on longitudinal data analysis with random forest

J Hu, S Szymczak - Briefings in Bioinformatics, 2023 - academic.oup.com
In longitudinal studies variables are measured repeatedly over time, leading to clustered
and correlated observations. If the goal of the study is to develop prediction models …

Fifty years of classification and regression trees

WY Loh - International Statistical Review, 2014 - Wiley Online Library
Fifty years have passed since the publication of the first regression tree algorithm. New
techniques have added capabilities that far surpass those of the early methods. Modern …

Statistical solutions for error and bias in global citizen science datasets

TJ Bird, AE Bates, JS Lefcheck, NA Hill… - Biological …, 2014 - Elsevier
Networks of citizen scientists (CS) have the potential to observe biodiversity and species
distributions at global scales. Yet the adoption of such datasets in conservation science may …

Comparing random forest with logistic regression for predicting class-imbalanced civil war onset data

D Muchlinski, D Siroky, J He, M Kocher - Political Analysis, 2016 - cambridge.org
The most commonly used statistical models of civil war onset fail to correctly predict most
occurrences of this rare event in out-of-sample data. Statistical methods for the analysis of …

Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees

M Fokkema, N Smits, A Zeileis, T Hothorn… - Behavior research …, 2018 - Springer
Identification of subgroups of patients for whom treatment A is more effective than treatment
B, and vice versa, is of key importance to the development of personalized medicine. Tree …

Analysing the impact of multiple stressors in aquatic biomonitoring data: A 'cookbook'with applications in R

CK Feld, P Segurado, C Gutiérrez-Cánovas - Science of the Total …, 2016 - Elsevier
Multiple stressors threaten biodiversity and ecosystem integrity, imposing new challenges to
ecosystem management and restoration. Ecosystem managers are required to address and …

Mixed-effects random forest for clustered data

A Hajjem, F Bellavance, D Larocque - Journal of Statistical …, 2014 - Taylor & Francis
This paper presents an extension of the random forest (RF) method to the case of clustered
data. The proposed 'mixed-effects random forest'(MERF) is implemented using a standard …

Use and performance of machine learning models for type 2 diabetes prediction in community settings: A systematic review and meta-analysis

K De Silva, WK Lee, A Forbes, RT Demmer… - International journal of …, 2020 - Elsevier
Objective We aimed to identify machine learning (ML) models for type 2 diabetes (T2DM)
prediction in community settings and determine their predictive performance. Method …

Exploratory study on classification of diabetes mellitus through a combined Random Forest Classifier

X Wang, M Zhai, Z Ren, H Ren, M Li, D Quan… - BMC medical informatics …, 2021 - Springer
Abstract Background Diabetes Mellitus (DM) has become the third chronic non-
communicable disease that hits patients after tumors, cardiovascular and cerebrovascular …

[HTML][HTML] A random forest method with feature selection for develo** medical prediction models with clustered and longitudinal data

JL Speiser - Journal of biomedical informatics, 2021 - Elsevier
Background Machine learning methodologies are gaining popularity for develo** medical
prediction models for datasets with a large number of predictors, particularly in the setting of …