A review on longitudinal data analysis with random forest
J Hu, S Szymczak - Briefings in Bioinformatics, 2023 - academic.oup.com
In longitudinal studies variables are measured repeatedly over time, leading to clustered
and correlated observations. If the goal of the study is to develop prediction models …
and correlated observations. If the goal of the study is to develop prediction models …
Fifty years of classification and regression trees
WY Loh - International Statistical Review, 2014 - Wiley Online Library
Fifty years have passed since the publication of the first regression tree algorithm. New
techniques have added capabilities that far surpass those of the early methods. Modern …
techniques have added capabilities that far surpass those of the early methods. Modern …
Statistical solutions for error and bias in global citizen science datasets
Networks of citizen scientists (CS) have the potential to observe biodiversity and species
distributions at global scales. Yet the adoption of such datasets in conservation science may …
distributions at global scales. Yet the adoption of such datasets in conservation science may …
Comparing random forest with logistic regression for predicting class-imbalanced civil war onset data
The most commonly used statistical models of civil war onset fail to correctly predict most
occurrences of this rare event in out-of-sample data. Statistical methods for the analysis of …
occurrences of this rare event in out-of-sample data. Statistical methods for the analysis of …
Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees
Identification of subgroups of patients for whom treatment A is more effective than treatment
B, and vice versa, is of key importance to the development of personalized medicine. Tree …
B, and vice versa, is of key importance to the development of personalized medicine. Tree …
Analysing the impact of multiple stressors in aquatic biomonitoring data: A 'cookbook'with applications in R
Multiple stressors threaten biodiversity and ecosystem integrity, imposing new challenges to
ecosystem management and restoration. Ecosystem managers are required to address and …
ecosystem management and restoration. Ecosystem managers are required to address and …
Mixed-effects random forest for clustered data
This paper presents an extension of the random forest (RF) method to the case of clustered
data. The proposed 'mixed-effects random forest'(MERF) is implemented using a standard …
data. The proposed 'mixed-effects random forest'(MERF) is implemented using a standard …
Use and performance of machine learning models for type 2 diabetes prediction in community settings: A systematic review and meta-analysis
Objective We aimed to identify machine learning (ML) models for type 2 diabetes (T2DM)
prediction in community settings and determine their predictive performance. Method …
prediction in community settings and determine their predictive performance. Method …
Exploratory study on classification of diabetes mellitus through a combined Random Forest Classifier
X Wang, M Zhai, Z Ren, H Ren, M Li, D Quan… - BMC medical informatics …, 2021 - Springer
Abstract Background Diabetes Mellitus (DM) has become the third chronic non-
communicable disease that hits patients after tumors, cardiovascular and cerebrovascular …
communicable disease that hits patients after tumors, cardiovascular and cerebrovascular …
[HTML][HTML] A random forest method with feature selection for develo** medical prediction models with clustered and longitudinal data
JL Speiser - Journal of biomedical informatics, 2021 - Elsevier
Background Machine learning methodologies are gaining popularity for develo** medical
prediction models for datasets with a large number of predictors, particularly in the setting of …
prediction models for datasets with a large number of predictors, particularly in the setting of …