Data analysis strategies for targeted and untargeted LC-MS metabolomic studies: Overview and workflow

E Gorrochategui, J Jaumot, S Lacorte… - TrAC Trends in Analytical …, 2016 - Elsevier
Data analysis is a very challenging task in LC-MS metabolomic studies. The use of powerful
analytical techniques (eg, high-resolution mass spectrometry) provides high-dimensional …

[HTML][HTML] Clinical concept extraction: a methodology review

S Fu, D Chen, H He, S Liu, S Moon, KJ Peterson… - Journal of biomedical …, 2020 - Elsevier
Background Concept extraction, a subdomain of natural language processing (NLP) with a
focus on extracting concepts of interest, has been adopted to computationally extract clinical …

Robust prediction of response to immune checkpoint blockade therapy in metastatic melanoma

N Auslander, G Zhang, JS Lee, DT Frederick, B Miao… - Nature medicine, 2018 - nature.com
Immune checkpoint blockade (ICB) therapy provides remarkable clinical gains and has
been very successful in treatment of melanoma. However, only a subset of patients with …

[PDF][PDF] Applied predictive modeling

M Kuhn - 2013 - mathematics.foi.hr
This is a book on data analysis with a specific focus on the practice of predictive modeling.
The term predictive modeling may stir associations such as machine learning, pattern …

Toward more realistic drug–target interaction predictions

T Pahikkala, A Airola, S Pietilä… - Briefings in …, 2015 - academic.oup.com
A number of supervised machine learning models have recently been introduced for the
prediction of drug–target interactions based on chemical structure and genomic sequence …

Gene selection and classification of microarray data using random forest

R Díaz-Uriarte, S Alvarez de Andrés - BMC bioinformatics, 2006 - Springer
Background Selection of relevant genes for sample classification is a common task in most
gene expression studies, where researchers try to identify the smallest possible set of genes …

Bias in error estimation when using cross-validation for model selection

S Varma, R Simon - BMC bioinformatics, 2006 - Springer
Background Cross-validation (CV) is an effective method for estimating the prediction error
of a classifier. Some recent articles have proposed methods for optimizing classifiers by …

Gene expression-based classification of non-small cell lung carcinomas and survival prediction

J Hou, J Aerts, B Den Hamer, W Van Ijcken… - PloS one, 2010 - journals.plos.org
Background Current clinical therapy of non-small cell lung cancer depends on histo-
pathological classification. This approach poorly predicts clinical outcome for individual …

Prediction error estimation: a comparison of resampling methods

AM Molinaro, R Simon, RM Pfeiffer - Bioinformatics, 2005 - academic.oup.com
Motivation: In genomic studies, thousands of features are collected on relatively few
samples. One of the goals of these studies is to build classifiers to predict the outcome of …

Consensus features nested cross-validation

S Parvandeh, HW Yeh, MP Paulus… - Bioinformatics, 2020 - academic.oup.com
Feature selection can improve the accuracy of machine-learning models, but appropriate
steps must be taken to avoid overfitting. Nested cross-validation (nCV) is a common …