Long-tail learning via logit adjustment

AK Menon, S Jayasumana, AS Rawat, H Jain… - arxiv preprint arxiv …, 2020 - arxiv.org
Real-world classification problems typically exhibit an imbalanced or long-tailed label
distribution, wherein many labels are associated with only a few samples. This poses a …

Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems

KA Lê Cao, S Boitard, P Besse - BMC bioinformatics, 2011 - Springer
Background Variable selection on high throughput biological data, such as gene expression
or single nucleotide polymorphisms (SNPs), becomes inevitable to select relevant …

Sure independence screening for ultrahigh dimensional feature space

J Fan, J Lv - Journal of the Royal Statistical Society Series B …, 2008 - academic.oup.com
Variable selection plays an important role in high dimensional statistical modelling which
nowadays appears in many areas and is key to various scientific discoveries. For problems …

Epileptic seizure predictors based on computational intelligence techniques: A comparative study with 278 patients

CA Teixeira, B Direito, M Bandarabadi… - Computer methods and …, 2014 - Elsevier
The ability of computational intelligence methods to predict epileptic seizures is evaluated in
long-term EEG recordings of 278 patients suffering from pharmaco-resistant partial epilepsy …

Classifying conduct disorder using a biopsychosocial model and machine learning method

L Chan, C Simmons, S Tillem, M Conley… - Biological Psychiatry …, 2023 - Elsevier
Background Conduct disorder (CD) is a common syndrome with far-reaching effects. Risk
factors for the development of CD span social, psychological, and biological domains …

On the statistical consistency of algorithms for binary classification under class imbalance

A Menon, H Narasimhan, S Agarwal… - … on Machine Learning, 2013 - proceedings.mlr.press
Class imbalance situations, where one class is rare compared to the other, arise frequently
in machine learning applications. It is well known that the usual misclassification error is ill …

[HTML][HTML] EEG epileptic seizure detection and classification based on dual-tree complex wavelet transform and machine learning algorithms

IB Slimen, L Boubchir, Z Mbarki… - Journal of biomedical …, 2020 - ncbi.nlm.nih.gov
The visual analysis of common neurological disorders such as epileptic seizures in
electroencephalography (EEG) is an oversensitive operation and prone to errors, which has …

A generalized Fellegi–Sunter framework for multiple record linkage with application to homicide record systems

M Sadinle, SE Fienberg - Journal of the American Statistical …, 2013 - Taylor & Francis
We present a probabilistic method for linking multiple datafiles. This task is not trivial in the
absence of unique identifiers for the individuals recorded. This is a common scenario when …

Weighted distance weighted discrimination and its asymptotic properties

X Qiao, HH Zhang, Y Liu, MJ Todd… - Journal of the American …, 2010 - Taylor & Francis
While Distance Weighted Discrimination (DWD) is an appealing approach to classification in
high dimensions, it was designed for balanced datasets. In the case of unequal costs …