[HTML][HTML] A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation
Class imbalance (CI) in classification problems arises when the number of observations
belonging to one class is lower than the other. Ensemble learning combines multiple models …
belonging to one class is lower than the other. Ensemble learning combines multiple models …
Tabular data: Deep learning is not all you need
A key element in solving real-life data science problems is selecting the types of models to
use. Tree ensemble models (such as XGBoost) are usually recommended for classification …
use. Tree ensemble models (such as XGBoost) are usually recommended for classification …
A performance-driven benchmark for feature selection in tabular deep learning
Academic tabular benchmarks often contain small sets of curated features. In contrast, data
scientists typically collect as many features as possible into their datasets, and even …
scientists typically collect as many features as possible into their datasets, and even …
Switchtab: Switched autoencoders are effective tabular learners
Self-supervised representation learning methods have achieved significant success in
computer vision and natural language processing (NLP), where data samples exhibit explicit …
computer vision and natural language processing (NLP), where data samples exhibit explicit …
Recontab: Regularized contrastive representation learning for tabular data
Representation learning stands as one of the critical machine learning techniques across
various domains. Through the acquisition of high-quality features, pre-trained embeddings …
various domains. Through the acquisition of high-quality features, pre-trained embeddings …
iPiDA-GCN: Identification of piRNA-disease associations based on Graph Convolutional Network
J Hou, H Wei, B Liu - PLOS Computational Biology, 2022 - journals.plos.org
Motivation Piwi-interacting RNAs (piRNAs) play a critical role in the progression of various
diseases. Accurately identifying the associations between piRNAs and diseases is important …
diseases. Accurately identifying the associations between piRNAs and diseases is important …
The diversified ensemble neural network
Ensemble is a general way of improving the accuracy and stability of learning models,
especially for the generalization ability on small datasets. Compared with tree-based …
especially for the generalization ability on small datasets. Compared with tree-based …
Attention versus contrastive learning of tabular data: a data-centric benchmarking
Despite groundbreaking success in image and text learning, deep learning has not
achieved significant improvements against traditional machine learning (ML) on tabular …
achieved significant improvements against traditional machine learning (ML) on tabular …
Transfer learning with deep tabular models
Recent work on deep learning for tabular data demonstrates the strong performance of deep
tabular models, often bridging the gap between gradient boosted decision trees and neural …
tabular models, often bridging the gap between gradient boosted decision trees and neural …
Precision machine learning
We explore unique considerations involved in fitting machine learning (ML) models to data
with very high precision, as is often required for science applications. We empirically …
with very high precision, as is often required for science applications. We empirically …