Hyperparameter optimization: Foundations, algorithms, best practices, and open challenges

B Bischl, M Binder, M Lang, T Pielok… - … : Data Mining and …, 2023 - Wiley Online Library
Most machine learning algorithms are configured by a set of hyperparameters whose values
must be carefully chosen and which often considerably impact performance. To avoid a time …

CatBoost for big data: an interdisciplinary review

JT Hancock, TM Khoshgoftaar - Journal of big data, 2020 - Springer
Abstract Gradient Boosted Decision Trees (GBDT's) are a powerful tool for classification and
regression tasks in Big Data. Researchers should be familiar with the strengths and …

Why do tree-based models still outperform deep learning on typical tabular data?

L Grinsztajn, E Oyallon… - Advances in neural …, 2022 - proceedings.neurips.cc
While deep learning has enabled tremendous progress on text and image datasets, its
superiority on tabular data is not clear. We contribute extensive benchmarks of standard and …

Deep neural networks and tabular data: A survey

V Borisov, T Leemann, K Seßler, J Haug… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Heterogeneous tabular data are the most commonly used form of data and are essential for
numerous critical and computationally demanding applications. On homogeneous datasets …

When do neural nets outperform boosted trees on tabular data?

D McElfresh, S Khandagale… - Advances in …, 2024 - proceedings.neurips.cc
Tabular data is one of the most commonly used types of data in machine learning. Despite
recent advances in neural nets (NNs) for tabular data, there is still an active discussion on …

[HTML][HTML] Deep Learning applications for COVID-19

C Shorten, TM Khoshgoftaar, B Furht - Journal of big Data, 2021 - Springer
This survey explores how Deep Learning has battled the COVID-19 pandemic and provides
directions for future research on COVID-19. We cover Deep Learning applications in Natural …

Machine-learning-based prediction and optimization of emerging contaminants' adsorption capacity on biochar materials

ZH Jaffari, H Jeong, J Shin, J Kwak, C Son… - Chemical Engineering …, 2023 - Elsevier
Biochar materials have recently received considerable recognition as eco-friendly and cost-
effective adsorbents capable of effectively removing hazardous emerging contaminants (eg …

Quantum computing for finance: State-of-the-art and future prospects

DJ Egger, C Gambella, J Marecek… - IEEE Transactions …, 2020 - ieeexplore.ieee.org
This article outlines our point of view regarding the applicability, state-of-the-art, and
potential of quantum computing for problems in finance. We provide an introduction to …

A deep-learned embedding technique for categorical features encoding

MK Dahouda, I Joe - IEEE Access, 2021 - ieeexplore.ieee.org
Many machine learning algorithms and almost all deep learning architectures are incapable
of processing plain texts in their raw form. This means that their input to the algorithms must …

Recent applications of machine learning in alloy design: A review

M Hu, Q Tan, R Knibbe, M Xu, B Jiang, S Wang… - Materials Science and …, 2023 - Elsevier
The history of machine learning (ML) can be traced back to the 1950 s, and its application in
alloy design has recently begun to flourish and expand rapidly. The driving force behind this …