Well-tuned Simple Nets Excel on Tabular Datasets

Kadra, Arlind; Lindauer, Marius; Hutter, Frank; Grabocka, Josif

arXiv:2106.11189 (cs)

[Submitted on 21 Jun 2021 (v1), last revised 5 Nov 2021 (this version, v2)]

Abstract: Tabular datasets are the last "unconquered castle" for deep learning, with traditional ML methods like Gradient-Boosted Decision Trees still performing strongly even against recent specialized neural architectures. In this paper, we hypothesize that the key to boosting the performance of neural networks lies in rethinking the joint and simultaneous application of a large set of modern regularization techniques. As a result, we propose regularizing plain Multilayer Perceptron (MLP) networks by searching for the optimal combination/cocktail of 13 regularization techniques for each dataset using a joint optimization over the decision on which regularizers to apply and their subsidiary hyperparameters. We empirically assess the impact of these regularization cocktails for MLPs in a large-scale empirical study comprising 40 tabular datasets and demonstrate that (i) well-regularized plain MLPs significantly outperform recent state-of-the-art specialized neural network architectures, and (ii) they even outperform strong traditional ML methods, such as XGBoost.
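
The joint optimization described in the abstract can be pictured as a conditional search space: each regularizer has an on/off decision, and its subsidiary hyperparameters are sampled only when it is switched on. The following is a minimal sketch under stated assumptions, not the authors' implementation. The regularizer names, the ranges, and the helpers `sample_cocktail` and `search` are hypothetical (the abstract does not list the 13 techniques), and plain random search stands in for whatever optimizer the paper uses.

```python
import random

# Hypothetical search space: each regularizer has an on/off switch and,
# when switched on, its own subsidiary hyperparameters with (low, high) ranges.
# The names and ranges are illustrative, not the paper's list of 13.
SEARCH_SPACE = {
    "dropout": {"rate": (0.0, 0.8)},
    "weight_decay": {"coefficient": (1e-5, 1e-1)},
    "batch_norm": {},
    "mixup": {"alpha": (0.0, 1.0)},
}


def sample_cocktail(rng):
    """Draw one configuration: which regularizers are active, plus their settings."""
    cocktail = {}
    for name, params in SEARCH_SPACE.items():
        if rng.random() < 0.5:  # decide whether this regularizer is applied
            # Subsidiary hyperparameters are sampled only for active regularizers.
            cocktail[name] = {p: rng.uniform(lo, hi) for p, (lo, hi) in params.items()}
    return cocktail


def search(evaluate, n_trials=50, seed=0):
    """Random search over cocktails; `evaluate` returns a validation score (higher is better)."""
    rng = random.Random(seed)
    best_score, best_cocktail = float("-inf"), None
    for _ in range(n_trials):
        cocktail = sample_cocktail(rng)
        score = evaluate(cocktail)
        if score > best_score:
            best_score, best_cocktail = score, cocktail
    return best_cocktail, best_score
```

In practice `evaluate` would train an MLP with the sampled cocktail on one dataset and return its validation score, so that the search is repeated per dataset as the abstract describes.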

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2106.11189 [cs.LG]
	(or arXiv:2106.11189v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.11189

Submission history

From: Arlind Kadra
[v1] Mon, 21 Jun 2021 15:27:43 UTC (910 KB)
[v2] Fri, 5 Nov 2021 09:53:40 UTC (946 KB)
