Naive automated machine learning

Research output: Contribution to journalArticleResearchpeer review

Authors

External Research Organisations

  • Universidad de la Sabana
  • Paderborn University
View graph of relations

Details

Original languageEnglish
Pages (from-to)1131-1170
Number of pages40
JournalMachine learning
Volume112
Issue number4
Publication statusPublished - Apr 2023
Externally publishedYes

Abstract

An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

Keywords

    Automated Machine Learning, Black-Box Optimization, Data Science

ASJC Scopus subject areas

Cite this

Naive automated machine learning. / Mohr, Felix; Wever, Marcel.
In: Machine learning, Vol. 112, No. 4, 04.2023, p. 1131-1170.

Research output: Contribution to journalArticleResearchpeer review

Mohr F, Wever M. Naive automated machine learning. Machine learning. 2023 Apr;112(4):1131-1170. doi: 10.1007/s10994-022-06200-0
Mohr, Felix ; Wever, Marcel. / Naive automated machine learning. In: Machine learning. 2023 ; Vol. 112, No. 4. pp. 1131-1170.
Download
@article{ac0870d820644e81a5ae39bea4c07c54,
title = "Naive automated machine learning",
abstract = "An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.",
keywords = "Automated Machine Learning, Black-Box Optimization, Data Science",
author = "Felix Mohr and Marcel Wever",
note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",
year = "2023",
month = apr,
doi = "10.1007/s10994-022-06200-0",
language = "English",
volume = "112",
pages = "1131--1170",
journal = "Machine learning",
issn = "0885-6125",
publisher = "Springer Netherlands",
number = "4",

}

Download

TY - JOUR

T1 - Naive automated machine learning

AU - Mohr, Felix

AU - Wever, Marcel

N1 - Publisher Copyright: © 2022, The Author(s).

PY - 2023/4

Y1 - 2023/4

N2 - An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

AB - An essential task of automated machine learning (AutoML) is the problem of automatically finding the pipeline with the best generalization performance on a given dataset. This problem has been addressed with sophisticated black-box optimization techniques such as Bayesian optimization, grammar-based genetic algorithms, and tree search algorithms. Most of the current approaches are motivated by the assumption that optimizing the components of a pipeline in isolation may yield sub-optimal results. We present Naive AutoML , an approach that precisely realizes such an in-isolation optimization of the different components of a pre-defined pipeline scheme. The returned pipeline is obtained by just taking the best algorithm of each slot. The isolated optimization leads to substantially reduced search spaces, and, surprisingly, this approach yields comparable and sometimes even better performance than current state-of-the-art optimizers.

KW - Automated Machine Learning

KW - Black-Box Optimization

KW - Data Science

UR - http://www.scopus.com/inward/record.url?scp=85139115238&partnerID=8YFLogxK

U2 - 10.1007/s10994-022-06200-0

DO - 10.1007/s10994-022-06200-0

M3 - Article

AN - SCOPUS:85139115238

VL - 112

SP - 1131

EP - 1170

JO - Machine learning

JF - Machine learning

SN - 0885-6125

IS - 4

ER -

By the same author(s)