Details
| Original language | English |
|---|---|
| Title of host publication | Conference Proceedings - Second International Conference on Automated Machine Learning |
| Number of pages | 27 |
| Publication status | Published - 12 Nov 2023 |
| Event | 2nd International Conference on Automated Machine Learning, AutoML 2023 - Potsdam, Germany. Duration: 12 Nov 2023 → 15 Nov 2023 |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Volume | 228 |
| ISSN (Print) | 2640-3498 |
Abstract

Although Reinforcement Learning (RL) has been shown to produce impressive results, its use is limited by the impact of its hyperparameters on performance. This often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. In view of existing AutoRL approaches that dynamically adjust hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points throughout training. Addressing an important open question about the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time for representative algorithms from the RL literature (DQN and SAC) in different kinds of environments (Cartpole and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for further insights into AutoRL problems that can be gained through landscape analyses.
Keywords
- Reinforcement learning
- AutoML
- Hyperparameter optimization
Cite this
Mohan, A., Benjamins, C., Wienecke, K., Dockhorn, A., & Lindauer, M. (2023). AutoRL Hyperparameter Landscapes. In Conference Proceedings - Second International Conference on Automated Machine Learning. (Proceedings of Machine Learning Research; Vol. 228).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
TY - GEN
T1 - AutoRL Hyperparameter Landscapes
AU - Mohan, Aditya
AU - Benjamins, Carolin
AU - Wienecke, Konrad
AU - Dockhorn, Alexander
AU - Lindauer, Marius
PY - 2023/11/12
Y1 - 2023/11/12
N2 - Although Reinforcement Learning (RL) has been shown to produce impressive results, its use is limited by the impact of its hyperparameters on performance. This often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. In view of existing AutoRL approaches that dynamically adjust hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points throughout training. Addressing an important open question about the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time for representative algorithms from the RL literature (DQN and SAC) in different kinds of environments (Cartpole and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for further insights into AutoRL problems that can be gained through landscape analyses.
AB - Although Reinforcement Learning (RL) has been shown to produce impressive results, its use is limited by the impact of its hyperparameters on performance. This often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. In view of existing AutoRL approaches that dynamically adjust hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points throughout training. Addressing an important open question about the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time for representative algorithms from the RL literature (DQN and SAC) in different kinds of environments (Cartpole and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for further insights into AutoRL problems that can be gained through landscape analyses.
KW - Reinforcement learning
KW - AutoML
KW - Hyperparameter optimization
UR - http://www.scopus.com/inward/record.url?scp=85184347459&partnerID=8YFLogxK
U2 - 10.48550/arXiv.2304.02396
DO - 10.48550/arXiv.2304.02396
M3 - Conference contribution
T3 - Proceedings of Machine Learning Research
BT - Conference Proceedings - Second International Conference on Automated Machine Learning
T2 - 2nd International Conference on Automated Machine Learning, AutoML 2023
Y2 - 12 November 2023 through 15 November 2023
ER -