
Extended Abstract: AutoRL Hyperparameter Landscapes

Research output: Contribution to conference › Abstract › Research › peer review

Details

Original language: English
Number of pages: 6
Publication status: E-pub ahead of print - 20 Jul 2023
Event: European Workshop on Reinforcement Learning 2023 - Brussels
Duration: 13 Sept 2023 - 16 Sept 2023
https://ewrl.wordpress.com/ewrl16-2023/

Workshop

Workshop: European Workshop on Reinforcement Learning 2023
City: Brussels
Period: 13 Sept 2023 - 16 Sept 2023
Internet address: https://ewrl.wordpress.com/ewrl16-2023/

Abstract

Although Reinforcement Learning (RL) has been shown to produce impressive results, its use is limited by the impact of its hyperparameters on performance, which often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. Since existing AutoRL approaches dynamically adjust hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points in time throughout training. Addressing an important open question on the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time across representative algorithms from the RL literature (DQN, PPO, and SAC) in different kinds of environments (Cartpole, Bipedal Walker, and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for gaining further insights into AutoRL problems through landscape analysis.
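
To make the core idea concrete, below is a minimal sketch (not the authors' code) of the protocol the abstract describes: the same grid of hyperparameter configurations is evaluated at several training checkpoints rather than only after full training, yielding one performance landscape per checkpoint. The checkpoint schedule, the (learning rate, discount) grid, and the train_and_eval stand-in (a toy performance model substituted for actual RL training) are all illustrative assumptions.

# Sketch: build one hyperparameter landscape per training checkpoint.
# All names here are hypothetical stand-ins, not the paper's implementation.
import itertools
import random

LEARNING_RATES = [1e-4, 3e-4, 1e-3, 3e-3]
DISCOUNTS = [0.95, 0.99, 0.999]
CHECKPOINTS = [10_000, 50_000, 100_000]  # training steps at which we snapshot

def train_and_eval(lr, gamma, steps, seed):
    """Stand-in for training an agent (e.g. DQN/PPO/SAC) for `steps` steps
    and returning its mean evaluation return. A real study would train the
    agent and average returns over evaluation episodes and seeds."""
    random.seed(hash((lr, gamma, steps, seed)) % (2**32))
    # Toy model: the best learning rate shrinks as training progresses,
    # mimicking a landscape whose optimum moves over time.
    best_lr = 3e-3 * (10_000 / steps)
    score = gamma / (1.0 + abs(lr - best_lr) / best_lr)
    return score + random.gauss(0, 0.02)

# One landscape per checkpoint: configuration -> mean performance over seeds.
landscapes = {}
for steps in CHECKPOINTS:
    landscape = {}
    for lr, gamma in itertools.product(LEARNING_RATES, DISCOUNTS):
        returns = [train_and_eval(lr, gamma, steps, seed) for seed in range(5)]
        landscape[(lr, gamma)] = sum(returns) / len(returns)
    landscapes[steps] = landscape

# If the landscape were static, the argmax would not move across checkpoints.
for steps, landscape in landscapes.items():
    best = max(landscape, key=landscape.get)
    print(f"after {steps:>7} steps: best (lr, gamma) = {best}")

In the actual study the landscapes come from real RL training runs and are examined with landscape-analysis tools; the sketch only illustrates the protocol of snapshotting the same configuration space at multiple points during training.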

Cite this

Extended Abstract: AutoRL Hyperparameter Landscapes. / Mohan, Aditya; Benjamins, Carolin; Wienecke, Konrad et al.
2023. Abstract from European Workshop on Reinforcement Learning 2023, Brussels.

Research output: Contribution to conference › Abstract › Research › peer review

Mohan, A, Benjamins, C, Wienecke, K, Dockhorn, A & Lindauer, M 2023, 'Extended Abstract: AutoRL Hyperparameter Landscapes', European Workshop on Reinforcement Learning 2023, Brussels, 13 Sept 2023 - 16 Sept 2023. <https://openreview.net/forum?id=4Zu0l5lBgc>
Mohan, A., Benjamins, C., Wienecke, K., Dockhorn, A., & Lindauer, M. (2023). Extended Abstract: AutoRL Hyperparameter Landscapes. Abstract from European Workshop on Reinforcement Learning 2023, Brussels. Advance online publication. https://openreview.net/forum?id=4Zu0l5lBgc
Mohan A, Benjamins C, Wienecke K, Dockhorn A, Lindauer M. Extended Abstract: AutoRL Hyperparameter Landscapes. 2023. Abstract from European Workshop on Reinforcement Learning 2023, Brussels. Epub 2023 Jul 20.
Mohan, Aditya; Benjamins, Carolin; Wienecke, Konrad et al. / Extended Abstract: AutoRL Hyperparameter Landscapes. Abstract from European Workshop on Reinforcement Learning 2023, Brussels. 6 p.
BibTeX
@conference{4d63a36a8a73415f95f9a197080dd36f,
title = "Extended Abstract: AutoRL Hyperparameter Landscapes",
abstract = "Although Reinforcement Learning (RL) has been shown to produce impressive results, its use is limited by the impact of its hyperparameters on performance, which often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. Since existing AutoRL approaches dynamically adjust hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points in time throughout training. Addressing an important open question on the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time across representative algorithms from the RL literature (DQN, PPO, and SAC) in different kinds of environments (Cartpole, Bipedal Walker, and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for gaining further insights into AutoRL problems through landscape analysis.",
author = "Aditya Mohan and Carolin Benjamins and Konrad Wienecke and Alexander Dockhorn and Marius Lindauer",
year = "2023",
month = jul,
day = "20",
language = "English",
note = "European Workshop on Reinforcement Learning 2023 ; Conference date: 13-09-2023 Through 16-09-2023",
url = "https://ewrl.wordpress.com/ewrl16-2023/",

}

RIS

TY - CONF

T1 - Extended Abstract: AutoRL Hyperparameter Landscapes

T2 - European Workshop on Reinforcement Learning 2023

AU - Mohan, Aditya

AU - Benjamins, Carolin

AU - Wienecke, Konrad

AU - Dockhorn, Alexander

AU - Lindauer, Marius

PY - 2023/7/20

Y1 - 2023/7/20

N2 - Although Reinforcement Learning (RL) has been shown to produce impressive results, its use is limited by the impact of its hyperparameters on performance, which often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. Since existing AutoRL approaches dynamically adjust hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points in time throughout training. Addressing an important open question on the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time across representative algorithms from the RL literature (DQN, PPO, and SAC) in different kinds of environments (Cartpole, Bipedal Walker, and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for gaining further insights into AutoRL problems through landscape analysis.

AB - Although Reinforcement Learning (RL) has been shown to produce impressive results, its use is limited by the impact of its hyperparameters on performance, which often makes it difficult to achieve good results in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. Since existing AutoRL approaches dynamically adjust hyperparameter configurations, we propose an approach to build and analyze these hyperparameter landscapes not just for one point in time but at multiple points in time throughout training. Addressing an important open question on the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes strongly vary over time across representative algorithms from the RL literature (DQN, PPO, and SAC) in different kinds of environments (Cartpole, Bipedal Walker, and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for gaining further insights into AutoRL problems through landscape analysis.

M3 - Abstract

Y2 - 13 September 2023 through 16 September 2023

ER -
