Automatic Risk Adaptation in Distributional Reinforcement Learning

Frederik Schubert; Theresa Eimer; Bodo Rosenhahn; Marius Lindauer

Details

Original language	English
Number of pages	14
Publication status	E-pub ahead of print - 11 Jun 2021

Abstract

The use of Reinforcement Learning (RL) agents in practical applications requires the consideration of suboptimal outcomes, depending on the familiarity of the agent with its environment. This is especially important in safety-critical environments, where errors can lead to high costs or damage. In distributional RL, the risk-sensitivity can be controlled via different distortion measures of the estimated return distribution. However, these distortion functions require an estimate of the risk level, which is difficult to obtain and depends on the current state. In this work, we demonstrate the suboptimality of a static risk level estimation and propose a method to dynamically select risk levels at each environment step. Our method ARA (Automatic Risk Adaptation) estimates the appropriate risk level in both known and unknown environments using a Random Network Distillation error. We show reduced failure rates by up to a factor of 7 and improved generalization performance by up to 14% compared to both risk-aware and risk-agnostic agents in several locomotion environments.
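
As a rough illustration of the idea in the abstract, the sketch below scores actions with a distorted value of their estimated return distribution and picks the risk level from a novelty score. It is not the authors' implementation. CVaR is used here as one common distortion measure, and the function names, the linear novelty-to-risk mapping, and the bounds `low` and `high` are assumptions made for illustration. In ARA the novelty signal is a Random Network Distillation error; here it is simply a number in [0, 1].

```python
import math


def cvar_from_quantiles(quantiles, alpha):
    """CVaR at level alpha: mean of the lowest alpha-fraction of quantile estimates."""
    q = sorted(float(x) for x in quantiles)
    # Number of lower-tail quantiles to average, kept within [1, len(q)].
    k = min(len(q), max(1, math.ceil(alpha * len(q))))
    return sum(q[:k]) / k


def risk_level_from_novelty(novelty, low=0.1, high=1.0):
    """Hypothetical mapping (an assumption): higher novelty gives a smaller alpha,
    i.e. a more risk-averse agent. alpha = 1.0 recovers the plain mean."""
    novelty = min(1.0, max(0.0, float(novelty)))
    return high - novelty * (high - low)


def select_action(quantiles_per_action, novelty):
    """Pick the action with the highest CVaR at the novelty-dependent risk level."""
    alpha = risk_level_from_novelty(novelty)
    values = [cvar_from_quantiles(q, alpha) for q in quantiles_per_action]
    best = max(range(len(values)), key=values.__getitem__)
    return best, alpha
```

With a wide return distribution for one action and a narrow one for another, a novelty of 0 makes this sketch compare plain means, while a novelty of 1 makes it compare only the worst-case tail, so the preferred action can change from step to step.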

Keywords

cs.LG

Cite this

Automatic Risk Adaptation in Distributional Reinforcement Learning. / Schubert, Frederik; Eimer, Theresa; Rosenhahn, Bodo et al.
2021.

Research output: Working paper/Preprint › Preprint

Schubert, F, Eimer, T, Rosenhahn, B & Lindauer, M 2021 'Automatic Risk Adaptation in Distributional Reinforcement Learning'. <https://arxiv.org/abs/2106.06317>

Schubert, F., Eimer, T., Rosenhahn, B., & Lindauer, M. (2021). Automatic Risk Adaptation in Distributional Reinforcement Learning. Advance online publication. https://arxiv.org/abs/2106.06317

Schubert F, Eimer T, Rosenhahn B, Lindauer M. Automatic Risk Adaptation in Distributional Reinforcement Learning. 2021 Jun 11. Epub 2021 Jun 11.

Schubert, Frederik; Eimer, Theresa; Rosenhahn, Bodo et al. / Automatic Risk Adaptation in Distributional Reinforcement Learning. 2021.

@techreport{0ab334701ff24dbba229756909348834,

title = "Automatic Risk Adaptation in Distributional Reinforcement Learning",

abstract = "The use of Reinforcement Learning (RL) agents in practical applications requires the consideration of suboptimal outcomes, depending on the familiarity of the agent with its environment. This is especially important in safety-critical environments, where errors can lead to high costs or damage. In distributional RL, the risk-sensitivity can be controlled via different distortion measures of the estimated return distribution. However, these distortion functions require an estimate of the risk level, which is difficult to obtain and depends on the current state. In this work, we demonstrate the suboptimality of a static risk level estimation and propose a method to dynamically select risk levels at each environment step. Our method ARA (Automatic Risk Adaptation) estimates the appropriate risk level in both known and unknown environments using a Random Network Distillation error. We show reduced failure rates by up to a factor of 7 and improved generalization performance by up to 14% compared to both risk-aware and risk-agnostic agents in several locomotion environments.",

keywords = "cs.LG",

author = "Frederik Schubert and Theresa Eimer and Bodo Rosenhahn and Marius Lindauer",

year = "2021",

month = jun,

day = "11",

language = "English",

type = "WorkingPaper",

}

TY - UNPB

T1 - Automatic Risk Adaptation in Distributional Reinforcement Learning

AU - Schubert, Frederik

AU - Eimer, Theresa

AU - Rosenhahn, Bodo

AU - Lindauer, Marius

PY - 2021/6/11

Y1 - 2021/6/11

AB - The use of Reinforcement Learning (RL) agents in practical applications requires the consideration of suboptimal outcomes, depending on the familiarity of the agent with its environment. This is especially important in safety-critical environments, where errors can lead to high costs or damage. In distributional RL, the risk-sensitivity can be controlled via different distortion measures of the estimated return distribution. However, these distortion functions require an estimate of the risk level, which is difficult to obtain and depends on the current state. In this work, we demonstrate the suboptimality of a static risk level estimation and propose a method to dynamically select risk levels at each environment step. Our method ARA (Automatic Risk Adaptation) estimates the appropriate risk level in both known and unknown environments using a Random Network Distillation error. We show reduced failure rates by up to a factor of 7 and improved generalization performance by up to 14% compared to both risk-aware and risk-agnostic agents in several locomotion environments.

KW - cs.LG

M3 - Preprint

BT - Automatic Risk Adaptation in Distributional Reinforcement Learning

ER -

By the same author(s)

Robust Shape Fitting for 3D Scene Abstraction

Quantum normalizing flows for anomaly detection

AMLTK: A Modular AutoML Toolkit in Python

A variational autoencoder trained with priors from canonical pathways increases the interpretability of transcriptome data

AutoML in Heavily Constrained Applications