Self-Paced Context Evaluation for Contextual Reinforcement Learning

Theresa Eimer; André Biedenkapp; Frank Hutter; Marius Lindauer

Details

Original language	English
Title of host publication	Proceedings of the international conference on machine learning (ICML)
Publisher	ML Research Press
Pages	2948-2958
Number of pages	11
ISBN (electronic)	9781713845065
ISBN (print)	978-171384506-5
Publication status	Published - 18 Jul 2021

Publication series

Name	Proceedings of Machine Learning Research
Volume	139
ISSN (electronic)	2640-3498

Abstract

Reinforcement learning (RL) has made a lot of advances for solving a single problem in a given environment; but learning policies that generalize to unseen variations of a problem remains challenging. To improve sample efficiency for learning on such instances of a problem domain, we present Self-Paced Context Evaluation (SPaCE). Based on self-paced learning, \spc automatically generates \task curricula online with little computational overhead. To this end, SPaCE leverages information contained in state values during training to accelerate and improve training performance as well as generalization capabilities to new instances from the same problem domain. Nevertheless, SPaCE is independent of the problem domain at hand and can be applied on top of any RL agent with state-value function approximation. We demonstrate SPaCE's ability to speed up learning of different value-based RL agents on two environments, showing better generalization capabilities and up to 10x faster learning compared to naive approaches such as round robin or SPDRL, as the closest state-of-the-art approach.

Keywords

cs.LG

ASJC Scopus subject areas

Computer Science(all)
Artificial Intelligence
Computer Science(all)
Software
Engineering(all)
Control and Systems Engineering
Mathematics(all)
Statistics and Probability

Cite this

Self-Paced Context Evaluation for Contextual Reinforcement Learning. / Eimer, Theresa; Biedenkapp, André; Hutter, Frank et al.
Proceedings of the international conference on machine learning (ICML). ML Research Press, 2021. p. 2948-2958 (Proceedings of Machine Learning Research; Vol. 139).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Eimer, T, Biedenkapp, A, Hutter, F & Lindauer, M 2021, Self-Paced Context Evaluation for Contextual Reinforcement Learning. in Proceedings of the international conference on machine learning (ICML). Proceedings of Machine Learning Research, vol. 139, ML Research Press, pp. 2948-2958. <https://www.tnt.uni-hannover.de/papers/data/1454/space.pdf>

Eimer, T., Biedenkapp, A., Hutter, F., & Lindauer, M. (2021). Self-Paced Context Evaluation for Contextual Reinforcement Learning. In Proceedings of the international conference on machine learning (ICML) (pp. 2948-2958). (Proceedings of Machine Learning Research; Vol. 139). ML Research Press. https://www.tnt.uni-hannover.de/papers/data/1454/space.pdf

Eimer T, Biedenkapp A, Hutter F, Lindauer M. Self-Paced Context Evaluation for Contextual Reinforcement Learning. In Proceedings of the international conference on machine learning (ICML). ML Research Press. 2021. p. 2948-2958. (Proceedings of Machine Learning Research).

Eimer, Theresa ; Biedenkapp, André ; Hutter, Frank et al. / Self-Paced Context Evaluation for Contextual Reinforcement Learning. Proceedings of the international conference on machine learning (ICML). ML Research Press, 2021. pp. 2948-2958 (Proceedings of Machine Learning Research).

Download

@inproceedings{b7f481e4815a453c97f181a48cc71619,

title = "Self-Paced Context Evaluation for Contextual Reinforcement Learning",

abstract = " Reinforcement learning (RL) has made a lot of advances for solving a single problem in a given environment; but learning policies that generalize to unseen variations of a problem remains challenging. To improve sample efficiency for learning on such instances of a problem domain, we present Self-Paced Context Evaluation (SPaCE). Based on self-paced learning, \spc automatically generates \task curricula online with little computational overhead. To this end, SPaCE leverages information contained in state values during training to accelerate and improve training performance as well as generalization capabilities to new instances from the same problem domain. Nevertheless, SPaCE is independent of the problem domain at hand and can be applied on top of any RL agent with state-value function approximation. We demonstrate SPaCE's ability to speed up learning of different value-based RL agents on two environments, showing better generalization capabilities and up to 10x faster learning compared to naive approaches such as round robin or SPDRL, as the closest state-of-the-art approach. ",

keywords = "cs.LG",

author = "Theresa Eimer and Andr{\'e} Biedenkapp and Frank Hutter and Marius Lindauer",

note = "Publisher Copyright: Copyright {\textcopyright} 2021 by the author(s)",

year = "2021",

month = jul,

day = "18",

language = "English",

isbn = "978-171384506-5",

series = "Proceedings of Machine Learning Research",

publisher = "ML Research Press",

pages = "2948--2958",

booktitle = "Proceedings of the international conference on machine learning (ICML)",

}

Download

TY - GEN

T1 - Self-Paced Context Evaluation for Contextual Reinforcement Learning

AU - Eimer, Theresa

AU - Biedenkapp, André

AU - Hutter, Frank

AU - Lindauer, Marius

PY - 2021/7/18

Y1 - 2021/7/18

N2 - Reinforcement learning (RL) has made a lot of advances for solving a single problem in a given environment; but learning policies that generalize to unseen variations of a problem remains challenging. To improve sample efficiency for learning on such instances of a problem domain, we present Self-Paced Context Evaluation (SPaCE). Based on self-paced learning, \spc automatically generates \task curricula online with little computational overhead. To this end, SPaCE leverages information contained in state values during training to accelerate and improve training performance as well as generalization capabilities to new instances from the same problem domain. Nevertheless, SPaCE is independent of the problem domain at hand and can be applied on top of any RL agent with state-value function approximation. We demonstrate SPaCE's ability to speed up learning of different value-based RL agents on two environments, showing better generalization capabilities and up to 10x faster learning compared to naive approaches such as round robin or SPDRL, as the closest state-of-the-art approach.

AB - Reinforcement learning (RL) has made a lot of advances for solving a single problem in a given environment; but learning policies that generalize to unseen variations of a problem remains challenging. To improve sample efficiency for learning on such instances of a problem domain, we present Self-Paced Context Evaluation (SPaCE). Based on self-paced learning, \spc automatically generates \task curricula online with little computational overhead. To this end, SPaCE leverages information contained in state values during training to accelerate and improve training performance as well as generalization capabilities to new instances from the same problem domain. Nevertheless, SPaCE is independent of the problem domain at hand and can be applied on top of any RL agent with state-value function approximation. We demonstrate SPaCE's ability to speed up learning of different value-based RL agents on two environments, showing better generalization capabilities and up to 10x faster learning compared to naive approaches such as round robin or SPDRL, as the closest state-of-the-art approach.

KW - cs.LG

UR - http://www.scopus.com/inward/record.url?scp=85161344151&partnerID=8YFLogxK

M3 - Conference contribution

SN - 978-171384506-5

T3 - Proceedings of Machine Learning Research

SP - 2948

EP - 2958

BT - Proceedings of the international conference on machine learning (ICML)

PB - ML Research Press

ER -

Research@Leibniz University

Self-Paced Context Evaluation for Contextual Reinforcement Learning

Authors

Research Organisations

External Research Organisations

Details

Publication series

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

Task Scheduling & Forgetting in Multi-Task Reinforcement Learning

AMLTK: A Modular AutoML Toolkit in Python

AutoML in Heavily Constrained Applications

MO-SMAC: Multi-objective Sequential Model-based Algorithm Configuration

How Green is AutoML for Tabular Data?

Task Scheduling & Forgetting in Multi-Task Reinforcement Learning

AMLTK: A Modular AutoML Toolkit in Python

AutoML in Heavily Constrained Applications

MO-SMAC: Multi-objective Sequential Model-based Algorithm Configuration

How Green is AutoML for Tabular Data?

Task Scheduling & Forgetting in Multi-Task Reinforcement Learning