Loading [MathJax]/extensions/tex2jax.js

Continual Domain Randomization

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

External Research Organisations

  • Technical University of Munich (TUM)
  • University of Innsbruck

Details

Original languageEnglish
Title of host publication2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages4965-4972
Number of pages8
ISBN (electronic)979-8-3503-7770-5
ISBN (print)979-8-3503-7771-2
Publication statusPublished - 14 Oct 2024
Event2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 - Abu Dhabi, United Arab Emirates
Duration: 14 Oct 202418 Oct 2024

Publication series

NameIEEE International Conference on Intelligent Robots and Systems
ISSN (Print)2153-0858
ISSN (electronic)2153-0866

Abstract

Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

Keywords

    continual reinforcement learning, Domain randomization, robotic manipulation, sim2real transfer

ASJC Scopus subject areas

Cite this

Continual Domain Randomization. / Josifovski, Josip; Auddy, Sayantan; Malmir, Mohammadhossein et al.
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc., 2024. p. 4965-4972 (IEEE International Conference on Intelligent Robots and Systems).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Josifovski, J, Auddy, S, Malmir, M, Piater, J, Knoll, A & Navarro-Guerrero, N 2024, Continual Domain Randomization. in 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. IEEE International Conference on Intelligent Robots and Systems, Institute of Electrical and Electronics Engineers Inc., pp. 4965-4972, 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024, Abu Dhabi, United Arab Emirates, 14 Oct 2024. https://doi.org/10.48550/arXiv.2403.12193, https://doi.org/10.1109/IROS58592.2024.10802060
Josifovski, J., Auddy, S., Malmir, M., Piater, J., Knoll, A., & Navarro-Guerrero, N. (2024). Continual Domain Randomization. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 (pp. 4965-4972). (IEEE International Conference on Intelligent Robots and Systems). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.48550/arXiv.2403.12193, https://doi.org/10.1109/IROS58592.2024.10802060
Josifovski J, Auddy S, Malmir M, Piater J, Knoll A, Navarro-Guerrero N. Continual Domain Randomization. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc. 2024. p. 4965-4972. (IEEE International Conference on Intelligent Robots and Systems). doi: 10.48550/arXiv.2403.12193, 10.1109/IROS58592.2024.10802060
Josifovski, Josip ; Auddy, Sayantan ; Malmir, Mohammadhossein et al. / Continual Domain Randomization. 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 4965-4972 (IEEE International Conference on Intelligent Robots and Systems).
Download
@inproceedings{52a0e67c34f24409b3ed91d332cf6d42,
title = "Continual Domain Randomization",
abstract = "Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.",
keywords = "continual reinforcement learning, Domain randomization, robotic manipulation, sim2real transfer",
author = "Josip Josifovski and Sayantan Auddy and Mohammadhossein Malmir and Justus Piater and Alois Knoll and Nicol{\'a}s Navarro-Guerrero",
note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 ; Conference date: 14-10-2024 Through 18-10-2024",
year = "2024",
month = oct,
day = "14",
doi = "10.48550/arXiv.2403.12193",
language = "English",
isbn = "979-8-3503-7771-2",
series = "IEEE International Conference on Intelligent Robots and Systems",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "4965--4972",
booktitle = "2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024",
address = "United States",

}

Download

TY - GEN

T1 - Continual Domain Randomization

AU - Josifovski, Josip

AU - Auddy, Sayantan

AU - Malmir, Mohammadhossein

AU - Piater, Justus

AU - Knoll, Alois

AU - Navarro-Guerrero, Nicolás

N1 - Publisher Copyright: © 2024 IEEE.

PY - 2024/10/14

Y1 - 2024/10/14

N2 - Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

AB - Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

KW - continual reinforcement learning

KW - Domain randomization

KW - robotic manipulation

KW - sim2real transfer

UR - http://www.scopus.com/inward/record.url?scp=85216474476&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2403.12193

DO - 10.48550/arXiv.2403.12193

M3 - Conference contribution

AN - SCOPUS:85216474476

SN - 979-8-3503-7771-2

T3 - IEEE International Conference on Intelligent Robots and Systems

SP - 4965

EP - 4972

BT - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024

Y2 - 14 October 2024 through 18 October 2024

ER -

By the same author(s)