Continual Domain Randomization

Josip Josifovski; Sayantan Auddy; Mohammadhossein Malmir; Justus Piater; Alois Knoll; Nicolás Navarro-Guerrero

doi:10.48550/arXiv.2403.12193

Details

Original language	English
Title of host publication	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	4965-4972
Number of pages	8
ISBN (electronic)	979-8-3503-7770-5
ISBN (print)	979-8-3503-7771-2
Publication status	Published - 14 Oct 2024
Event	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 - Abu Dhabi, United Arab Emirates Duration: 14 Oct 2024 → 18 Oct 2024

Publication series

Name	IEEE International Conference on Intelligent Robots and Systems
ISSN (Print)	2153-0858
ISSN (electronic)	2153-0866

Abstract

Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

Keywords

continual reinforcement learning, Domain randomization, robotic manipulation, sim2real transfer

ASJC Scopus subject areas

Engineering(all)
Control and Systems Engineering
Computer Science(all)
Software
Computer Science(all)
Computer Vision and Pattern Recognition
Computer Science(all)
Computer Science Applications

Cite this

Continual Domain Randomization. / Josifovski, Josip; Auddy, Sayantan; Malmir, Mohammadhossein et al.
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc., 2024. p. 4965-4972 (IEEE International Conference on Intelligent Robots and Systems).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Josifovski, J, Auddy, S, Malmir, M, Piater, J, Knoll, A & Navarro-Guerrero, N 2024, Continual Domain Randomization. in 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. IEEE International Conference on Intelligent Robots and Systems, Institute of Electrical and Electronics Engineers Inc., pp. 4965-4972, 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024, Abu Dhabi, United Arab Emirates, 14 Oct 2024. https://doi.org/10.48550/arXiv.2403.12193, https://doi.org/10.1109/IROS58592.2024.10802060

Josifovski, J., Auddy, S., Malmir, M., Piater, J., Knoll, A., & Navarro-Guerrero, N. (2024). Continual Domain Randomization. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 (pp. 4965-4972). (IEEE International Conference on Intelligent Robots and Systems). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.48550/arXiv.2403.12193, https://doi.org/10.1109/IROS58592.2024.10802060

Josifovski J, Auddy S, Malmir M, Piater J, Knoll A, Navarro-Guerrero N. Continual Domain Randomization. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc. 2024. p. 4965-4972. (IEEE International Conference on Intelligent Robots and Systems). doi: 10.48550/arXiv.2403.12193, 10.1109/IROS58592.2024.10802060

Josifovski, Josip ; Auddy, Sayantan ; Malmir, Mohammadhossein et al. / Continual Domain Randomization. 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 4965-4972 (IEEE International Conference on Intelligent Robots and Systems).

Download

@inproceedings{52a0e67c34f24409b3ed91d332cf6d42,

title = "Continual Domain Randomization",

abstract = "Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.",

keywords = "continual reinforcement learning, Domain randomization, robotic manipulation, sim2real transfer",

author = "Josip Josifovski and Sayantan Auddy and Mohammadhossein Malmir and Justus Piater and Alois Knoll and Nicol{\'a}s Navarro-Guerrero",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 ; Conference date: 14-10-2024 Through 18-10-2024",

year = "2024",

month = oct,

day = "14",

doi = "10.48550/arXiv.2403.12193",

language = "English",

isbn = "979-8-3503-7771-2",

series = "IEEE International Conference on Intelligent Robots and Systems",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "4965--4972",

booktitle = "2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024",

address = "United States",

}

Download

TY - GEN

T1 - Continual Domain Randomization

AU - Josifovski, Josip

AU - Auddy, Sayantan

AU - Malmir, Mohammadhossein

AU - Piater, Justus

AU - Knoll, Alois

AU - Navarro-Guerrero, Nicolás

PY - 2024/10/14

Y1 - 2024/10/14

N2 - Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

AB - Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

KW - continual reinforcement learning

KW - Domain randomization

KW - robotic manipulation

KW - sim2real transfer

UR - http://www.scopus.com/inward/record.url?scp=85216474476&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2403.12193

DO - 10.48550/arXiv.2403.12193

M3 - Conference contribution

AN - SCOPUS:85216474476

SN - 979-8-3503-7771-2

T3 - IEEE International Conference on Intelligent Robots and Systems

SP - 4965

EP - 4972

BT - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024

Y2 - 14 October 2024 through 18 October 2024

ER -

Research@Leibniz University

Continual Domain Randomization

Authors

Research Organisations

External Research Organisations

Details

Publication series

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks

Visuo-haptic object perception for robots: an overview

Survey on reinforcement learning for language processing

Optimizing BioTac Simulation for Realistic Tactile Perception

Cognitive inspired aspects of robot learning

Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks

Visuo-haptic object perception for robots: an overview

Survey on reinforcement learning for language processing

Optimizing BioTac Simulation for Realistic Tactile Perception

Cognitive inspired aspects of robot learning

Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks