Continual Domain Randomization

Josip Josifovski; Sayantan Auddy; Mohammadhossein Malmir; Justus Piater; Alois Knoll; Nicolás Navarro-Guerrero

doi:10.48550/arXiv.2403.12193

Details

Original language	English
Title of host publication	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	4965-4972
Number of pages	8
ISBN (electronic)	979-8-3503-7770-5
ISBN (print)	979-8-3503-7771-2
Publication status	Published - 14 Oct 2024
Event	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 - Abu Dhabi, United Arab Emirates. Duration: 14 Oct 2024 → 18 Oct 2024

Publication series

Name	IEEE International Conference on Intelligent Robots and Systems
ISSN (print)	2153-0858
ISSN (electronic)	2153-0866

Abstract

Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.
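The training scheme described in the abstract can be sketched roughly as follows. This is only an illustrative outline, not the authors' implementation: the parameter names, ranges, and helper functions are hypothetical, and the quadratic penalty is a generic stand-in for a continual learning regularizer, since the abstract does not name the method used.

```python
import random

# Hypothetical nominal simulator parameters and randomization ranges (assumptions).
NOMINAL = {"friction": 1.0, "latency": 0.0, "obs_noise": 0.0}
RANGES = {"friction": (0.5, 1.5), "latency": (0.0, 0.1), "obs_noise": (0.0, 0.05)}


def cdr_schedule(subsets):
    """Return the stage list: a non-randomized stage first, then one stage per subset."""
    return [()] + [tuple(s) for s in subsets]


def sample_params(active, rng):
    """Randomize only the parameters named in `active`; keep the rest nominal."""
    params = dict(NOMINAL)
    for name in active:
        low, high = RANGES[name]
        params[name] = rng.uniform(low, high)
    return params


def penalized_loss(task_loss, weights, anchor, strength):
    """Task loss plus a quadratic penalty that pulls the weights toward the
    solution of the previous stage (a simple stand-in for continual learning)."""
    penalty = sum((w - a) ** 2 for w, a in zip(weights, anchor))
    return task_loss + 0.5 * strength * penalty


if __name__ == "__main__":
    rng = random.Random(0)
    # Each stage would train the policy on environments drawn from sample_params,
    # using penalized_loss with the previous stage's weights as the anchor.
    for stage in cdr_schedule([["friction"], ["latency", "obs_noise"]]):
        print(stage, sample_params(stage, rng))
```

In a real setup, each stage would run an RL algorithm in the sampled environments, and the anchor weights would be updated after every stage so that later randomizations do not overwrite what earlier ones taught the policy.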

ASJC Scopus subject areas

Engineering (all)
Control and Systems Engineering
Computer Science (all)
Software
Computer Science (all)
Computer Vision and Pattern Recognition
Computer Science (all)
Computer Science Applications

Cite this

Continual Domain Randomization. / Josifovski, Josip; Auddy, Sayantan; Malmir, Mohammadhossein et al.
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 4965-4972 (IEEE International Conference on Intelligent Robots and Systems).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › Peer review

Josifovski, J, Auddy, S, Malmir, M, Piater, J, Knoll, A & Navarro-Guerrero, N 2024, Continual Domain Randomization. in 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. IEEE International Conference on Intelligent Robots and Systems, Institute of Electrical and Electronics Engineers Inc., pp. 4965-4972, 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024, Abu Dhabi, United Arab Emirates, 14 Oct 2024. https://doi.org/10.48550/arXiv.2403.12193, https://doi.org/10.1109/IROS58592.2024.10802060

Josifovski, J., Auddy, S., Malmir, M., Piater, J., Knoll, A., & Navarro-Guerrero, N. (2024). Continual Domain Randomization. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 (pp. 4965-4972). (IEEE International Conference on Intelligent Robots and Systems). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.48550/arXiv.2403.12193, https://doi.org/10.1109/IROS58592.2024.10802060

Josifovski J, Auddy S, Malmir M, Piater J, Knoll A, Navarro-Guerrero N. Continual Domain Randomization. in 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc. 2024. p. 4965-4972. (IEEE International Conference on Intelligent Robots and Systems). doi: 10.48550/arXiv.2403.12193, 10.1109/IROS58592.2024.10802060

Josifovski, Josip ; Auddy, Sayantan ; Malmir, Mohammadhossein et al. / Continual Domain Randomization. 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 4965-4972 (IEEE International Conference on Intelligent Robots and Systems).

@inproceedings{52a0e67c34f24409b3ed91d332cf6d42,

title = "Continual Domain Randomization",

abstract = "Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.",

keywords = "continual reinforcement learning, Domain randomization, robotic manipulation, sim2real transfer",

author = "Josip Josifovski and Sayantan Auddy and Mohammadhossein Malmir and Justus Piater and Alois Knoll and Nicol{\'a}s Navarro-Guerrero",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 ; Conference date: 14-10-2024 Through 18-10-2024",

year = "2024",

month = oct,

day = "14",

doi = "10.48550/arXiv.2403.12193",

language = "English",

isbn = "979-8-3503-7771-2",

series = "IEEE International Conference on Intelligent Robots and Systems",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "4965--4972",

booktitle = "2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024",

address = "United States",

}

TY - GEN

T1 - Continual Domain Randomization

AU - Josifovski, Josip

AU - Auddy, Sayantan

AU - Malmir, Mohammadhossein

AU - Piater, Justus

AU - Knoll, Alois

AU - Navarro-Guerrero, Nicolás

PY - 2024/10/14

Y1 - 2024/10/14

N2 - Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

AB - Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

KW - continual reinforcement learning

KW - Domain randomization

KW - robotic manipulation

KW - sim2real transfer

UR - http://www.scopus.com/inward/record.url?scp=85216474476&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2403.12193

DO - 10.48550/arXiv.2403.12193

M3 - Conference contribution

AN - SCOPUS:85216474476

SN - 979-8-3503-7771-2

T3 - IEEE International Conference on Intelligent Robots and Systems

SP - 4965

EP - 4972

BT - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024

Y2 - 14 October 2024 through 18 October 2024

ER -

By the same authors

Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks

Visuo-haptic object perception for robots: an overview

Survey on reinforcement learning for language processing

Optimizing BioTac Simulation for Realistic Tactile Perception

Cognitive inspired aspects of robot learning