Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks

Nicolás Navarro-Guerrero

doi:10.1007/s00521-022-07949-0

Details

Original language	English
Pages (from - to)	16931–16943
Number of pages	13
Journal	Neural Computing and Applications
Volume	35
Issue number	23
Early online date	5 Dec. 2022
Publication status	Published - Aug. 2023
Externally published	Yes

Abstract

Reinforcement learning (RL) has become widely adopted in robot control. Despite many successes, one major persisting problem can be very low data efficiency. One solution is interactive feedback, which has been shown to speed up RL considerably. As a result, there is an abundance of different strategies, which are, however, primarily tested on discrete grid-world and small-scale optimal control scenarios. In the literature, there is no consensus about which feedback frequency is optimal or at which time the feedback is most beneficial. To resolve these discrepancies, we isolate and quantify the effect of feedback frequency in robotic tasks with continuous state and action spaces. The experiments encompass inverse kinematics learning for robotic manipulator arms of different complexity. We show that seemingly contradictory reported phenomena occur at different complexity levels. Furthermore, our results suggest that no single ideal feedback frequency exists; rather, the feedback frequency should be changed as the agent’s proficiency in the task increases.

ASJC Scopus subject areas

Computer Science (all)
Software
Computer Science (all)
Artificial intelligence

Cite this

Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks. / Navarro-Guerrero, Nicolás.
In: Neural Computing and Applications, Vol. 35, No. 23, 08.2023, pp. 16931–16943.

Research output: Contribution to journal › Article › Research › Peer-reviewed

Navarro-Guerrero, N 2023, 'Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks', Neural Computing and Applications, vol. 35, no. 23, pp. 16931–16943. https://doi.org/10.1007/s00521-022-07949-0

Navarro-Guerrero, N. (2023). Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks. Neural Computing and Applications, 35(23), 16931–16943. https://doi.org/10.1007/s00521-022-07949-0

Navarro-Guerrero N. Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks. Neural Computing and Applications. 2023 Aug;35(23):16931–16943. Epub 2022 Dec 5. doi: 10.1007/s00521-022-07949-0

Navarro-Guerrero, Nicolás. / Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks. In: Neural Computing and Applications. 2023 ; Vol. 35, No. 23. pp. 16931–16943.

BibTeX

@article{94cef40c549043b1940cb864d7d185c1,
  title = "Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks",
  abstract = "Reinforcement learning (RL) has become widely adopted in robot control. Despite many successes, one major persisting problem can be very low data efficiency. One solution is interactive feedback, which has been shown to speed up RL considerably. As a result, there is an abundance of different strategies, which are, however, primarily tested on discrete grid-world and small-scale optimal control scenarios. In the literature, there is no consensus about which feedback frequency is optimal or at which time the feedback is most beneficial. To resolve these discrepancies, we isolate and quantify the effect of feedback frequency in robotic tasks with continuous state and action spaces. The experiments encompass inverse kinematics learning for robotic manipulator arms of different complexity. We show that seemingly contradictory reported phenomena occur at different complexity levels. Furthermore, our results suggest that no single ideal feedback frequency exists; rather, the feedback frequency should be changed as the agent{\textquoteright}s proficiency in the task increases.",
  keywords = "Guided exploration, Human-aligned reinforcement learning, Interactive reinforcement learning, Intrinsic feedback homology",
  author = "Nicol{\'a}s Navarro-Guerrero",
  note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",
  year = "2023",
  month = aug,
  doi = "10.1007/s00521-022-07949-0",
  language = "English",
  volume = "35",
  pages = "16931–16943",
  journal = "Neural Computing and Applications",
  issn = "0941-0643",
  publisher = "Springer London",
  number = "23",
}

RIS

TY  - JOUR
T1  - Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks
AU  - Navarro-Guerrero, Nicolás
PY  - 2023/8
Y1  - 2023/8
AB  - Reinforcement learning (RL) has become widely adopted in robot control. Despite many successes, one major persisting problem can be very low data efficiency. One solution is interactive feedback, which has been shown to speed up RL considerably. As a result, there is an abundance of different strategies, which are, however, primarily tested on discrete grid-world and small-scale optimal control scenarios. In the literature, there is no consensus about which feedback frequency is optimal or at which time the feedback is most beneficial. To resolve these discrepancies, we isolate and quantify the effect of feedback frequency in robotic tasks with continuous state and action spaces. The experiments encompass inverse kinematics learning for robotic manipulator arms of different complexity. We show that seemingly contradictory reported phenomena occur at different complexity levels. Furthermore, our results suggest that no single ideal feedback frequency exists; rather, the feedback frequency should be changed as the agent’s proficiency in the task increases.
KW  - Guided exploration
KW  - Human-aligned reinforcement learning
KW  - Interactive reinforcement learning
KW  - Intrinsic feedback homology
UR  - http://www.scopus.com/inward/record.url?scp=85143315822&partnerID=8YFLogxK
U2  - 10.1007/s00521-022-07949-0
DO  - 10.1007/s00521-022-07949-0
M3  - Article
VL  - 35
SP  - 16931
EP  - 16943
JO  - Neural Computing and Applications
JF  - Neural Computing and Applications
SN  - 0941-0643
IS  - 23
ER  -

Research@Leibniz University

By the same authors

Cognitive inspired aspects of robot learning

Visuo-haptic object perception for robots: an overview

Survey on reinforcement learning for language processing

Continual Domain Randomization

Optimizing BioTac Simulation for Realistic Tactile Perception
