Details
Original language | English |
---|---|
Patent number | US2021008718 |
IPC | G05B 17/02 A I |
Priority date | 12 Jul 2019 |
Publication status | Published - 14 Jan 2021 |
Abstract
A method for producing a strategy for a robot. The method includes the following steps: initializing the strategy and an episode length; and repeatedly executing a loop that includes the following steps: producing a plurality of further strategies as a function of the strategy; applying each of the further strategies for the episode length; ascertaining, for each further strategy, the cumulative reward obtained when that strategy is applied; and updating the strategy as a function of a second plurality of the further strategies that obtained the greatest cumulative rewards. After each execution of the loop, the episode length is increased. A computer program, a device for carrying out the method, and a machine-readable memory element on which the computer program is stored are also described.
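The loop described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how further strategies are produced or how the update is computed, so Gaussian perturbation and averaging of the best candidates are assumptions here. The toy `rollout` environment, the function name `produce_strategy`, and all parameter names and default values are likewise hypothetical.

```python
import numpy as np


def rollout(theta, episode_length):
    """Toy stand-in for applying a strategy to a robot (an assumption).

    A scalar state starts at 1.0, the strategy is a single feedback gain,
    and each step is rewarded with minus the squared state.
    """
    x = 1.0
    cumulative_reward = 0.0
    for _ in range(episode_length):
        action = theta[0] * x
        x = x + 0.1 * action
        cumulative_reward += -x * x
    return cumulative_reward


def produce_strategy(n_iterations=30, n_candidates=20, n_elite=5,
                     initial_episode_length=5, length_increment=2,
                     sigma=0.1, seed=0):
    rng = np.random.default_rng(seed)
    theta = np.zeros(1)                       # initialize the strategy
    episode_length = initial_episode_length   # initialize the episode length

    for _ in range(n_iterations):
        # Produce further strategies as a function of the current strategy
        # (Gaussian perturbation is an assumed choice).
        noise = rng.standard_normal((n_candidates, theta.size))
        candidates = theta + sigma * noise

        # Apply each further strategy for the current episode length and
        # record its cumulative reward.
        rewards = np.array([rollout(c, episode_length) for c in candidates])

        # Update the strategy from the candidates with the greatest
        # cumulative rewards (plain averaging is an assumed choice).
        elite = candidates[np.argsort(rewards)[-n_elite:]]
        theta = elite.mean(axis=0)

        # Increase the episode length after each execution of the loop.
        episode_length += length_increment

    return theta, episode_length


if __name__ == "__main__":
    theta, final_length = produce_strategy()
    print("strategy parameters:", theta)
    print("final episode length:", final_length)
```

In this sketch the early iterations use short, cheap episodes and later iterations evaluate candidates over longer horizons; the fixed additive increment is one possible schedule among many.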
Cite this
Patent No.: US2021008718. Jan 14, 2021.
Research output: Patent
TY - PAT
T1 - METHOD, DEVICE AND COMPUTER PROGRAM FOR PRODUCING A STRATEGY FOR A ROBOT
AU - Hutter, Frank
AU - Fuks, Lior
AU - Lindauer, Marius
AU - Awad, Noor
PY - 2021/1/14
Y1 - 2021/1/14
M3 - Patent
M1 - US2021008718
ER -