Details
Original language | English |
---|---|
Publication number (official file number) | US2021008718 |
IPC | G05B 17/02 A I |
Priority date | 12 July 2019 |
Publication status | Published - 14 Jan. 2021 |
Abstract
A method for producing a strategy for a robot. The method includes the following steps: initializing the strategy and an episode length; repeated execution of the loop including the following steps: producing a plurality of further strategies as a function of the strategy; applying the plurality of the further strategies for the length of the episode length; ascertaining respectively a cumulative reward, which is obtained in the application of the respective further strategy; updating the strategy as a function of a second plurality of the further strategies that obtained the greatest cumulative rewards. After each execution of the loop, the episode length is increased. A computer program, a device for carrying out the method, and a machine-readable memory element on which the computer program is stored, are also described.
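The loop described in the abstract can be sketched roughly as follows. This is only an illustrative reading, not the patented implementation. The toy task in `rollout`, the Gaussian perturbation, the plain averaging of the best candidates, the additive growth of the episode length, and all parameter names and default values are assumptions that the abstract does not specify.

```python
import numpy as np


def rollout(theta, episode_length):
    """Hypothetical toy task: steer a 1-D point from 1.0 toward the origin.

    Returns the cumulative reward collected over `episode_length` steps.
    """
    state = 1.0
    total_reward = 0.0
    for _ in range(episode_length):
        # Assumed policy: a single proportional gain, with a bounded action.
        action = float(np.clip(theta[0] * state, -1.0, 1.0))
        state += 0.1 * action
        total_reward += -abs(state)  # being closer to the origin is better
    return total_reward


def train(n_iterations=30, population=20, n_elite=5, sigma=0.1,
          initial_length=5, length_increment=5, seed=0):
    rng = np.random.default_rng(seed)
    theta = np.zeros(1)               # initialize the strategy
    episode_length = initial_length   # initialize the episode length

    for _ in range(n_iterations):
        # Produce further strategies as a function of the current strategy
        # (assumption: additive Gaussian noise).
        noise = rng.standard_normal((population, theta.size))
        candidates = theta + sigma * noise

        # Apply each candidate for the current episode length and
        # ascertain its cumulative reward.
        rewards = np.array([rollout(c, episode_length) for c in candidates])

        # Update the strategy from the candidates with the greatest rewards
        # (assumption: unweighted mean of the top n_elite candidates).
        elite = candidates[np.argsort(rewards)[-n_elite:]]
        theta = elite.mean(axis=0)

        # Increase the episode length after each execution of the loop
        # (assumption: a fixed additive increment).
        episode_length += length_increment

    return theta, episode_length


if __name__ == "__main__":
    theta, final_length = train()
    print("learned gain:", theta, "final episode length:", final_length)
```

Starting with short episodes keeps early evaluations cheap while the strategy is still poor. Later iterations evaluate candidates over longer horizons. The increment schedule shown here is one simple choice among many.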
Cite
Patent No.: US2021008718. Jan. 14, 2021.
Publication: Intellectual property right/Patent › Patent
TY - PAT
T1 - METHOD, DEVICE AND COMPUTER PROGRAM FOR PRODUCING A STRATEGY FOR A ROBOT
AU - Hutter, Frank
AU - Fuks, Lior
AU - Lindauer, Marius
AU - Awad, Noor
PY - 2021/1/14
Y1 - 2021/1/14
M3 - Patent
M1 - US2021008718
ER -