Details
Originalsprache | undefiniert/unbekannt |
---|---|
Publikationsstatus | Elektronisch veröffentlicht (E-Pub) - 24 Mai 2024 |
Abstract
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
2024.
Publikation: Arbeitspapier/Preprint › Preprint
}
TY - UNPB
T1 - Model-free reinforcement learning with noisy actions for automated experimental control in optics
AU - Richtmann, Lea
AU - Schmiesing, Viktoria-S
AU - Wilken, Dennis
AU - Heine, Jan
AU - Tranter, Aaron
AU - Anand, Avishek
AU - Osborne, Tobias J.
AU - Heurs, Michèle
N1 - 10 pages + 10 pages appendices, 3 + 11 figures
PY - 2024/5/24
Y1 - 2024/5/24
N2 - Experimental control involves a lot of manual effort with non-trivial decisions for precise adjustments. Here, we study the automatic experimental alignment for coupling laser light into an optical fiber using reinforcement learning (RL). We face several real-world challenges, such as time-consuming training, partial observability, and noisy actions due to imprecision in the mirror steering motors. We show that we can overcome these challenges: To save time, we use a virtual testbed to tune our environment for dealing with partial observability and use relatively sample-efficient model-free RL algorithms like Soft Actor-Critic (SAC) or Truncated Quantile Critics (TQC). Furthermore, by fully training on the experiment, the agent learns directly to handle the noise present. In our extensive experimentation, we show that we are able to achieve 90% coupling, showcasing the effectiveness of our proposed approaches. We reach this efficiency, which is comparable to that of a human expert, without additional feedback loops despite the motors' inaccuracies. Our result is an example of the readiness of RL for real-world tasks. We consider RL a promising tool for reducing the workload in labs.
AB - Experimental control involves a lot of manual effort with non-trivial decisions for precise adjustments. Here, we study the automatic experimental alignment for coupling laser light into an optical fiber using reinforcement learning (RL). We face several real-world challenges, such as time-consuming training, partial observability, and noisy actions due to imprecision in the mirror steering motors. We show that we can overcome these challenges: To save time, we use a virtual testbed to tune our environment for dealing with partial observability and use relatively sample-efficient model-free RL algorithms like Soft Actor-Critic (SAC) or Truncated Quantile Critics (TQC). Furthermore, by fully training on the experiment, the agent learns directly to handle the noise present. In our extensive experimentation, we show that we are able to achieve 90% coupling, showcasing the effectiveness of our proposed approaches. We reach this efficiency, which is comparable to that of a human expert, without additional feedback loops despite the motors' inaccuracies. Our result is an example of the readiness of RL for real-world tasks. We consider RL a promising tool for reducing the workload in labs.
KW - cs.LG
KW - physics.optics
KW - J.2; I.2.1
M3 - Preprint
BT - Model-free reinforcement learning with noisy actions for automated experimental control in optics
ER -