Abstract
Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the 'black box' nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certificates, which are learned functions over the system whose properties indirectly imply that an agent behaves as desired. However, NLB-based certificates are typically difficult to learn and even more difficult to verify, especially for complex systems. In this work, we present a novel method for training and verifying NLB-based certificates for discrete-time systems. Specifically, we introduce a technique for certificate composition, which simplifies the verification of highly complex systems by strategically designing a sequence of certificates. When jointly verified with neural network verification engines, these certificates provide a formal guarantee that a DRL agent both achieves its goals and avoids unsafe behavior. Furthermore, we introduce a technique for certificate filtering, which significantly simplifies the process of producing formally verified certificates. We demonstrate the merits of our approach with a case study on providing safety and liveness guarantees for a DRL-controlled spacecraft.
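To make the idea of a certificate "whose properties indirectly imply that an agent behaves as desired" more concrete, the following is a minimal sketch under stated assumptions, not the method of the paper. It uses a hypothetical one-dimensional closed-loop system, a hand-written quadratic certificate in place of a learned neural one, and a generic textbook-style set of reach-while-avoid conditions. All names, sets, and constants are illustrative. The check is sample-based only; a formal guarantee would require a verification engine that reasons over the whole state space.

```python
import random

# All names and numbers here are hypothetical, chosen only for illustration.
A = 0.8       # closed-loop dynamics x_{t+1} = A * x_t (controller folded in)
BETA = 1.0    # certificate level separating initial states from unsafe states
EPS = 0.003   # minimum required decrease per step outside the goal set


def step(x):
    """One step of the assumed discrete-time closed-loop system."""
    return A * x


def certificate(x):
    """Hand-written stand-in for a learned certificate function V(x)."""
    return x * x


def in_goal(x):
    return abs(x) <= 0.1


def in_initial(x):
    return abs(x) <= 1.0


def in_unsafe(x):
    return abs(x) >= 2.0


def find_violation(samples):
    """Return (condition_name, state) for the first violated condition, else None."""
    for x in samples:
        v = certificate(x)
        # Condition 1: initial states lie inside the BETA sublevel set.
        if in_initial(x) and v > BETA:
            return ("initial", x)
        # Condition 2: unsafe states lie strictly outside that sublevel set.
        if in_unsafe(x) and v <= BETA:
            return ("unsafe", x)
        # Condition 3: inside the sublevel set and outside the goal,
        # the certificate decreases by at least EPS at every step.
        if v <= BETA and not in_goal(x) and certificate(step(x)) > v - EPS:
            return ("decrease", x)
    return None


rng = random.Random(0)
samples = [rng.uniform(-3.0, 3.0) for _ in range(10_000)]
violation = find_violation(samples)
print("violation found:", violation)
```

If all three conditions held over the entire state space rather than only on samples, every trajectory starting in the initial set would stay below the level `BETA` (and so never enter the unsafe set) and would lose at least `EPS` of certificate value per step until it reaches the goal set.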
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 24th Conference on Formal Methods in Computer-Aided Design, FMCAD 2024 |
| Editors | Nina Narodytska, Philipp Rummer, Warren A. Hunt, Georg Weissenbacher |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 95-106 |
| Number of pages | 12 |
| Edition | 2024 |
| ISBN (Electronic) | 9783854480655 |
| State | Published - 2024 |
| Event | 24th Conference on Formal Methods in Computer-Aided Design, FMCAD 2024 - Prague, Czech Republic; Duration: 15 Oct 2024 → 18 Oct 2024 |
Conference
| Conference | 24th Conference on Formal Methods in Computer-Aided Design, FMCAD 2024 |
|---|---|
| Country/Territory | Czech Republic |
| City | Prague |
| Period | 15/10/24 → 18/10/24 |
Bibliographical note
Publisher Copyright: © 2024 FMCAD Association (and authors).
Title
Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates