Shielded Reinforcement Learning: A review of reactive methods for safe learning

Arana-Arexolaleiba, Nestor

Ver/Abrir

Shielded_Reinforcement_Learning__A_review_of_reactive_methods_for_safe_learning.pdf (571.2Kb)

Registro completo

Impacto

Guarda la referencia

Título

Shielded Reinforcement Learning: A review of reactive methods for safe learning

Otras instituciones

https://ror.org/03hp1m080
Mondragon Unibertsitatea

Versión

Postprint

Tipo de documento

Contribución a congreso

Fin de la fecha de embargo

2025-02-15

Idioma

Inglés

Derechos

Acceso

Acceso embargado

URI

https://hdl.handle.net/20.500.11984/6031

Versión de la editorial

https://doi.org/10.1109/SII55687.2023.10039301

Publicado en

2023 IEEE/SICE International Symposium on System Integrations (SII) Atlanta. 17-20 January,

Editorial

IEEE

Palabras clave

Reinforcement learning
System integration
Control systems
3D printing ... [+]

Reinforcement learning
System integration
Control systems
3D printing
Robot sensing systems
Robustness
Safety [-]

Resumen

Reinforcement Learning (RL) algorithms are showing promising results in simulated environments, but their replication in real physical applications, even more so in safety-critical applications, is not yet guaranteed. Ensuring the functional safety of RL algorithms is not a trivial task since the physical integrity of the target system, also called environment, especially when there is interaction with humans, may depend on it. Among the methods recently developed with the objective of guaranteeing safety, Shielded Reinforcement Learning is defined, which defines an interaction mechanism if the action event proposed by the agent causes a non-safe state. This article provides an overview of the different Shielding Reinforcement Learning approaches. In addition to summarising the techniques used by each of them, their advantages and disadvantages are discussed. Finally, the shortcomings associated with Shielded Reinforcement Learning methods that can lead to risk or unsafe situations are discussed. [-]