Fear Field: Adaptive constraints for safe environment transitions in Shielded Reinforcement Learning

Odriozola Olalde, Haritz; Arana-Arexolaleiba, Nestor

dc.rights.license	Attribution 4.0 International	*
dc.contributor.author	Odriozola Olalde, Haritz
dc.contributor.author	Arana-Arexolaleiba, Nestor
dc.contributor.other	Zamalloa, Maider
dc.contributor.other	Perez-Cerrolaza, Jon
dc.contributor.other	Arozamena-Rodríguez, Jokin
dc.date.accessioned	2024-03-21T13:35:22Z
dc.date.available	2024-03-21T13:35:22Z
dc.date.issued	2023
dc.identifier.issn	1613-0073	en
dc.identifier.other	https://katalogoa.mondragon.edu/janium-bin/janium_login_opac.pl?find&ficha_no=174296	en
dc.identifier.uri	https://hdl.handle.net/20.500.11984/6301
dc.description.abstract	Shielding methods for Reinforcement Learning agents show potential for safety-critical industrial applications. However, they still lack robustness on nominal safety, a key property for safety control systems. When the environment dynamics change significantly, shielding methods cannot guarantee safety until their inherent dynamics model is updated to the new scenario. The agent could reach risky states because the outdated model predicts poorly. These situations could lead to catastrophic outcomes, such as damage to the cyber-physical system or loss of human lives, which are unacceptable in safety-critical applications. The novel method presented in this paper, Fear Field, replicates human behaviour in those scenarios, adapting safety constraints whenever a drastic environmental change is introduced. Fear Field reduces safety violations by one order of magnitude compared to an RL agent implementing only a shield.	en
dc.language.iso	eng	en
dc.publisher	CEUR-WS.org	en
dc.rights	© 2023 The Authors	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject	Reinforcement Learning	en
dc.subject	Shielding	en
dc.subject	Adaptive constraints	en
dc.subject	Robustness	en
dc.subject	Safe AI	en
dc.subject	SDG 9 Industry, Innovation and Infrastructure
dc.title	Fear Field: Adaptive constraints for safe environment transitions in Shielded Reinforcement Learning	en
dc.type	http://purl.org/coar/resource_type/c_c94f
dcterms.accessRights	http://purl.org/coar/access_right/c_abf2	en
dcterms.source	Proceedings of the IJCAI-23 Joint Workshop on Artificial Intelligence Safety and Safe Reinforcement Learning (AISafety-SafeRL), co-located with the 32nd International Joint Conference on Artificial Intelligence (IJCAI2023)	en
local.contributor.group	Robótica y automatización	es
local.description.peerreviewed	true	en
local.contributor.otherinstitution	Ikerlan	es
local.source.details	Macao, June, 2023	en
oaire.format.mimetype	application/pdf	en
oaire.file	$DSPACE\assetstore	en
oaire.resourceType	http://purl.org/coar/resource_type/c_c94f	en
oaire.version	http://purl.org/coar/version/c_970fb48d4fbd8a85	en