Novel automated interactive reinforcement learning framework with a constraint-based supervisor for procedural tasks

Elguea, Íñigo; Aguirre, Aitor; Izagirre, Unai; Inziarte Hidalgo, Ibai; Bogh, Simon; Arana-Arexolaleiba, Nestor

Title

Novel automated interactive reinforcement learning framework with a constraint-based supervisor for procedural tasks

Author

Elguea, Íñigo

Aguirre, Aitor

Izagirre, Unai

Inziarte Hidalgo, Ibai

Bogh, Simon

Arana-Arexolaleiba, Nestor

Research group

Data analysis and cybersecurity
Robotics and automation

Other institutions

Electrotecnica Alavesa S.L.
Mondragon Unibertsitatea
Montajes Mantenimiento y Automatismos Eléctricos Navarra
https://ror.org/04m5j1k67

Version

Published version

Document type

Article

Language

English

Rights

Access

Open access

URI

https://hdl.handle.net/20.500.11984/6898

Publisher's version

https://doi.org/10.1016/j.knosys.2024.112870

Published in

Knowledge-Based Systems, Vol. 309, Art. no. 112870, 30 January 2025

Publisher

Elsevier

Keywords

Automated supervisor
Contact-rich manipulation
Industrial manipulators
Interactive reinforcement learning
Sample efficiency
Procedural tasks

Subject (UNESCO Thesaurus)

Automation

UNESCO classification

Automation technology

Abstract

Learning to perform procedural motion or manipulation tasks in unstructured or uncertain environments poses significant challenges for intelligent agents. Although reinforcement learning algorithms have demonstrated positive results on simple tasks, the hard-to-engineer reward functions and the impractical amount of trial-and-error iterations these agents require in long-experience streams still present challenges for deployment in industrially relevant environments. In this regard, interactive reinforcement learning has emerged as a promising approach to mitigate these limitations, whereby a human supervisor provides evaluative or corrective feedback to the learning agent during training. However, the requirement of a human-in-the-loop approach throughout the learning process can be impractical for tasks that span several hours. This study aims to overcome this limitation by automating the learning process and substituting human feedback with an artificial supervisor grounded in constraint-based modeling techniques. In contrast to the logical constraints commonly used for conventional reinforcement learning, constraint-based modeling techniques offer enhanced adaptability in terms of conceptualizing and modeling the human knowledge of a task. This modeling capability allows an automated supervisor to acquire a closer approximation to human reasoning by dividing complex tasks into more manageable components and identifying the associated subtask and contextual cues in which the agent is involved. The supervisor then adjusts the evaluative and corrective feedback to suit the specific subtask under consideration. The framework was assessed using three actor-critic agents in a human–robot interaction environment, demonstrating a sample efficiency improvement of 50% and success rates of ...
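
The idea in the abstract, a supervisor that identifies the active subtask and then returns evaluative and corrective feedback suited to it, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not describe the actual constraint model, so every name here (`Constraint`, `Subtask`, `ConstraintSupervisor`, the one-dimensional "approach"/"insert" task, the thresholds and the +1/-1 feedback values) is a hypothetical assumption chosen only to make the structure concrete.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

State = Dict[str, float]  # hypothetical state: named scalar features


@dataclass
class Constraint:
    """A hypothetical task rule: a check plus a suggested correction."""
    name: str
    is_satisfied: Callable[[State, float], bool]  # (state, action) -> bool
    correction: Callable[[State], float]          # state -> suggested action


@dataclass
class Subtask:
    """A hypothetical component of a procedural task."""
    name: str
    is_active: Callable[[State], bool]  # contextual cue for this subtask
    constraints: List[Constraint]


class ConstraintSupervisor:
    """Stands in for the human supervisor (illustrative assumption only)."""

    def __init__(self, subtasks: List[Subtask]) -> None:
        self.subtasks = subtasks

    def identify_subtask(self, state: State) -> Optional[Subtask]:
        # Return the first subtask whose contextual cue matches the state.
        for subtask in self.subtasks:
            if subtask.is_active(state):
                return subtask
        return None

    def feedback(self, state: State, action: float) -> Tuple[float, Optional[float]]:
        """Return (evaluative feedback, corrective action or None)."""
        subtask = self.identify_subtask(state)
        if subtask is None:
            return 0.0, None  # no applicable knowledge: stay neutral
        for constraint in subtask.constraints:
            if not constraint.is_satisfied(state, action):
                # Violation: negative evaluation plus a corrective action.
                return -1.0, constraint.correction(state)
        return 1.0, None  # all constraints of the active subtask hold


# Toy 1-D task (assumed): the action is a velocity toward the target.
approach = Subtask(
    name="approach",
    is_active=lambda s: s["distance"] > 0.1,
    constraints=[Constraint("move_toward_target",
                            lambda s, a: a > 0.0,
                            lambda s: 0.5)],
)
insert = Subtask(
    name="insert",
    is_active=lambda s: s["distance"] <= 0.1,
    constraints=[Constraint("slow_contact",
                            lambda s, a: abs(a) <= 0.05,
                            lambda s: 0.05)],
)
supervisor = ConstraintSupervisor([approach, insert])

print(supervisor.feedback({"distance": 0.5}, -0.2))   # (-1.0, 0.5)
print(supervisor.feedback({"distance": 0.05}, 0.02))  # (1.0, None)
```

In this sketch the same action can be judged differently depending on the subtask: a velocity of 0.3 is acceptable while approaching but violates the slow-contact rule during insertion, which is the kind of subtask-specific feedback the abstract describes.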

Funder

Gobierno Vasco

Program

Ikertalde Convocatoria 2022-2025

Grant number

IT1676-22

Grant URI

No information

Project

Intelligent systems for industrial systems group (IKERTALDE 2022-2025)

Collections

Articles - Engineering [894]

The following license files are associated with this item:

Creative Commons

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International

eBiltegia