Título
Novel automated interactive reinforcement learning framework with a constraint-based supervisor for procedural tasksAutor-a
Fecha de publicación
2025Otras instituciones
Electrotecnica Alavesa S.L.Montajes Mantenimiento y Automatismos Eléctricos Navarra
Aalborg Universitet (Denmark)
Versión
Version publicadaTipo de documento
ArtículoArtículoIdioma
InglésDerechos
© 2024 The AuthorsAcceso
Acceso abiertoVersión de la editorial
https://doi.org/10.1016/j.knosys.2024.112870Publicado en
Knowledge-Based Systems Vol. 309. N. art. 112870. 30 January, 2025Editorial
ElsevierPalabras clave
Automated supervisor
Contact-rich manipulation
Industrial manipulators
Interactive reinforcement learning ... [+]
Contact-rich manipulation
Industrial manipulators
Interactive reinforcement learning ... [+]
Automated supervisor
Contact-rich manipulation
Industrial manipulators
Interactive reinforcement learning
Sample efficiency
Procedural tasks [-]
Contact-rich manipulation
Industrial manipulators
Interactive reinforcement learning
Sample efficiency
Procedural tasks [-]
Clasificación UNESCO
Tecnología de la automatizaciónResumen
Learning to perform procedural motion or manipulation tasks in unstructured or uncertain environments poses significant challenges for intelligent agents. Although reinforcement learning algorithms ha ... [+]
Learning to perform procedural motion or manipulation tasks in unstructured or uncertain environments poses significant challenges for intelligent agents. Although reinforcement learning algorithms have demonstrated positive results on simple tasks, the hard-to-engineer reward functions and the impractical amount of trial-and-error iterations these agents require in long-experience streams still present challenges for deployment in industrially relevant environments. In this regard, interactive reinforcement learning has emerged as a promising approach to mitigate these limitations, whereby a human supervisor provides evaluative or corrective feedback to the learning agent during training. However, the requirement of a human-in-the-loop approach throughout the learning process can be impractical for tasks that span several hours. This study aims to overcome this limitation by automating the learning process and substituting human feedback with an artificial supervisor grounded in constraint-based modeling techniques. In contrast to the logical constraints commonly used for conventional reinforcement learning, constraint-based modeling techniques offer enhanced adaptability in terms of conceptualizing and modeling the human knowledge of a task. This modeling capability allows an automated supervisor to acquire a closer approximation to human reasoning by dividing complex tasks into more manageable components and identifying the associated subtask and contextual cues in which the agent is involved. The supervisor then adjusts the evaluative and corrective feedback to suit the specific subtask under consideration. The framework was assessed using three actor-critic agents in a human–robot interaction environment, demonstrating a sample efficiency improvement of 50% and success rates of [-]
Colecciones
- Artículos - Ingeniería [758]
El ítem tiene asociados los siguientes ficheros de licencia:




















