Design and evaluation of a voice-controlled elevator system to improve safety and accessibility

Olaizola, Jon; Mendicute, Mikel

Title

Design and evaluation of a voice-controlled elevator system to improve safety and accessibility

Other institutions

Mondragon Unibertsitatea
https://ror.org/0023sah13

Version

Postprint

Document type

Article

Language

English

Rights

Access

Open access

URI

https://hdl.handle.net/20.500.11984/6776

Publisher's version

https://doi.org/10.1109/OJIES.2024.3483552

Published in

IEEE Open Journal of the Industrial Electronics Society Early Access

Publisher

IEEE

Keywords

Speech recognition
Embedded systems
Human machine interaction
SDG 9 Industry, innovation and infrastructure
SDG 10 Reduced inequalities

Subject (UNESCO Thesaurus)

http://vocabularies.unesco.org/thesaurus/mt5.40

UNESCO classification

Computer technology

Abstract

This work presents the design and assessment of a voice-controlled elevator system that enables touchless interaction between users and hardware, thereby minimising contact and improving accessibility for individuals with disabilities. The research distinguishes three deployment scenarios (cloud, edge, and embedded), with the ultimate goal of integrating the entire system into a low-resource environment on a custom carrier board. An objective evaluation rigorously measured acoustic conditions using a dataset of 2900 audio files recorded inside a laboratory elevator cabin with two internal coatings, five audio input devices, and four distinct noise conditions. The study evaluated the performance of two Automatic Speech Recognition systems: Google's Speech-to-Text API and a Kaldi model adapted for this task, deployed using Vosk. Additionally, latency was measured for these transcribers and for two communication protocols to enhance efficiency. Finally, two subjective evaluations, one under clean and one under noisy conditions, were conducted simulating a real-world scenario. The resulting System Usability Scale scores of 84.7 and 77.2 points, respectively, affirm the reliability of the presented prototype for industrial deployment.
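
For readers unfamiliar with the System Usability Scale, the following is a minimal sketch of the standard SUS scoring rule (ten items rated 1 to 5, odd items scored as rating minus 1, even items as 5 minus rating, sum multiplied by 2.5). The function names and the sample responses are hypothetical illustrations; they are not data or code from the study.

```python
from statistics import mean


def sus_score(responses):
    """Return the SUS score (0-100) for one participant.

    `responses` holds the ten questionnaire ratings in order, each 1..5.
    """
    if len(responses) != 10:
        raise ValueError("SUS requires exactly 10 item responses")
    if any(r not in (1, 2, 3, 4, 5) for r in responses):
        raise ValueError("each response must be an integer from 1 to 5")

    total = 0
    for item, rating in enumerate(responses, start=1):
        if item % 2 == 1:
            # Odd-numbered items are positively worded.
            total += rating - 1
        else:
            # Even-numbered items are negatively worded.
            total += 5 - rating
    return total * 2.5


def mean_sus(all_responses):
    """Average the per-participant SUS scores of a group."""
    return mean(sus_score(r) for r in all_responses)


if __name__ == "__main__":
    # Hypothetical participants, for illustration only.
    participants = [
        [5, 1, 5, 1, 5, 1, 5, 1, 5, 1],
        [3, 3, 3, 3, 3, 3, 3, 3, 3, 3],
    ]
    print(mean_sus(participants))
```

A group-level figure such as those reported in the abstract would typically be the mean of the per-participant scores, which is what the hypothetical `mean_sus` helper computes.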

Funder

Gobierno Vasco

Programme

Elkartek 2021

Grant number

KK-2021-00038

Grant URI

No information

Project

Investigación en tecnologías de reconocimiento de voz para la interacción máquina-usuario (IVOZ)

Collections

Articles - Engineering [894]

This item has the following license files associated with it:

Creative Commons

Except where otherwise noted, this item's license is described as Attribution 4.0 International

eBiltegia