Title
Design and evaluation of a voice-controlled elevator system to improve safety and accessibility
Author (from another institution)
Other institutions
Vicomtech
Version
Postprint
Rights
© 2024 The Authors
Access
Open access
Publisher's version
https://doi.org/10.1109/OJIES.2024.3483552
Published in
IEEE Open Journal of the Industrial Electronics Society Early Access
Publisher
IEEE
Keywords
Speech recognition
Embedded systems
Human machine interaction
SDG 9 Industry, innovation and infrastructure
SDG 10 Reduced inequalities
Subject (UNESCO Thesaurus)
http://vocabularies.unesco.org/thesaurus/mt5.40
UNESCO classification
Computer technology
Abstract
This work introduces the design and assessment of a voice-controlled elevator system aimed at facilitating touchless interaction between users and hardware, thereby minimising contact and improving accessibility for individuals with disabilities. The research distinguishes three distinct deployment scenarios – on cloud, on edge and embedded – with the ultimate goal of integrating the entire system into a low-resource environment on a custom carrier board. An objective evaluation measured acoustic conditions rigorously using a dataset of 2900 audio files recorded inside a laboratory elevator cabin featuring two internal coatings and five audio input devices, under four distinct noise conditions. The study evaluated the performance of two Automatic Speech Recognition systems: Google's Speech-to-Text API and a Kaldi model adapted for this task, deployed using Vosk. Additionally, latency times for these transcribers and two communication protocols were measured to enhance efficiency. Finally, two subjective evaluations, under clean and noisy conditions, were conducted simulating a real-world scenario. The results, yielding 84.7 and 77.2 points respectively in a System Usability Scale questionnaire, affirm the reliability of the presented prototype for industrial deployment.
Funder
Gobierno Vasco (Basque Government)
Programme
Elkartek 2021
Number
KK-2021-00038
Grant URI
No information
Project
Investigación en tecnologías de reconocimiento de voz para la interacción máquina-usuario (IVOZ)
Collections
- Articles - Engineering [684]
The item has the following license files associated: