| dc.contributor.author | Ayerdi, Jon Joseba | |
| dc.contributor.author | Iriarte, Asier | |
| dc.contributor.author | Valle Entrena, Pablo | |
| dc.contributor.author | Roman Txopitea, Ibai | |
| dc.contributor.author | Illarramendi, Miren | |
| dc.contributor.author | Arrieta, Aitor | |
| dc.date.accessioned | 2026-06-05T15:08:34Z | |
| dc.date.available | 2026-06-05T15:08:34Z | |
| dc.date.issued | 2024 | |
| dc.identifier.issn | 1557-7392 | en |
| dc.identifier.other | https://katalogoa.mondragon.edu/janium-bin/janium_login_opac.pl?find&ficha_no=179959 | en |
| dc.identifier.uri | https://hdl.handle.net/20.500.11984/14502 | |
| dc.description.abstract | Autonomous driving systems (ADSs) are complex cyber-physical systems (CPSs) that must ensure safety even in uncertain conditions. Modern ADSs often employ deep neural networks (DNNs), which may not produce correct results in every possible driving scenario. Thus, an approach to estimate the confidence of an ADS at runtime is necessary to prevent potentially dangerous situations. In this article we propose MarMot, an online monitoring approach for ADSs based on metamorphic relations (MRs), which are properties of a system that hold among multiple inputs and the corresponding outputs. Using domain-specific MRs, MarMot estimates the uncertainty of the ADS at runtime, allowing the identification of anomalous situations that are likely to cause a faulty behavior of the ADS, such as driving off the road.
We perform an empirical assessment of MarMot with five different MRs, using two different subject ADSs, including a small-scale physical ADS and a simulated ADS. Our evaluation encompasses the identification of both external anomalies, e.g., fog, as well as internal anomalies, e.g., faulty DNNs due to mislabeled training data. Our results show that MarMot can identify up to 65% of the external anomalies and 100% of the internal anomalies in the physical ADS, and up to 54% of the external anomalies and 88% of the internal anomalies in the simulated ADS. With these results, MarMot outperforms or is comparable to other state-of-the-art approaches, including SelfOracle, Ensemble, and MC Dropout-based ADS monitors. | es |
| dc.language.iso | eng | en |
| dc.publisher | ACM | en |
| dc.rights | © ACM | en |
| dc.subject | Software safety | en |
| dc.subject | Autonomous Driving System | en |
| dc.subject | Runtime Monitoring | en |
| dc.subject | Metamorphic Testing | en |
| dc.subject | Cyber-Physical Systems | en |
| dc.subject | Deep Neural Networks | en |
| dc.title | MarMot: Metamorphic Runtime Monitoring of Autonomous Driving Systems | en |
| dcterms.accessRights | http://purl.org/coar/access_right/c_abf2 | en |
| dcterms.source | ACM Transactions on Software Engineering and Methodology | en |
| local.contributor.group | Ingeniería del Software y Sistemas | es |
| local.description.peerreviewed | true | en |
| local.description.publicationfirstpage | 1 | en |
| local.description.publicationlastpage | 35 | en |
| local.identifier.doi | https://doi.org/10.1145/3678171 | en |
| local.source.details | Vol. 34 (1). N. art. 18. | en |
| oaire.format.mimetype | application/pdf | en |
| oaire.file | $DSPACE\assetstore | en |
| oaire.resourceType | http://purl.org/coar/resource_type/c_6501 | en |
| oaire.version | http://purl.org/coar/version/c_ab4af688f83e57aa | en |
| dc.unesco.tesauro | http://vocabularies.unesco.org/thesaurus/concept450 | en |
| oaire.funderName | Gobierno Vasco | en |
| oaire.funderIdentifier | https://ror.org/00pz2fp31 / http://data.crossref.org/fundingdata/funder/10.13039/501100003086 | en |
| oaire.fundingStream | Elkartek 2022 | en |
| oaire.fundingStream | Elkartek 2022 | en |
| oaire.fundingStream | Ikertalde Convocatoria 2022-2023 | en |
| oaire.awardNumber | KK-2022/00119 | en |
| oaire.awardNumber | KK-2022/00007 | en |
| oaire.awardNumber | IT1519-22 | en |
| oaire.awardTitle | Edge Technologies for Industrial Distributed AI Applications (EGIA) | en |
| oaire.awardTitle | Smart, robust, secure and ethical Industrial Systems for Industry 5.0: advanced paradigms for specification, design, evaluation and monitoring (SIIRSE) | en |
| oaire.awardTitle | Ingeniería de Software y Sistemas (IKERTALDE 2022-2023) | en |
| dc.unesco.clasificacion | http://skos.um.es/unesco6/120317 | en |