Why did the AI make that decision? Towards an explainable artificial intelligence (XAI) for autonomous driving systems

Dong, Jiqian; Chen, Sikai; Miralinaghi, Mohammad; Chen, Tiantian; Li, Pei; Labi, Samuel

doi:10.1016/j.trc.2023.104358

Journal Article

Citation	Dong J, Chen S, Miralinaghi M, Chen T, Li P, Labi S. Transp. Res. C Emerg. Technol. 2023; 156: e104358.
Copyright	(Copyright © 2023, Elsevier Publishing)
DOI	10.1016/j.trc.2023.104358
PMID	unavailable
Abstract	User trust has been identified as pivotal to the success of autonomous vehicle (AV) operations, where artificial intelligence (AI) is widely adopted. For such integrated AI-based driving systems, one promising way of building user trust is through explainable artificial intelligence (XAI), which requires the AI system to provide the user with the explanation behind each decision it makes. Motivated by both the need to enhance user trust and the promise of novel XAI technology in addressing that need, this paper seeks to enhance trustworthiness in autonomous driving systems through the development of explainable Deep Learning (DL) models. First, the paper casts the decision-making process of the AV system not as a classification task (the traditional formulation) but as an image-based language generation (image captioning) task. The proposed approach therefore makes driving decisions by first generating textual descriptions of the driving scenarios, which serve as explanations that humans can understand. To this end, a novel multi-modal DL architecture is proposed to jointly model the correlation between an image (the driving scenario) and language (the descriptions). It adopts a fully Transformer-based structure and therefore has the potential to perform global attention and to effectively imitate the learning processes of human drivers. The results suggest that the proposed model generates legal and meaningful sentences to describe a given driving scenario and subsequently generates appropriate driving decisions. It is also observed that the proposed model significantly outperforms multiple baseline models in generating both explanations and driving actions. From the end user's perspective, the proposed model can enhance user trust because it provides the rationale behind an AV's actions. From the AV developer's perspective, the explanations from this explainable system could serve as a "debugging" tool to detect potential weaknesses in the existing system and identify specific directions for improvement.
Language	en
Keywords	Autonomous driving; Computer vision; End-to-end transformer; Explainable AI (XAI); User trust; Visual attention
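
Note on "global attention": the abstract does not give architectural details, so the following is only a minimal sketch of generic scaled dot-product attention, the standard building block of Transformers, and not the authors' model. It shows how one query (for example, the embedding of a word being generated) can weight every image patch at once. The function names, the toy patch features, and the 2-dimensional embeddings are all hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def global_attention(query, keys, values):
    """Scaled dot-product attention of one query over ALL keys.

    query  : list[float], e.g. embedding of the word being generated
    keys   : list[list[float]], one vector per image patch (hypothetical)
    values : list[list[float]], one vector per image patch (hypothetical)
    Returns (context_vector, attention_weights).
    """
    scale = math.sqrt(len(query))
    # Similarity between the query and every patch, not just nearby ones.
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    # Context vector: attention-weighted average of the patch values.
    context = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return context, weights


# Toy, made-up features: three image patches with 2-D embeddings.
patch_keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
patch_values = [[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]]
word_query = [2.0, 0.0]

context, weights = global_attention(word_query, patch_keys, patch_values)
```

In this toy setup the weights sum to 1, and the first and third patches receive equal, larger weights than the second because their keys align with the query. Weights of this kind are what a visual-attention explanation would typically inspect.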