Abstract
For many use-cases, it is often important to explain the prediction of a black-box model by identifying the most influential training data samples. Existing approaches lack customization for user intent and often provide a homogeneous set of explanation samples, failing to reveal the model's reasoning from different angles. In this paper, we propose AIDE, an approach for providing antithetical (i.e., contrastive), intent-based, diverse explanations for opaque and complex models. AIDE distinguishes three types of explainability intents: interpreting a correct, investigating a wrong, and clarifying an ambiguous prediction. For each intent, AIDE selects an appropriate set of influential training samples that support or oppose the prediction either directly or by contrast. To provide a succinct summary, AIDE uses diversity-aware sampling to avoid redundancy and increase coverage of the training data. We demonstrate the effectiveness of AIDE on image and text classification tasks, in three ways: quantitatively, assessing correctness and continuity; qualitatively, comparing anecdotal evidence from AIDE and other example-based approaches; and via a user study, evaluating multiple aspects of AIDE. The results show that AIDE addresses the limitations of existing methods and exhibits desirable traits for an explainability method.
Original language | English |
---|---|
Title of host publication | Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) |
Number of pages | 12 |
Volume | 7 |
Publisher | AAAI Press Association for the Advancement of Artificial Intelligence |
Publication date | 16 Oct 2024 |
ISBN (Print) | ISBN-10 1-57735-892-9 |
ISBN (Electronic) | ISBN-13 978-1-57735-892-3 |
DOIs | |
Publication status | Published - 16 Oct 2024 |
Event | Seventh AAAI/ACM Conference on AI, Ethics, and Society - San Jose McEnery, San Hose, United States Duration: 21 Oct 2024 → 23 Oct 2024 https://www.aies-conference.com/2024/ |
Conference
Conference | Seventh AAAI/ACM Conference on AI, Ethics, and Society |
---|---|
Location | San Jose McEnery |
Country/Territory | United States |
City | San Hose |
Period | 21/10/2024 → 23/10/2024 |
Internet address |
Keywords
- Explainable AI