Explainable AI seeks to unveil the intricacies of black box models through post-hoc strategies or self-interpretable models. In this paper, we tackle the problem of building layers that are intrinsically explainable through logical rules. In particular, we address current state-of-the-art methods' lack of fidelity and expressivity by introducing a transparent explainable logic layer (TELL). We propose to constrain a feed-forward layer with positive weights, which, combined with particular activation functions, offer the possibility of a direct translation into logic rules. Additionally, this approach overcomes the limitations of previous models, linked to their applicability to binary data only, by proposing a new way to automatically threshold real values and incorporate the obtained predicates into logical rules. We show that, compared to state-of-the-art, TELL achieves similar classification performances and, at the same time, provides higher explanatory power, measured by the agreement between models’ outputs and the activation of the logical explanations. In addition, TELL offers a broader spectrum of applications thanks to the possibility of its use on real data.
Transparent Explainable Logic Layers / Ragno, Alessio; Plantevit, Marc; Robardet, Celine; Capobianco, Roberto. - 392:(2024), pp. 914-921. (Intervento presentato al convegno European Conference on Artificial Intelligence (ECAI 2024) tenutosi a Santiago de Compostela; Spain) [10.3233/FAIA240579].
Transparent Explainable Logic Layers
Alessio Ragno
Primo
;Roberto CapobiancoUltimo
2024
Abstract
Explainable AI seeks to unveil the intricacies of black box models through post-hoc strategies or self-interpretable models. In this paper, we tackle the problem of building layers that are intrinsically explainable through logical rules. In particular, we address current state-of-the-art methods' lack of fidelity and expressivity by introducing a transparent explainable logic layer (TELL). We propose to constrain a feed-forward layer with positive weights, which, combined with particular activation functions, offer the possibility of a direct translation into logic rules. Additionally, this approach overcomes the limitations of previous models, linked to their applicability to binary data only, by proposing a new way to automatically threshold real values and incorporate the obtained predicates into logical rules. We show that, compared to state-of-the-art, TELL achieves similar classification performances and, at the same time, provides higher explanatory power, measured by the agreement between models’ outputs and the activation of the logical explanations. In addition, TELL offers a broader spectrum of applications thanks to the possibility of its use on real data.File | Dimensione | Formato | |
---|---|---|---|
Ragno_Transparent_2024.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
489.42 kB
Formato
Adobe PDF
|
489.42 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.