We address a multi-class classification problem on a tabular dataset comprising both continuous and categorical features, as commonly found in financial and actuarial domains. To achieve robust and reliable predictions, we propose a credal classifier built from an ensemble of augmented probabilistic models, each capable of self-assessing its confidence in the predicted class distribution for a given instance. Each base learner in the ensemble is implemented as a Confidence-Aware TabTransformer: a neural architecture that combines a sequence of transformer layers to process categorical variables, followed by a multi-layer perceptron for classification. In parallel, a dedicated confidence head outputs the model's self-assessed reliability score. For ensemble aggregation, we explore multiple confidence-aware combination rules, applied after delta-quantile filtering step to remove low-discordant outlier distribution. We demonstrate the effectiveness of this methodology on a fraud detection task, which is a representative binary classification problem with severe class imbalance.
Credal Classification through an Ensemble of Confidence-Aware TabTransformers and its Application to Fraud Detection / Galiani, Stefano; Petturiti, Davide; Vantaggi, Barbara. - In: INTERNATIONAL JOURNAL OF UNCERTAINTY, FUZZINESS AND KNOWLEDGE BASED SYSTEMS. - ISSN 0218-4885. - 33:7(2025). [10.1142/S0218488525400136]
Credal Classification through an Ensemble of Confidence-Aware TabTransformers and its Application to Fraud Detection
Stefano Galiani
;Davide Petturiti;Barbara Vantaggi
2025
Abstract
We address a multi-class classification problem on a tabular dataset comprising both continuous and categorical features, as commonly found in financial and actuarial domains. To achieve robust and reliable predictions, we propose a credal classifier built from an ensemble of augmented probabilistic models, each capable of self-assessing its confidence in the predicted class distribution for a given instance. Each base learner in the ensemble is implemented as a Confidence-Aware TabTransformer: a neural architecture that combines a sequence of transformer layers to process categorical variables, followed by a multi-layer perceptron for classification. In parallel, a dedicated confidence head outputs the model's self-assessed reliability score. For ensemble aggregation, we explore multiple confidence-aware combination rules, applied after delta-quantile filtering step to remove low-discordant outlier distribution. We demonstrate the effectiveness of this methodology on a fraud detection task, which is a representative binary classification problem with severe class imbalance.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


