This paper investigates the firm-level predictors of artificial intelligence (AI) innovation in Europe using a machine learning (ML) approach. Drawing on a matched dataset that combines patent information from OECD REGPAT with firm-level financial and structural indicators from ORBIS Intellectual Property, we model the probability that a firm filed at least one AI-related patent at the European Patent Office (EPO) in 2020. AI patenting is identified using the World Intellectual Property Organization (WIPO) taxonomy. At the same time, the predictor set includes sectoral dummies, innovation and technological capabilities, balance sheet indicators, and interaction terms, resulting in a high-dimensional feature space. We compare traditional econometric models (Probit) with ML classifiers, including LASSO, Elastic Net, and Random Forest. Results reveal that ML algorithms significantly outperform Probit in identifying AI innovators, with Random Forest achieving the highest sensitivity (0.815) and balanced accuracy (0.854). These improvements are particularly valuable given the small share of firms performing AI patenting in the sample (3.7%). Variable importance analysis highlights ICT specialisation, patent stock, firm size, and market concentration as key drivers of AI innovation. Moreover, several interaction terms—such as those involving size, sector, and innovation intensity—emerge as critical in improving classification performance. These findings suggest that heterogeneities and nonlinear factors should be considered when predicting AI patenting. From a policy perspective, our results highlight that ML could help improve the tailoring of innovation policies by identifying firms with transformative potential. Supporting digital infrastructure, facilitating the diffusion of innovation beyond ICTintensive hubs and tailoring strategies to firm size could make Europe's AI transformation more inclusive. Overall, this study demonstrates the analytical power of ML in uncovering complex innovation dynamics and providing actionable insights for scholars and policymakers.

Predicting AI patenting of European firms: A Machine Learning approach / Bloise, Francesco; Fiorelli, Cristiana; Meliciani, Valentina. - 4/2025:(2025).

Predicting AI patenting of European firms: A Machine Learning approach

Francesco Bloise;Cristiana Fiorelli;
2025

Abstract

This paper investigates the firm-level predictors of artificial intelligence (AI) innovation in Europe using a machine learning (ML) approach. Drawing on a matched dataset that combines patent information from OECD REGPAT with firm-level financial and structural indicators from ORBIS Intellectual Property, we model the probability that a firm filed at least one AI-related patent at the European Patent Office (EPO) in 2020. AI patenting is identified using the World Intellectual Property Organization (WIPO) taxonomy. At the same time, the predictor set includes sectoral dummies, innovation and technological capabilities, balance sheet indicators, and interaction terms, resulting in a high-dimensional feature space. We compare traditional econometric models (Probit) with ML classifiers, including LASSO, Elastic Net, and Random Forest. Results reveal that ML algorithms significantly outperform Probit in identifying AI innovators, with Random Forest achieving the highest sensitivity (0.815) and balanced accuracy (0.854). These improvements are particularly valuable given the small share of firms performing AI patenting in the sample (3.7%). Variable importance analysis highlights ICT specialisation, patent stock, firm size, and market concentration as key drivers of AI innovation. Moreover, several interaction terms—such as those involving size, sector, and innovation intensity—emerge as critical in improving classification performance. These findings suggest that heterogeneities and nonlinear factors should be considered when predicting AI patenting. From a policy perspective, our results highlight that ML could help improve the tailoring of innovation policies by identifying firms with transformative potential. Supporting digital infrastructure, facilitating the diffusion of innovation beyond ICTintensive hubs and tailoring strategies to firm size could make Europe's AI transformation more inclusive. Overall, this study demonstrates the analytical power of ML in uncovering complex innovation dynamics and providing actionable insights for scholars and policymakers.
2025
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1742109
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact