The Mallows model(MM) occupies a central role in parametric modelling of ranking data to learn preferences of a population of judges. Despite the wide range of metrics for rankings that can be considered in the model specification, the choice is typically limited to the Kendall, Cayley or Hamming distances, due to the closed-form expression of the related model normalizing constant. This work instead focuses on the Mallows model with Spearman distance (MMS). A novel approximation of the normalizing constant is introduced to allow inference even with a large number of items. This allows us to develop and implement an efficient and accurate EM algorithm for estimating finite mixtures of MMS aimed at i) enlarging the applicability to samples drawn from heterogeneous populations, and ii) dealing with partial rankings affected by diverse forms of censoring. These novelties encompass the critical inferential steps that traditionally limited the use of this distance in practice, and render the MMS comparable (or even preferable) to the MMs with other metrics in terms of computational burden. The inferential ability of the EM scheme and the effectiveness of the approximation are assessed by extensive simulation studies. Finally, we show that the application to three real-world datasets endorses our proposals also in the comparison with competing mixtures of ranking models.

Efficient and accurate inference for mixtures of Mallows models with Spearman distance / Crispino, Marta; Mollica, Cristina; Astuti, Valerio; Tardella, Luca. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 33:98(2023), pp. 1-15. [10.1007/s11222-023-10266-8]

Efficient and accurate inference for mixtures of Mallows models with Spearman distance

Cristina Mollica
;
Valerio Astuti;Luca Tardella
2023

Abstract

The Mallows model(MM) occupies a central role in parametric modelling of ranking data to learn preferences of a population of judges. Despite the wide range of metrics for rankings that can be considered in the model specification, the choice is typically limited to the Kendall, Cayley or Hamming distances, due to the closed-form expression of the related model normalizing constant. This work instead focuses on the Mallows model with Spearman distance (MMS). A novel approximation of the normalizing constant is introduced to allow inference even with a large number of items. This allows us to develop and implement an efficient and accurate EM algorithm for estimating finite mixtures of MMS aimed at i) enlarging the applicability to samples drawn from heterogeneous populations, and ii) dealing with partial rankings affected by diverse forms of censoring. These novelties encompass the critical inferential steps that traditionally limited the use of this distance in practice, and render the MMS comparable (or even preferable) to the MMs with other metrics in terms of computational burden. The inferential ability of the EM scheme and the effectiveness of the approximation are assessed by extensive simulation studies. Finally, we show that the application to three real-world datasets endorses our proposals also in the comparison with competing mixtures of ranking models.
2023
ranking data; distance-based models; model-based clustering; EM algorithm; censoring
01 Pubblicazione su rivista::01a Articolo in rivista
Efficient and accurate inference for mixtures of Mallows models with Spearman distance / Crispino, Marta; Mollica, Cristina; Astuti, Valerio; Tardella, Luca. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 33:98(2023), pp. 1-15. [10.1007/s11222-023-10266-8]
File allegati a questo prodotto
File Dimensione Formato  
Crispino_Efficient-and-accurate_2023.pdf

solo gestori archivio

Note: Full-text access to a view-only version of the paper as a part of the Springer Nature Content Sharing Initiative
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 844.33 kB
Formato Adobe PDF
844.33 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1684686
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact