The surge in the volume and complexity of user-generated content (UGC) and data on digital tourism platforms has raise both opportunities and challenges for its automated analysis. Advanced topic modeling techniques are now necessitated to cater the variety, dynamism, and multifaceted nature of this data, yet their application in digital tourism encounters unique challenges. This study comprehensively reviews prominent and emerging topic models from the categories of probabilistic models, matrix factorization-based models, and neural embedding-based models, describing their systemic architectures and operational mechanisms. In the application context of digital tourism, the study follows an experimental evaluation of the models’ performance on five datasets, across multiple coherence and diversity parameters. Results do not reveal optimality of a single model universally; rather, a model’s effectiveness depends on size and structure characteristics of the data as extensively analyzed in this article. Additionally, the study presents quantitative and qualitative findings, implicit shortcomings along with conclusive deductions, digital tourism application related open issues of topic models, followed by future directions of research.

Exploring Digital Tourism Through Topic Models: A Review and Experimental Study / Kamal, M.; Romani, G.; Ricciuti, G.; Anagnostopoulos, A.; Chatzigiannakis, I.. - In: JOURNAL OF DATA SCIENCE AND INTELLIGENT SYSTEMS. - ISSN 2972-3841. - 4:2(2026), pp. 137-154. [10.47852/bonviewJDSIS62024472]

Exploring Digital Tourism Through Topic Models: A Review and Experimental Study

Kamal M.;Ricciuti G.;Anagnostopoulos A.;Chatzigiannakis I.
2026

Abstract

The surge in the volume and complexity of user-generated content (UGC) and data on digital tourism platforms has raise both opportunities and challenges for its automated analysis. Advanced topic modeling techniques are now necessitated to cater the variety, dynamism, and multifaceted nature of this data, yet their application in digital tourism encounters unique challenges. This study comprehensively reviews prominent and emerging topic models from the categories of probabilistic models, matrix factorization-based models, and neural embedding-based models, describing their systemic architectures and operational mechanisms. In the application context of digital tourism, the study follows an experimental evaluation of the models’ performance on five datasets, across multiple coherence and diversity parameters. Results do not reveal optimality of a single model universally; rather, a model’s effectiveness depends on size and structure characteristics of the data as extensively analyzed in this article. Additionally, the study presents quantitative and qualitative findings, implicit shortcomings along with conclusive deductions, digital tourism application related open issues of topic models, followed by future directions of research.
2026
comparative analysis; digital tourism; text-mining; topic modeling
01 Pubblicazione su rivista::01a Articolo in rivista
Exploring Digital Tourism Through Topic Models: A Review and Experimental Study / Kamal, M.; Romani, G.; Ricciuti, G.; Anagnostopoulos, A.; Chatzigiannakis, I.. - In: JOURNAL OF DATA SCIENCE AND INTELLIGENT SYSTEMS. - ISSN 2972-3841. - 4:2(2026), pp. 137-154. [10.47852/bonviewJDSIS62024472]
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1768016
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact