Recent years have witnessed the proliferation of offensive content online such as fake news, propaganda, misinformation, and disinformation. While initially this was mostly about textual content, over time images and videos gained popularity, as they are much easier to consume, attract more attention, and spread further than text. As a result, researchers started leveraging different modalities and combinations thereof to tackle online multimodal offensive content. In this study, we offer a survey on the state-of-the-art on multimodal disinformation detection covering various combinations of modalities: text, images, speech, video, social media network structure, and temporal information. Moreover, while some studies focused on factuality, others investigated how harmful the content is. While these two components in the definition of disinformation – (i) factuality, and (ii) harmfulness –, are equally important, they are typically studied in isolation. Thus, we argue for the need to tackle disinformation detection by taking into account multiple modalities as well as both factuality and harmfulness, in the same framework. Finally, we discuss current challenges and future research directions.

A Survey on Multimodal Disinformation Detection / Alam, Firoj; Cresci, Stefano; Chakraborty, Tanmoy; Silvestri, Fabrizio; Dimitrov, Dimiter; Da San Martino, Giovanni; Shaar, Shaden; Firooz, Hamed; Nakov, Preslav. - 29:1(2022), pp. 6625-6643. ( 29th International Conference on Computational Linguistics, COLING 2022 Gyeongju, Republic of Korea ).

A Survey on Multimodal Disinformation Detection

Fabrizio Silvestri
;
2022

Abstract

Recent years have witnessed the proliferation of offensive content online such as fake news, propaganda, misinformation, and disinformation. While initially this was mostly about textual content, over time images and videos gained popularity, as they are much easier to consume, attract more attention, and spread further than text. As a result, researchers started leveraging different modalities and combinations thereof to tackle online multimodal offensive content. In this study, we offer a survey on the state-of-the-art on multimodal disinformation detection covering various combinations of modalities: text, images, speech, video, social media network structure, and temporal information. Moreover, while some studies focused on factuality, others investigated how harmful the content is. While these two components in the definition of disinformation – (i) factuality, and (ii) harmfulness –, are equally important, they are typically studied in isolation. Thus, we argue for the need to tackle disinformation detection by taking into account multiple modalities as well as both factuality and harmfulness, in the same framework. Finally, we discuss current challenges and future research directions.
2022
29th International Conference on Computational Linguistics, COLING 2022
multimodal disinformation; transformer models
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
A Survey on Multimodal Disinformation Detection / Alam, Firoj; Cresci, Stefano; Chakraborty, Tanmoy; Silvestri, Fabrizio; Dimitrov, Dimiter; Da San Martino, Giovanni; Shaar, Shaden; Firooz, Hamed; Nakov, Preslav. - 29:1(2022), pp. 6625-6643. ( 29th International Conference on Computational Linguistics, COLING 2022 Gyeongju, Republic of Korea ).
File allegati a questo prodotto
File Dimensione Formato  
Alam_A-Survey_2022.pdf

accesso aperto

Note: https://aclanthology.org/2022.coling-1.576/
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 608.58 kB
Formato Adobe PDF
608.58 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1678450
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 67
  • ???jsp.display-item.citation.isi??? ND
social impact