The paper presents the training of a surrogate model to mimic the functioning of a text mining social profiling method, emotional Text Mining, applied on the Italian tweets containing #Covid-19 during the first lockdown. This surrogate model is based on stateof-the-art Deep learning models in the field of Natural language processing (Nlp). These models feature an architecture called Transformer, which adopts a mechanism of self-attention that weighs the significance of each part of the input data. The model was trained on the tweets and factors of an initial time period to predict the polarisation of a tweet according to each factor found using the ETM technique. The model was then applied to subsequent periods. Using appropriate metrics, it was then possible to quantify the need to re-run the technique from scratch on the following time periods, while also being able to assess which factors still remained relevant over time.
Leveraging Deep Learning models to assess the temporal validity of Emotional Text Mining procedures / Greco, Francesca; Polli, Alessandro; Siciliano, Federico. - 2:(2022), pp. 475-481. (Intervento presentato al convegno 16th International Conference on Statistical Analysis of Textual Data tenutosi a Napoli).
Leveraging Deep Learning models to assess the temporal validity of Emotional Text Mining procedures
Alessandro Polli;Federico Siciliano
2022
Abstract
The paper presents the training of a surrogate model to mimic the functioning of a text mining social profiling method, emotional Text Mining, applied on the Italian tweets containing #Covid-19 during the first lockdown. This surrogate model is based on stateof-the-art Deep learning models in the field of Natural language processing (Nlp). These models feature an architecture called Transformer, which adopts a mechanism of self-attention that weighs the significance of each part of the input data. The model was trained on the tweets and factors of an initial time period to predict the polarisation of a tweet according to each factor found using the ETM technique. The model was then applied to subsequent periods. Using appropriate metrics, it was then possible to quantify the need to re-run the technique from scratch on the following time periods, while also being able to assess which factors still remained relevant over time.File | Dimensione | Formato | |
---|---|---|---|
Greco Polli Siciliano_JADT2022.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
645.54 kB
Formato
Adobe PDF
|
645.54 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.