Violence against woman is a persistent problem afecting our society. The use of Big Data represents a new challenging opportunity to integrate ofcial statistics with more updated information. Social media constitute, in fact, a particularly useful data source for analysing gender-based violence, cyber-violence and gender stereotypes. This report describes the results of a methodological approach, aimed at studying gender stereotypes, using textual data published on social media, showing which positive or negative efects may be generated in public opinion when certain messages are spread. Sentiment and emotion analysis has been carried out to measure how the phenomenon is represented among social network users. The statistical quality of the results was assessed. The proposal of a methodology to generate a linguistic resource aimed at improving the capacity of the machine learning BERT algorithm to classify social media data on gender stereotypes integrates the specific contribution of this study.
Challenging Big Data for studying gender-based violence: a methodological proposal / Deriu, Fiorenza; Villante, Claudia; Muratore, Maria Giuseppina; Gallo, Raffaella; Toppetti, Edoardo. - In: QUALITY & QUANTITY. - ISSN 0033-5177. - (2025), pp. 1-19. [10.1007/s11135-025-02209-4]
Challenging Big Data for studying gender-based violence: a methodological proposal
Deriu, Fiorenza
Writing – Review & Editing
;Gallo, RaffaellaWriting – Review & Editing
;
2025
Abstract
Violence against woman is a persistent problem afecting our society. The use of Big Data represents a new challenging opportunity to integrate ofcial statistics with more updated information. Social media constitute, in fact, a particularly useful data source for analysing gender-based violence, cyber-violence and gender stereotypes. This report describes the results of a methodological approach, aimed at studying gender stereotypes, using textual data published on social media, showing which positive or negative efects may be generated in public opinion when certain messages are spread. Sentiment and emotion analysis has been carried out to measure how the phenomenon is represented among social network users. The statistical quality of the results was assessed. The proposal of a methodology to generate a linguistic resource aimed at improving the capacity of the machine learning BERT algorithm to classify social media data on gender stereotypes integrates the specific contribution of this study.| File | Dimensione | Formato | |
|---|---|---|---|
|
Deriu_challenging-big-data_2025.pdf
solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
906.99 kB
Formato
Adobe PDF
|
906.99 kB | Adobe PDF | Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


