In this paper we perform a large-scale homophily analysis on Twitter using a hierarchical representation of users' interests which we call a Twixonomy. In order to build a population, community, or single-user Twixonomy we first associate "topical" friends in users' friendship lists (i.e. friends representing an interest rather than a social relation between peers) with Wikipedia categories. A wordsense disambiguation algorithm is used to select the appropriate wikipage for each topical friend. Starting from the set of wikipages representing "primitive" interests, we extract all paths connecting these pages with topmost Wikipedia category nodes, and we then prune the resulting graph G efficiently so as to induce a direct acyclic graph. This graph is the Twixonomy. Then, to analyze homophily, we compare different methods to detect communities in a peer friends Twitter network, and then for each community we compute the degree of homophily on the basis of a measure of pairwise semantic similarity. We show that the Twixonomy provides a means for describing users' interests in a compact and readable way and allows for a fine-grained homophily analysis. Furthermore, we show that midlow level categories in the Twixonomy represent the best balance between informativeness and compactness of the representation.

Large scale homophily analysis in twitter using a twixonomy / Faralli, Stefano; Stilo, Giovanni; Velardi, Paola. - ELETTRONICO. - 2015-:(2015), pp. 2334-2340. (Intervento presentato al convegno 24th International Joint Conference on Artificial Intelligence (IJCAI-15) tenutosi a Buenos Aires, Argentina).

Large scale homophily analysis in twitter using a twixonomy

FARALLI, Stefano;STILO, GIOVANNI;VELARDI, Paola
2015

Abstract

In this paper we perform a large-scale homophily analysis on Twitter using a hierarchical representation of users' interests which we call a Twixonomy. In order to build a population, community, or single-user Twixonomy we first associate "topical" friends in users' friendship lists (i.e. friends representing an interest rather than a social relation between peers) with Wikipedia categories. A wordsense disambiguation algorithm is used to select the appropriate wikipage for each topical friend. Starting from the set of wikipages representing "primitive" interests, we extract all paths connecting these pages with topmost Wikipedia category nodes, and we then prune the resulting graph G efficiently so as to induce a direct acyclic graph. This graph is the Twixonomy. Then, to analyze homophily, we compare different methods to detect communities in a peer friends Twitter network, and then for each community we compute the degree of homophily on the basis of a measure of pairwise semantic similarity. We show that the Twixonomy provides a means for describing users' interests in a compact and readable way and allows for a fine-grained homophily analysis. Furthermore, we show that midlow level categories in the Twixonomy represent the best balance between informativeness and compactness of the representation.
2015
24th International Joint Conference on Artificial Intelligence (IJCAI-15)
Artificial Intelligence
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Large scale homophily analysis in twitter using a twixonomy / Faralli, Stefano; Stilo, Giovanni; Velardi, Paola. - ELETTRONICO. - 2015-:(2015), pp. 2334-2340. (Intervento presentato al convegno 24th International Joint Conference on Artificial Intelligence (IJCAI-15) tenutosi a Buenos Aires, Argentina).
File allegati a questo prodotto
File Dimensione Formato  
Stilo_Large_2015.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.05 MB
Formato Adobe PDF
1.05 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/997503
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 26
  • ???jsp.display-item.citation.isi??? 18
social impact