Semantic similarity is an essential component of many Natural Language Processing applications. However, prior methods for computing semantic similarity often operate at different levels, e.g., single words or entire documents, which requires adapting the method for each data type. We present a unified approach to semantic similarity that operates at multiple levels, all the way from comparing word senses to comparing text documents. Our method leverages a common probabilistic representation over word senses in order to compare different types of linguistic data. This unified representation shows state-of-the-art performance on three tasks: semantic textual similarity, word similarity, and word sense coarsening. © 2013 Association for Computational Linguistics.
Align, disambiguate and walk: A unified approach for measuring semantic similarity / Pilehvar, Mohammed Taher; Jurgens, David Alan; Navigli, Roberto. - Print. - 1:(2013), pp. 1341-1351. (Paper presented at the 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, held in Sofia, Bulgaria, 4-9 August 2013).
Align, disambiguate and walk: A unified approach for measuring semantic similarity
Pilehvar, Mohammed Taher; Jurgens, David Alan; Navigli, Roberto
2013