Word embeddings have recently gained considerable popularity for modeling words in different Natural Language Processing (NLP) tasks including semantic similarity measurement. However, notwithstanding their success, word embeddings are by their very nature unable to capture polysemy, as different meanings of a word are conflated into a single representation. In addition, their learning process usually relies on massive corpora only, preventing them from taking advantage of structured knowledge. We address both issues by proposing a multifaceted approach that transforms word embeddings to the sense level and leverages knowledge from a large semantic network for effective semantic similarity measurement. We evaluate our approach on word similarity and relational similarity frameworks, reporting state-of-the-art performance on multiple datasets.
SensEmbed: Learning sense embeddings for word and relational similarity / Iacobacci, IGNACIO JAVIER; Pilehvar, MOHAMMED TAHER; Navigli, Roberto. - ELETTRONICO. - 1:(2015), pp. 95-105. (Intervento presentato al convegno Association for Computational Linguistics tenutosi a Beijing, China nel July 26 - 31, 2015).
SensEmbed: Learning sense embeddings for word and relational similarity
IACOBACCI, IGNACIO JAVIER;PILEHVAR, MOHAMMED TAHER;NAVIGLI, ROBERTO
2015
Abstract
Word embeddings have recently gained considerable popularity for modeling words in different Natural Language Processing (NLP) tasks including semantic similarity measurement. However, notwithstanding their success, word embeddings are by their very nature unable to capture polysemy, as different meanings of a word are conflated into a single representation. In addition, their learning process usually relies on massive corpora only, preventing them from taking advantage of structured knowledge. We address both issues by proposing a multifaceted approach that transforms word embeddings to the sense level and leverages knowledge from a large semantic network for effective semantic similarity measurement. We evaluate our approach on word similarity and relational similarity frameworks, reporting state-of-the-art performance on multiple datasets.File | Dimensione | Formato | |
---|---|---|---|
Iacobacci_SensEmbed_2015.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
293.89 kB
Formato
Adobe PDF
|
293.89 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.