In this paper we propose a new approach to the generation of pseudowords, i.e., artificial words which model real polysemous words. Our approach simultaneously addresses the two important issues that hamper the generation of large pseudosense-annotated datasets: semantic awareness and coverage. We evaluate these pseudowords from three different perspectives showing that they can be used as reliable substitutes for their real counterparts.
Paving the Way to a Large-scale Pseudosense-annotated Dataset / Pilehvar, MOHAMMED TAHER; Navigli, Roberto. - STAMPA. - (2013), pp. 1100-1109. (Intervento presentato al convegno NAACL-HLT 2013 tenutosi a Atlanta, USA nel 10-12 June 2013).
Paving the Way to a Large-scale Pseudosense-annotated Dataset
PILEHVAR, MOHAMMED TAHER;NAVIGLI, ROBERTO
2013
Abstract
In this paper we propose a new approach to the generation of pseudowords, i.e., artificial words which model real polysemous words. Our approach simultaneously addresses the two important issues that hamper the generation of large pseudosense-annotated datasets: semantic awareness and coverage. We evaluate these pseudowords from three different perspectives showing that they can be used as reliable substitutes for their real counterparts.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.