The current trend in NLP is the use of highly opaque models, e.g. neural networks and word embeddings. While these models yield state-of-the-art results on a range of tasks, their drawback is poor interpretability. On the example of word sense induction and disambiguation (WSID), we show that it is possible to develop an interpretable model that matches the state-of-the-art models in accuracy. Namely, we present an unsupervised, knowledge-free WSID approach, which is interpretable at three levels: word sense inventory, sense feature representations, and disambiguation procedure. Experiments show that our model performs on par with state-of-the-art word sense embeddings and other unsupervised systems while offering the possibility to justify its decisions in human-readable form.
Unsupervised does not mean uninterpretable: The case for word sense induction & disambiguation / Panchenko, A.; Ruppert, E.; Faralli, S.; Ponzetto, S. P.; Biemann, C.. - 1:(2017), pp. 86-98. (Intervento presentato al convegno 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 tenutosi a esp) [10.18653/v1/e17-1009].
Unsupervised does not mean uninterpretable: The case for word sense induction & disambiguation
Faralli S.
Co-primo
;Ponzetto S. P.
Co-primo
;
2017
Abstract
The current trend in NLP is the use of highly opaque models, e.g. neural networks and word embeddings. While these models yield state-of-the-art results on a range of tasks, their drawback is poor interpretability. On the example of word sense induction and disambiguation (WSID), we show that it is possible to develop an interpretable model that matches the state-of-the-art models in accuracy. Namely, we present an unsupervised, knowledge-free WSID approach, which is interpretable at three levels: word sense inventory, sense feature representations, and disambiguation procedure. Experiments show that our model performs on par with state-of-the-art word sense embeddings and other unsupervised systems while offering the possibility to justify its decisions in human-readable form.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.