Knowledge-enhanced document embeddings for text classification / Sinoara, Roberta Akemi; Camacho-Collados, José; Rossi, Rafael Geraldeli; Navigli, Roberto; Rezende, Solange O. - In: KNOWLEDGE-BASED SYSTEMS. - ISSN 0950-7051. - 163:(2019), pp. 955-971. [10.1016/j.knosys.2018.10.026]
Knowledge-enhanced document embeddings for text classification
José Camacho-Collados (second author); Roberto Navigli (penultimate author)
2019
Abstract
Accurate semantic representation models are essential in text mining applications. For a successful application of the text mining process, the adopted text representation must preserve the interesting patterns to be discovered. Although competitive results for automatic text classification may be achieved with a traditional bag of words, such a representation model cannot provide satisfactory classification performance in hard settings where richer text representations are required. In this paper, we present an approach to represent document collections based on embedded representations of words and word senses. We bring together the power of word sense disambiguation and the semantic richness of word and word-sense embedded vectors to construct embedded representations of document collections. Our approach results in semantically enhanced and low-dimensional representations. We overcome the lack of interpretability of embedded vectors, which is a drawback of this kind of representation, with the use of word-sense embedded vectors. Moreover, the experimental evaluation indicates that the use of the proposed representations provides stable classifiers with strong quantitative results, especially in semantically complex classification scenarios.
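To illustrate the general idea described in the abstract, the sketch below builds a document vector by averaging pre-trained word and word-sense embeddings, preferring the sense vector whenever a word sense disambiguation step has produced one. This is a minimal sketch under stated assumptions: the toy vectors, the sense identifiers, and the plain averaging scheme are illustrative placeholders, not the authors' exact pipeline or the embeddings used in the paper.

```python
import numpy as np

# Illustrative toy embedding space mapping tokens and (hypothetical) sense ids
# to vectors. In the paper's setting these would be pre-trained word and
# word-sense embeddings living in a shared, low-dimensional space.
EMBEDDINGS = {
    "bank":           np.array([0.2, 0.7, 0.1]),
    "bank%financial": np.array([0.3, 0.9, 0.0]),  # hypothetical sense identifier
    "river":          np.array([0.8, 0.1, 0.4]),
}

def embed_document(tokens, sense_ids, embeddings):
    """Average the embedded vectors of a document's words and word senses.

    tokens    -- surface words of the document
    sense_ids -- sense identifiers produced by a WSD system
                 (None where a word could not be disambiguated)
    """
    vectors = []
    for token, sense in zip(tokens, sense_ids):
        if sense is not None and sense in embeddings:
            vectors.append(embeddings[sense])   # prefer the sense vector
        elif token in embeddings:
            vectors.append(embeddings[token])   # fall back to the word vector
    if not vectors:
        # No known word or sense: return the zero vector of matching dimension.
        return np.zeros(len(next(iter(embeddings.values()))))
    return np.mean(vectors, axis=0)             # low-dimensional document vector

doc_vector = embed_document(
    tokens=["bank", "river"],
    sense_ids=["bank%financial", None],
    embeddings=EMBEDDINGS,
)
print(doc_vector)
```

In practice, the sense identifiers would come from running a word sense disambiguation system over the collection beforehand, and the resulting low-dimensional document vectors would be fed to a standard text classifier.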
File | Size | Format | Access
---|---|---|---
Navigli_Knowledge_2019.pdf — Type: publisher's version (published with the publisher's layout); License: All rights reserved | 706.26 kB | Adobe PDF | Archive managers only; contact the author
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.