Research output catalogue

This paper proposes a general-purpose data mining model for Arabic texts, ArMExLeR (Arabic Meaning Extraction through Lexical Resources). The model chains existing public-domain and published lexical resources (Stanford Parser, WordNet, Arabic WordNet, SUMO, AraMorph, A Frequency Dictionary of Arabic) into a pipeline that extracts a weakly hierarchised, single-predicate-level representation of meaning. Such a model could have a high impact on the computational analysis of Arabic, since no comparable tool exists for the language, but the specific features of Arabic make it a challenge to build. In particular, the writing system is mostly consonant-based and does not always mark vowels explicitly. This is crucial when analysing an Arabic corpus, because the same consonantal ductus may be read in several ways.

Arabic meaning extraction through lexical resources: A general-purpose data mining model for Arabic texts / Lancioni, Giuliano; Pepe, Ivana; Silighini, Alessandra; Pettinari, Valeria; Cicola, Ilaria; Benassi, Leila; Campanelli, Marta. - Electronic. - (2013), pp. 107-112. (IMMM 2013, The Third International Conference on Advances in Information Mining and Management, Lisbon, Portugal, 17 November 2013).

Arabic meaning extraction through lexical resources: A general-purpose data mining model for Arabic texts

Lancioni, Giuliano; Pepe, Ivana; Silighini, Alessandra; Pettinari, Valeria; Cicola, Ilaria; Benassi, Leila; Campanelli, Marta

2013

Full record

Year of publication: 2013
Conference name: IMMM 2013, The Third International Conference on Advances in Information Mining and Management
Keywords: Arabic data mining; content extraction; automatic parsing techniques; ontologies
Type: 04 Publication in conference proceedings::04b Conference paper in volume
Belongs to type: 04b Conference paper in volume

Files attached to this item

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/541237

Warning

The data displayed have not been validated by the university.
