The paper offers a general introduction to the use of meta-information in a text mining perspective. The aim is to build a meta-dictionary as an available linguistic resource useful for different applications. The procedure is based on the use of a hybrid system. The suggested algorithm employs, conjointly and in a recursive way, dictionaries and rules, the latter both lexical and textual. An application on a corpus of diaries from the Time Use Survey (TUS) by Istat is illustrated

Automatic dictionary and rule-based systems for extracting information from text / Bolasco, Sergio; Pavone, P.. - STAMPA. - (2010), pp. 189-198. (Intervento presentato al convegno CLADAG 2007 - Classification and Data Analysis Group (of Italian statistical Society) tenutosi a Macerata; Italy) [10.1007/978-3-642-03739-9_22].

Automatic dictionary and rule-based systems for extracting information from text

BOLASCO, Sergio;
2010

Abstract

The paper offers a general introduction to the use of meta-information in a text mining perspective. The aim is to build a meta-dictionary as an available linguistic resource useful for different applications. The procedure is based on the use of a hybrid system. The suggested algorithm employs, conjointly and in a recursive way, dictionaries and rules, the latter both lexical and textual. An application on a corpus of diaries from the Time Use Survey (TUS) by Istat is illustrated
2010
CLADAG 2007 - Classification and Data Analysis Group (of Italian statistical Society)
linguistic resources; hybrid systems; meta-data
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Automatic dictionary and rule-based systems for extracting information from text / Bolasco, Sergio; Pavone, P.. - STAMPA. - (2010), pp. 189-198. (Intervento presentato al convegno CLADAG 2007 - Classification and Data Analysis Group (of Italian statistical Society) tenutosi a Macerata; Italy) [10.1007/978-3-642-03739-9_22].
File allegati a questo prodotto
File Dimensione Formato  
Bolasco_Automatic_2010.pdf

accesso aperto

Tipologia: Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 91.17 kB
Formato Adobe PDF
91.17 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/61376
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 8
social impact