Automatic dictionary and rule-based systems for extracting information from text / Bolasco, Sergio; Pavone, P. - Print. - (2010), pp. 189-198. (Paper presented at the conference CLADAG 2007 - Classification and Data Analysis Group (of the Italian Statistical Society), held in Macerata, Italy) [10.1007/978-3-642-03739-9_22].
Automatic dictionary and rule-based systems for extracting information from text
BOLASCO, Sergio
2010
Abstract
The paper offers a general introduction to the use of meta-information from a text mining perspective. The aim is to build a meta-dictionary as a linguistic resource that can be reused across different applications. The procedure is based on a hybrid system: the suggested algorithm employs dictionaries and rules jointly and recursively, the rules being both lexical and textual. An application to a corpus of diaries from the Time Use Survey (TUS) by Istat is illustrated.

File | Size | Format
---|---|---
Bolasco_Automatic_2010.pdf (open access). Type: pre-print document (manuscript submitted to the publisher, prior to peer review). License: All rights reserved | 91.17 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.