The Internet is first of all a world of words. Everyday an immense amount of data and information transits on the Web and the advent of Web 2.0 has intensified these movements. Our management and calculus capabilities are not yet fully able to use the information extracted from the exponential growth of digital texts. However, the speed at which has improved in recent years the relationship between memorization and the management of great amounts of data, makes us hope well for the near future. The encounter between computer science and linguistics is strongly intertwined with statistics and mathematics. The synthesis takes place both at basic research and technological application levels, in particular in the fields of automatic translation, digital recognition, summary of spoken language and in the management of large information systems. Today, more than ever, the quantitative analysis of language is an extraordinary challenge for social science research methodology. In the treatment of digital texts there is a fertile meeting between disciplines that study the uniqueness and the particularity of their subject and disciplines that try to generalize observations by selecting their properties and creating classes of objects. This distinction brought in the past to the separation of human and natural sciences, of interpretation and explanation sciences. Now a synthesis is possible and it is up to the human and social sciences to accept the challenge and to move in the direction of eliminating the presumed contrast between quality and quantity.

The Value of Words. Automatic Text Analysis Tools in Web 2.0 / Giuliano, Luca Carlo. - ELETTRONICO. - 3:(2014). [10.13140/2.1.1765.7281]

The Value of Words. Automatic Text Analysis Tools in Web 2.0

GIULIANO, Luca Carlo
2014

Abstract

The Internet is first of all a world of words. Everyday an immense amount of data and information transits on the Web and the advent of Web 2.0 has intensified these movements. Our management and calculus capabilities are not yet fully able to use the information extracted from the exponential growth of digital texts. However, the speed at which has improved in recent years the relationship between memorization and the management of great amounts of data, makes us hope well for the near future. The encounter between computer science and linguistics is strongly intertwined with statistics and mathematics. The synthesis takes place both at basic research and technological application levels, in particular in the fields of automatic translation, digital recognition, summary of spoken language and in the management of large information systems. Today, more than ever, the quantitative analysis of language is an extraordinary challenge for social science research methodology. In the treatment of digital texts there is a fertile meeting between disciplines that study the uniqueness and the particularity of their subject and disciplines that try to generalize observations by selecting their properties and creating classes of objects. This distinction brought in the past to the separation of human and natural sciences, of interpretation and explanation sciences. Now a synthesis is possible and it is up to the human and social sciences to accept the challenge and to move in the direction of eliminating the presumed contrast between quality and quantity.
2014
9788890875724
Text Mining; Automatic Text Analysis; Data science; Social Sciences
03 Monografia::03a Saggio, Trattato Scientifico
The Value of Words. Automatic Text Analysis Tools in Web 2.0 / Giuliano, Luca Carlo. - ELETTRONICO. - 3:(2014). [10.13140/2.1.1765.7281]
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/618668
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact