Over the past 20 years, the Life Sciences have witnessed a paradigm shift in the way research is performed: the computational part of biological and clinical studies has become central, or is becoming so. Correspondingly, the amount of data to be processed, compared and analyzed has grown exponentially. As a consequence, High Performance Computing (HPC) is being used intensively, in particular on multi-core architectures. More recently, thanks to advances in the processing of other scientific and commercial data, Distributed Computing is also being considered for Bioinformatics applications. In particular, the MapReduce paradigm, together with the main middleware supporting it, i.e., Hadoop and Spark, is becoming increasingly popular. Here we provide a short review of the state of the art of MapReduce bioinformatics applications, together with a qualitative evaluation of each of the software systems included. To make the paper self-contained, computer architecture and middleware issues are also briefly presented.

MapReduce in computational biology - A synopsis / Cattaneo, Giuseppe; Giancarlo, Raffaele; Piotto, Stefano; FERRARO PETRILLO, Umberto; Roscigno, Gianluca; Di Biasi, Luigi. - PRINT. - 708:(2017), pp. 53-64. (Paper presented at the 11th Italian Workshop on Artificial Life and Evolutionary Computation, WIVACE 2016, held in Fisciano (Italy) in 2016) [10.1007/978-3-319-57711-1_5].

MapReduce in computational biology - A synopsis

FERRARO PETRILLO, UMBERTO
2017

11th Italian Workshop on Artificial Life and Evolutionary Computation, WIVACE 2016
bioinformatics; distributed computing; hadoop; MapReduce; spark; computer science (all)
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this item

Cattaneo_MapReduce_2017.pdf (open access)
Type: Pre-print (manuscript submitted to the publisher, prior to peer review)
License: All rights reserved
Size: 1.57 MB
Format: Adobe PDF

Cattaneo_MapReduce_2017.pdf (repository managers only)
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 5.25 MB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/970999