Multilevel parallelism for the exploration of large-scale graphs / Bernaschi, Massimo; Bisson, Mauro; Mastrostefano, Enrico; Vella, Flavio. - In: IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS. - ISSN 2332-7766. - 4:(2018), pp. 204-216. [10.1109/TMSCS.2018.2797195]

Multilevel parallelism for the exploration of large-scale graphs

Bernaschi, Massimo; Bisson, Mauro; Mastrostefano, Enrico; Vella, Flavio
2018

Abstract

We present the most recent release of our parallel implementation of the breadth-first search (BFS) and betweenness centrality (BC) algorithms for the study of large-scale graphs. Although our reference platform is a high-end cluster of new-generation Nvidia GPUs and some of our optimisations are CUDA-specific, most of our ideas can be applied to other platforms that offer multiple levels of parallelism. We exploit multilevel parallel processing through a hybrid programming paradigm that combines highly tuned CUDA kernels, for the computations performed by each node, with explicit data exchange through the Message Passing Interface (MPI), for the communications among nodes. The results of the numerical experiments show that the performance of our code is comparable to or better than that of other state-of-the-art solutions. For the BFS, for instance, we reach a peak performance of 200 Giga TEPS (traversed edges per second) on a single GPU and 5.5 Tera TEPS on 1024 Pascal …
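
As a rough illustration of the two levels described in the abstract (local computation on each node, explicit exchange among nodes), the following is a minimal single-process sketch of a level-synchronous BFS over a partitioned graph. It is not the authors' implementation: the modulo ownership rule, the 1D partitioning, and the names `partitioned_bfs`, `owner`, and `teps` are assumptions made for this example, the inner loops only stand in for what a GPU kernel would do, and the outbox delivery only stands in for an MPI all-to-all exchange.

```python
from collections import defaultdict


def partitioned_bfs(edges, num_vertices, num_parts, source):
    """Level-synchronous BFS on an undirected graph split across simulated nodes.

    Returns (level, edges_scanned): level[v] is the BFS depth of v (-1 if
    unreachable) and edges_scanned counts adjacency entries inspected.
    """
    def owner(v):
        # Hypothetical ownership rule: vertex v lives on node v % num_parts.
        return v % num_parts

    # Each simulated node stores adjacency lists only for the vertices it owns.
    local_adj = [defaultdict(list) for _ in range(num_parts)]
    for u, v in edges:
        local_adj[owner(u)][u].append(v)
        local_adj[owner(v)][v].append(u)

    level = [-1] * num_vertices
    level[source] = 0
    frontiers = [[] for _ in range(num_parts)]
    frontiers[owner(source)].append(source)
    depth = 0
    edges_scanned = 0

    while any(frontiers):
        # Compute phase: every node expands its own frontier (the part a
        # GPU kernel would handle) and buckets neighbours by destination.
        outbox = [[[] for _ in range(num_parts)] for _ in range(num_parts)]
        for p in range(num_parts):
            for u in frontiers[p]:
                for v in local_adj[p][u]:
                    edges_scanned += 1
                    outbox[p][owner(v)].append(v)

        # Exchange phase: deliver each bucket to the owning node (the part an
        # all-to-all message exchange would handle); owners drop visited vertices.
        depth += 1
        frontiers = [[] for _ in range(num_parts)]
        for q in range(num_parts):
            for p in range(num_parts):
                for v in outbox[p][q]:
                    if level[v] == -1:
                        level[v] = depth
                        frontiers[q].append(v)

    return level, edges_scanned


def teps(edge_count, seconds):
    """Traversed edges per second for a given edge count and elapsed time."""
    return edge_count / seconds
```

For a path 0-1-2-3 with an extra edge 0-4 and an isolated vertex 5, `partitioned_bfs([(0, 1), (1, 2), (2, 3), (0, 4)], 6, 2, 0)` returns levels `[0, 1, 2, 3, 1, -1]`. Note that this sketch counts each undirected edge once per direction; benchmark conventions for which edges enter the TEPS figure may differ.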
Algorithm design and analysis; Clustering algorithms; Graphics processing units; Parallel processing; Sparse matrices; Symmetric matrices; Two dimensional displays; Control and Systems Engineering; Information Systems; Hardware and Architecture
01 Journal publication::01a Journal article

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1213536

Citations
  • PMC: not available
  • Scopus: 6
  • ISI: 4