High Performance Parallelization of COMPSYN on a Cluster of Multicore Processors with GPUs

Alessi, Ferdinando; Massini, Annalisa; Basili, Roberto

doi:10.1016/j.procs.2012.04.103

In this work we propose a high-performance parallelization of the COMPSYN software package, which produces synthetic seismograms, on a cluster of multicore processors with multiple GPUs. To design and implement the high-performance version, we started from a naive parallel version of COMPSYN. The naive version is a simple parallelization on both the device side, using CUDA, and the host side, using the MPI paradigm and the OpenMP API. The high-performance version implements several practical CUDA programming techniques and exploits the GPU architecture more thoroughly, achieving much better performance than the naive version. We compare the performance of both GPU versions with that of the version running on the cluster of multicore processors without invoking the GPUs. The high-performance GPU version achieves a speedup of 25x over the version running on the cluster without GPUs, against 10x for the naive version. Relative to the sequential version, we estimate a speedup of about 380x for the high-performance GPU version, against about 140x for the naive version.
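The speedup figures in the abstract can be related to one another with simple arithmetic. The sketch below is not from the paper: the dictionary layout and the function names are hypothetical, and the derived values are only as precise as the rounded (and, for the sequential baseline, estimated) figures in the abstract.

```python
# Speedups as reported in the abstract. The figures are rounded, and the
# ones relative to the sequential version are estimates by the authors.
REPORTED = {
    "naive": {"vs_cluster": 10.0, "vs_sequential": 140.0},
    "high_performance": {"vs_cluster": 25.0, "vs_sequential": 380.0},
}


def relative_gain(baseline: str) -> float:
    """Ratio of the high-performance speedup to the naive speedup
    for the same baseline ("vs_cluster" or "vs_sequential")."""
    return REPORTED["high_performance"][baseline] / REPORTED["naive"][baseline]


def implied_cluster_speedup(version: str) -> float:
    """Speedup of the CPU-only cluster run over the sequential run that
    the two figures reported for one GPU version would imply."""
    figures = REPORTED[version]
    return figures["vs_sequential"] / figures["vs_cluster"]


if __name__ == "__main__":
    for baseline in ("vs_cluster", "vs_sequential"):
        print(baseline, round(relative_gain(baseline), 2))
    for version in REPORTED:
        print(version, round(implied_cluster_speedup(version), 1))
```

Under these assumptions, the high-performance version is 2.5 times faster than the naive one when measured against the cluster baseline. The two GPU versions imply slightly different cluster-over-sequential speedups (about 14x and about 15x), which is consistent with the sequential figures being estimates.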

High Performance Parallelization of COMPSYN on a Cluster of Multicore Processors with GPUs / Alessi, F., Massini, A., Basili, R. - Print. - 9:(2012), pp. 966-975. (International Conference on Computational Science (ICCS), Omaha, NE, June 4-6, 2012) [10.1016/j.procs.2012.04.103].



Year of publication: 2012
Conference name: International Conference on Computational Science (ICCS)
Keywords: CUDA; GPU; synthetic seismogram
Type: 04 Publication in conference proceedings::04b Conference paper in volume


Use this identifier to cite or link to this document: https://hdl.handle.net/11573/463732
