Catalogo dei prodotti della ricerca

We present the results of a first SYCL vs CUDA performance assessment for the case of the three-dimensional XCA-Flow subsurface Extended Cellular Automata model. A grid domain of 100 x 100 x 50 cubic cells of 0.3 m side was considered, where two different heterogeneous hydraulic conductivity fields were imposed, resulting in different computational loads. For each conductivity field, a 10 days test case simulation with a constant infiltration rate over the central part of the upper domain's interface and no-flow condition at other boundaries was designed as a benchmark for performance assessment. The stencil-based kernels of the XCA-Flow model were implemented by considering the one-thread-one-cell thread-to-cell mapping and global memory accesses. A global reduction, needed by the algorithm at each computational step to find the minimum of a domain's state variable, exploited the device's on-chip local memory (shared memory in the CUDA nomenclature). The CUDA back-end featured SYCL compiler adopted was the Intel DPC++. The experiments were performed on an Nvidia Titan Xp GPU by considering different configurations of SYCL/CUDA thread blocks. Each simulation was re-executed four times by selecting the minimum elapsed time. As expected, the CUDA implementation performed slightly better. Nevertheless, SYCL provided satisfying results, with a limited mean gap of approximately 8%.

First Sycl implementation of the Three-Dimensional Subsruface XCA-FLOW Cellular Automation and Performance Comparison Against CUDA / D'Ambrosio, D., Terremoto, G., De Rango, A., Furnari, L., Senatore, A., Mendicino, G.. - (2022), pp. 47-54. (International Conference on Applied Computing 2022 and WWW/Internet 2022 Lisbon ).

First Sycl implementation of the Three-Dimensional Subsruface XCA-FLOW Cellular Automation and Performance Comparison Against CUDA

D'Ambrosio D.;Terremoto G.;De Rango A.;Furnari L.;Senatore A.;Mendicino G.

2022

Abstract

We present the results of a first SYCL vs CUDA performance assessment for the case of the three-dimensional XCA-Flow subsurface Extended Cellular Automata model. A grid domain of 100 x 100 x 50 cubic cells of 0.3 m side was considered, where two different heterogeneous hydraulic conductivity fields were imposed, resulting in different computational loads. For each conductivity field, a 10 days test case simulation with a constant infiltration rate over the central part of the upper domain's interface and no-flow condition at other boundaries was designed as a benchmark for performance assessment. The stencil-based kernels of the XCA-Flow model were implemented by considering the one-thread-one-cell thread-to-cell mapping and global memory accesses. A global reduction, needed by the algorithm at each computational step to find the minimum of a domain's state variable, exploited the device's on-chip local memory (shared memory in the CUDA nomenclature). The CUDA back-end featured SYCL compiler adopted was the Intel DPC++. The experiments were performed on an Nvidia Titan Xp GPU by considering different configurations of SYCL/CUDA thread blocks. Each simulation was re-executed four times by selecting the minimum elapsed time. As expected, the CUDA implementation performed slightly better. Nevertheless, SYCL provided satisfying results, with a limited mean gap of approximately 8%.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2022
			
	Nome convegno
	
				International Conference on Applied Computing 2022 and WWW/Internet 2022
			
	Parole chiave
	
				Extended Cellular Automata; Performance Assessment; SYCL vs CUDA; XCA-Flow 3D Subsurface Model
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				First Sycl implementation of the Three-Dimensional Subsruface XCA-FLOW Cellular Automation and Performance Comparison Against CUDA / D'Ambrosio, D., Terremoto, G., De Rango, A., Furnari, L., Senatore, A., Mendicino, G.. - (2022), pp. 47-54. (International Conference on Applied Computing 2022 and WWW/Internet 2022 Lisbon ).
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

File	Dimensione	Formato
DAmbrosio_First_2022.pdf accesso aperto Note: https://www.iadisportal.org/components/com_booklibrary/ebooks/202208L006.pdf Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.6 MB Formato Adobe PDF	1.6 MB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1690605

Citazioni

ND

2

ND

social impact