Scaling implicit parallelism via dynamic control replication

Bauer, Michael; Lee, Wonchan; Slaughter, Elliott; Jia, Zhihao; Di Renzo, Mario; Papadakis, Manolis; Shipman, Galen; Mccormick, Patrick; Garland, Michael; Aiken, Alex

doi:10.1145/3437801.3441587

We present dynamic control replication, a run-time program analysis that enables scalable execution of implicitly parallel programs on large machines through a distributed and effi- cient dynamic dependence analysis. Dynamic control repli- cation distributes dependence analysis by executing multiple copies of an implicitly parallel program while ensuring that they still collectively behave as a single execution. By dis- tributing and parallelizing the dependence analysis, dynamic control replication supports efficient, on-the-fly computation of dependences for programs with arbitrary control flow at scale. We describe an asymptotically scalable algorithm for implementing dynamic control replication that maintains the sequential semantics of implicitly parallel programs. An implementation of dynamic control replication in the Legion runtime delivers the same programmer productivity as writing in other implicitly parallel programming models, such as Dask or TensorFlow, while providing better perfor- mance (11.4X and 14.9X respectively in our experiments), and scalability to hundreds of nodes. We also show that dynamic control replication provides good absolute performance and scaling for HPC applications, competitive in many cases with explicitly parallel programming systems.

Scaling implicit parallelism via dynamic control replication / Bauer, Michael; Lee, Wonchan; Slaughter, Elliott; Jia, Zhihao; Di Renzo, Mario; Papadakis, Manolis; Shipman, Galen; Mccormick, Patrick; Garland, Michael; Aiken, Alex. - (2021), pp. 105-118. (Intervento presentato al convegno 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming tenutosi a Republic of Korea) [10.1145/3437801.3441587].

Scaling implicit parallelism via dynamic control replication

Bauer, Michael;Lee, Wonchan;Slaughter, Elliott;Jia, Zhihao;Di Renzo, Mario;Papadakis, Manolis;Shipman, Galen;McCormick, Patrick;Garland, Michael;Aiken, Alex

2021

Abstract

We present dynamic control replication, a run-time program analysis that enables scalable execution of implicitly parallel programs on large machines through a distributed and effi- cient dynamic dependence analysis. Dynamic control repli- cation distributes dependence analysis by executing multiple copies of an implicitly parallel program while ensuring that they still collectively behave as a single execution. By dis- tributing and parallelizing the dependence analysis, dynamic control replication supports efficient, on-the-fly computation of dependences for programs with arbitrary control flow at scale. We describe an asymptotically scalable algorithm for implementing dynamic control replication that maintains the sequential semantics of implicitly parallel programs. An implementation of dynamic control replication in the Legion runtime delivers the same programmer productivity as writing in other implicitly parallel programming models, such as Dask or TensorFlow, while providing better perfor- mance (11.4X and 14.9X respectively in our experiments), and scalability to hundreds of nodes. We also show that dynamic control replication provides good absolute performance and scaling for HPC applications, competitive in many cases with explicitly parallel programming systems.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
			2021
		
	Nome convegno
	
			26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
		
	Parole chiave
	
			implicit parallelism, scalable dependence analysis, dynamic control replication, Legion, task-based runtime
		
	Tipologia
	
			04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
		
	Citazione
	
			Scaling implicit parallelism via dynamic control replication / Bauer, Michael; Lee, Wonchan; Slaughter, Elliott; Jia, Zhihao; Di Renzo, Mario; Papadakis, Manolis; Shipman, Galen; Mccormick, Patrick; Garland, Michael; Aiken, Alex. - (2021), pp. 105-118. (Intervento presentato al  convegno 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming tenutosi a Republic of Korea) [10.1145/3437801.3441587].
		
	Appartiene alla tipologia:
	
			04b Atto di convegno in volume

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1516410

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

9

ND

Catalogo dei prodotti della ricerca