Skip to Main content Skip to Navigation

Programmation haute performance pour architectures hybrides

Abstract : Clusters of multicore/GPU nodes connected with a fast network offer very high therotical peak performances, reaching tens of TeraFlops. Unfortunately, the efficient programing of such architectures remains challenging because of their complexity and the diversity of the existing programming models. The purpose of this thesis is to improve the programmability of dense scientific applications on hybrid architectures in three ways: reducing the execution times, processing larger data sets and reducing the programming effort. We propose DSTEP, a directive-based programming model expressing both data and computation distribution. A large set of distribution types are unified in a "dstep distribute" directive and the replication of some distributed elements can be expressed using a "halo". The "dstep gridify" directive expresses both the computation distribution and the schedule constraints of loop iterations. We define a distribution model and demonstrate the correctness of the code transformation from the sequential domain to the parallel domain. From the distribution model, we derive a generic compilation scheme transforming DSTEP annotated input programs into parallel hybrid ones. We have implemented such a tool as a compiler integrated to the PIPS compilation workbench together with a library offering the runtime functionality, especially the communication. Our solution is validated on scientific programs from the NAS Parallel Benchmarks and the PolyBenchs as well as on an industrial signal procesing application.
Complete list of metadata

Cited literature [90 references]  Display  Hide  Download
Contributor : ABES STAR :  Contact
Submitted on : Monday, January 18, 2016 - 5:49:22 AM
Last modification on : Wednesday, November 17, 2021 - 12:31:48 PM
Long-term archiving on: : Tuesday, April 19, 2016 - 10:31:25 AM


Version validated by the jury (STAR)


  • HAL Id : tel-01101782, version 2


Rachid Habel. Programmation haute performance pour architectures hybrides. Calcul parallèle, distribué et partagé [cs.DC]. Ecole Nationale Supérieure des Mines de Paris, 2014. Français. ⟨NNT : 2014ENMP0025⟩. ⟨tel-01101782v2⟩



Record views


Files downloads