SC22 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Workshops Archive

SABO: Dynamic MPI+OpenMP Resource Balancer


Workshop: Fifth International Workshop on Emerging Parallel Distributed Runtime Systems and Middleware

Authors: Cassandra Rocha Barbosa, Pierre Lemarinier, and Guillaume Papauré (Atos); Marc Pérache (Atomic Energy and Alternative Energies Commission (CEA)); and Michaël Krajecki (University of Reims Champagne-Ardenne (URCA))


Abstract: Scientific parallel applications often use MPI for inter-node communications and OpenMP for intra-node orchestration. Parallel applications such as particle transport, seismic wave propagation simulator, or finite-element applications often exhibit workload imbalance due to their nature of ongoing data movement. These applications usually develop software balancing strategies triggered when some imbalance thresholds are detected to reduce this imbalance. These developments are complex to implement and impact the entire distributed applications’ performance by synchronizing and exchanging the load over the network. This preseentation proposes a method to dynamically detect load imbalance and balance the computation by redistributing OpenMP threads between MPI processes local to the node. With minimal impact on the applications’ codes, we demonstrate how this technique can improve the overall applications’ performance up to 28% on MiniFE, 17% on Quicksilver, and 3% on Ondes3D. We also present its impact when executing multiple nodes and our proposed approach’s limitations.





Back to Fifth International Workshop on Emerging Parallel Distributed Runtime Systems and Middleware Archive Listing



Back to Full Workshop Archive Listing