01. September 2022
-
30. November 2025
BMFTR
- Green HPC call
The project has demonstrated, that adjusting system parameters during job execution can lead to significant energy savings. In particular, adjusting the power cap can often lead to a substantial reduction of energy-to-solution without significant loss of application performance. However, the project has also shown, that measuring application performance is not trivial. Often, simply counting instruction is not sufficient as many of those instructions will not do any meaningful work as for instance when busy-waiting for completion of MPI operations. At HLRS, we have investigated using recurring events related to the usage of MPI as a measure to more reliably estimate application performance.
EE-HPC has treated of compute nodes in a job as behaving the same with regard to their energy usage. While this is a good starting point, and sufficient, for many applications, in a future project, we would like to explore scenarios where nodes behave differently, as for instance in the precedence of dynamic load shifts or heterogenous applications. In particular we wish to investigate the potential of tuning system parameters on each node independently under the constraint of achieving overall energy reductions.