Predicting the climate system depends heavily on the efficient integration of observations and simulations of the Earth. Such integration is regarded as a canonical example of a cyber-physical system. The climate system model, the simulation engine in this cyber-physical system, is one of the most challenging applications in scientific computing. It relies on multi-physics simulation that couples multiple components, runs decadal to millennial simulations, and has long been an important application on supercomputers.
However, current climate system models suffer from inefficient task scheduling methods, which result in intolerably long simulation times. Take the Community Earth System Model (CESM), the most widely used climate system model, as an example: one major reason for its poor performance is the huge overhead of rationally distributing processes among the coupled heterogeneous components. According to a report from NCAR, every percent improvement in CESM performance frees up to the equivalent of $250,000 in computing resources in their scientific experiments. To address this challenge, our paper first constructs a lightweight and accurate performance model that effectively captures and predicts the heterogeneous time-to-solution performance of end-to-end CESM components under a given simulation configuration. Then, based on the performance model, we propose an efficient scheduling strategy based on a rectangular packing method to determine the best process layout among the different components and the number of processes assigned to each component. Our evaluations show that we can achieve 58 [...] 40,089 with a charge of $0.01 per CPU hour.
Meanwhile, our method gains an extra 26% performance improvement compared with the heuristic branch-and-bound algorithm guided by the known curve-fitting performance model.