Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Page Properties
1.Poster TitleParallel Ensemble Simulations for ACME Performance and Verification
2.AuthorsAbigail Gaddis (Unlicensed), Matthew Norman, Kate Evans (Unlicensed), Salil Mahajan, Mark Taylor
3.GroupAtmosphere, Performance
4.Experiment 
5.Poster CategoryProblem/Solution
6.Submission TypePoster
7.Poster Linkhttps://acme-climate.atlassian.net/wiki/download/attachments/31130387/ACME_Problem_Poster_48x48.pptx?api=v2

Abstract

 

The status quo for high-resolution climate simulation is to perform a very small ensemble (order five) of long simulations (roughly a century) for various scenarios arising from IPCC specifications. To succeed in feasible time, a throughput constraint of five Simulated model Years Per wallclock Day (SYPD) is generally accepted as necessary. To achieve this, CAM-SE is used and is scaled over many Processing Elements (PEs), and work per node is very small. At this scale, parallel data transfer overheads are 40% of the total runtime or more, and there are very few threadable indices to use on an accelerator (e.g. Graphics Processing Unit, GPU). Also, even at these scaling limits, ACME is barely achieving a “capability-scale” portion of Titan (i.e., > 25% of the machine), and throughput is still only around one SYPD for the 28km-mesh water cycle experiment targeted by ACME. This, in turn, means (1) a low benefit from using GPUs and (2) poor usage of computer allocations, and (3) less likelihood of receiving large computing allocations in the future.

...