Progress on porting the Community Atmosphere Model - Spectral Element (CAM-SE) to the GPU-CPU hybrid architecture.
Abstract
In this poster we summarize our progress on porting the CAM-SE model to Titan's GPU-CPU hybrid architecture. The existing CUDA port has been improved in several ways. First, thread block sizes were generalized so that they can be any multiple of 16. This improved kernel efficiency and allowed any number of vertical levels to be specified. Next, additional variables were placed in local GPU cache to improve the efficiency of the hyperviscosity routines. The packing and unpacking routines, which are used to average element boundaries for the SE method and to prepare data for MPI message passing, were entirely revamped and made more memory efficient. In all, this resulted in over 4x improvement in runtime for some individual kernels, and roughly 30% runtime savings overall. More importantly, it allows more general science cases, which require many vertical levels, to be run. In addition to our optimization work, we continue porting the rest of the CAM-SE code to GPUs, such as the advection limiters. The existing CUDA version contained only one simplified limiter for the advection operator; it has been improved by adding a CUDA implementation of the "Optimal limiter". This provides our CUDA implementation with all the necessary capabilities implemented in CAM-SE.
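The block-size generalization can be illustrated with a little arithmetic. The sketch below is not taken from the CAM-SE source: it assumes, purely for illustration, that a kernel assigns one thread per vertical level and pads the block up to the next multiple of 16, leaving the extra threads idle. The function names and the sample level counts are hypothetical.

```python
# Minimal sketch (assumption: one thread per vertical level, block padded
# up to the next multiple of 16; this mapping is illustrative only and is
# not confirmed by the abstract).

BLOCK_MULTIPLE = 16  # the abstract states block sizes may be any multiple of 16


def padded_block_size(nlev: int, multiple: int = BLOCK_MULTIPLE) -> int:
    """Return the smallest multiple of `multiple` that is >= nlev."""
    if nlev <= 0:
        raise ValueError("number of vertical levels must be positive")
    return ((nlev + multiple - 1) // multiple) * multiple


def idle_threads(nlev: int, multiple: int = BLOCK_MULTIPLE) -> int:
    """Threads in the padded block that have no vertical level to work on."""
    return padded_block_size(nlev, multiple) - nlev


if __name__ == "__main__":
    # Sample level counts chosen only as examples.
    for nlev in (26, 30, 72):
        print(nlev, padded_block_size(nlev), idle_threads(nlev))
```

Under this assumption, any level count fits in some valid block size, at the cost of at most 15 idle threads per block; in a real kernel those threads would be masked out with a bounds check.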