Running E3SM on New Atmosphere Grids

The purpose of this page is to document the procedure for adding support for new atmosphere grids. The process should be the same for new uniform resolutions as well as for new regionally-refined meshes, although some settings will need to be changed for new regionally-refined mesh configurations. This page is a work in progress, and will be updated as this process is refined and (eventually) made more automated. This documentation is an update of a document written by @Mark Taylor and Colin Zarzycki, available as a Google Doc here.

Table of Contents

Child Pages

Useful Example Pages:

File

Tool

Type

Note

File

Tool

Type

Note



ncks, ncremap

NCO



mapping files and mesh template files

TempestRemap

C++

GenerateCSMesh:  make cubed sphere  Exodus (.g) files (spectral element "np" grid)
GenerateVolumetricMesh  create a FV "pg" from a spectral element "np" grid
ConvertMeshToSCRIP convert a FV "pg" Exodus file into a SCRIP file
GenerateOverlapMesh:  used in making mapping files
GenerateOfflineMap:  generate mapping files.  Only tool which can make mapping files directly from SE Exodus files.  

RRM Exodus (.g)  mesh files

SquadGen

C++



topo files

homme_tool

Fortran

Included with E3SM.  Should build and run on any system which can run the HOMME_P24.f19_g16_rx1.A test

Generate (obsolete) SCRIP files for the spectral element "np" dual grid

Used for topo smoothing for both FV "pg" and SE "np" grids. 

Can also do parallel interpolation from SE np4 grid to any SCRIP grid

topo files

cube_to_target

Fortran

NCAR utility for generating unsmoothed topography from high-res USGF data, and generating surface roughness fields from smoothed topography

mapping files

ESMF_Regridweightgen



Make FV->FV mapping files from SCRIP grid template files

Only tool which supports the montone 2nd order "bilin" map



gen_domain_files 

CIME and ELM tools



land surface dataset 

mksurfdata.pl, mksurfdata_map,

Perl and Fortran



ELM initial condition

interpinic

Fortran

4 options:

  1. cold start: (no IC file).  only suitable for testing functionality in other components.

  2. Run long spinup with prescribed atmosphere 

  3. Interpolate from a spunup IC file from a different simulation, via "interpinc" utility

  4. Inline interpolation for land initial condition is available in E3SM code, but this capability might get broken with the new land subgrid structure





Archives of already-created atmosphere grid and mapping files can be accessed at https://web.lcrc.anl.gov/public/e3sm/mapping/grids/ and https://web.lcrc.anl.gov/public/e3sm/mapping/maps/ . This page focuses on creating new versions of these files.

For the purpose of this step-by-step guide, we will walk through each step using an example of regenerating the ne4 grid that is already currently supported. For a new grid (i.e., ne512), just replace ne4 with the desired grid identifier. This is determined in Step 1 Generate a new "grid" file below, and the rest should follow.

For this example, I will assume a few environment variables, which we should set now. First, determine where to put the generated output files. For big grids, this can take up quite a bit of space, so choose a location with sufficient space. I have a symlink to my CSCRATCH directory on NESRC in my home directory, so I will put grid files there for this example:

output_root=${HOME}/cscratch/e3sm/grids/ne4 mkdir -p ${output_root}

Types of Atmosphere grid metadata files

See SE Atmosphere Grid Overview (EAM & CAM) for description of the spectral elements, GLL nodes, subcell grid and dual grid.   

  • Exodus file: "ne4.g".   This is a netcdf file following Exodus conventions.  It gives the corners of all elements on the sphere and their connectivity.  It is independent of the polynomial order used inside the element ("np").  

    • This file is used by TempestRemap (TR) to generate mapping files.  The polynomial order is a command line option and the GLL nodes are internally generated by TR.  

  • SCRIP file:  "ne4pg2.scrip.nc".   This file contains a description of the atmosphere physics grid n the format used by the original incremental remap tool SCRIP. It is used for most output and also used to generate mapping files between components and for post-processing of most output.

  • Less common “GLL” metadata files needed for specialized purposes:

    • SCRIP file:  "ne4np4_scrip.nc".   This file contains a description (SCRIP format) of the GLL dual grid. It includes the locations of the GLL nodes and artificial bounding polygons around those nodes.   Ideally the spherical area of each polygon will match the GLL weight ("exact GLL areas"), but not all tools can achieve exact areas.  Inexact areas does not impact the accuracy of the resulting mapping algorithm, it just means that mass will not be exactly conserved by the mapping algorithm.  

    • latlon file:  "ne4np4_latlon.nc".   This file contains a list of all the GLL nodes in the mesh (in latitude/longitude coordinates).   The list of GLL nodes must be in the the internal HOMME global id ordering, matching the ordering used in CAM and EAM native grid output.   It also contains the connectivity of the GLL subcell grid.   

      • This file is used by CAM's interpic_new utility, and graphics programs Paraview and Visit when plotting native grid GLL output.




Step-by-step guide

1. Generate a new atmosphere "grid" file

Requirements: TempestRemap

In order to generate mapping files to do interpolation between a new atmosphere grid and the other component grids, we need to create a file that describes the atmosphere grid representation. The TempestRemap tool (https://github.com/ClimateGlobalChange/tempestremap) can be used to recreate the internal representation of the spectral element grids used in HOMME. The grid mesh file will be saved in an "exodus" type file (with a .g extension). 

TempestRemap can now be pulled in with NCO via conda, or can be built from source from the Github repository. To build from source, make sure a netCDF module is loaded. This can be accomplished by sourcing an env_mach_specific.sh from a working case on the target machine before building. Then, 

git clone https://github.com/ClimateGlobalChange/tempestremap.git cd tempestremap make -f Makefile.gmake cd ..

This will build TempestRemap into tempestremap/bin. For the rest of the tutorial, assume that the environment variable ${tempest_root} points to this directory (or where a conda-build of TempestRemap exists):

tempest_root=${PWD}/tempestremap

We can generate an ne4 mesh easily now:

For RRM grids, SQuadGen should be used (available in a separate repository on Github here). See https://acme-climate.atlassian.net/wiki/spaces/DOC/pages/1028128773 for details and examples. The example above follows the convention ne<res>.g. RRM meshes should follow the convention <area refined>_<base resolution>x<refinement level>.g. For example, for a RRM with 4x refinement from ne30 to ne120 over CONUS, we should use the convention conus_ne30x4.g (note that existing meshes use the naming convention <area of refinement>x<refinement level>v<version>.g, but future meshes should use the new naming convention; we also plan to eventually change the extension to _exodus.nc, but I believe this will break nco functionality to automatically detect when to use TempestRemap for mapping when detecting exodus files).

The Exodus file contains only information about the position of the spectral element on the sphere. For SE aware utilities such as TempestRemap, they can use the polynomial order and the reference element map to fill in necessary data such as the locations of the nodal GLL points. For non-SE aware utilities, we need additional meta data, described in the next section.   

2A. Generate control volume mesh files for E3SM v2 "pg2" grids 

Requirements:

  • exodus mesh file

  • TempestRemap

In E3SM v2, we switched to running physics on a FV pg2 grid and mapping files between the atmosphere and other components will be FV to FV type maps using the pg2 grids.  These can be generated by TempestRemap directly from the exodus mesh file:



2B. Generate "dual grid" mesh files (SCRIP and lat/lon format) for E3SM v1 "np4" GLL grids

Requirements:

  • exodus mesh file

  • homme_tool

Note: in E3SM v2, we switched to running physics on a FV pg2 grid and mapping files between the atmosphere and other components will be FV to FV type maps using the pg2 grids.   The spectral element "np4" grid is still used internally by the dynamics and for initial conditions, so the metadata described in this section is still needed for some analysis and preparation of initial conditions.   

Mapping files used by the coupler for mapping fluxes between SE "np4" and FV grids should be generated with TempestRemap and only need the exodus grid description (which provides the locations of the corners of the quadrilaterals that form the elements of the cube-sphere mesh) generated in Step 1 above. However, a handful of pre- and post-processing tools require a finite-volume equivalent of the spectral element grid (these tools include the surface roughness calculation in the land tool cube_to_target, ESMF mapping files used for interpolating land surface data to target grid, and post-processing regional and subgrid remapping tools). We refer to this finite-volume description of the SE grid as the "dual grid" to the SE grid (see page describing atmosphere grids in more detail here).     

Generate SCRIP “dual grid” with homme_tool:

  1. Be sure your environment matches the software environment loaded by E3SM by executing the output of this command:   e3sm/cime/scripts/Tools/get_case_env

  2. Use cmake to configure and compile standalone HOMME.  On a supported platform with the CIME environement, this should work out-of-the-box.  See e3sm/components/homme/README.cmake

  3. compile the HOMME tool utility: 

    1. cd /path/to/workingdir 

    2. make -j4 homme_tool

    3. executable:   /path/to/workingdir/src/tool/homme_tool

  4. Edit e3sm/components/homme/test/tool/namelist/template.nl and specify the grid resolution or RRM file

    1. For ne512, this would be ne = 512. For RRM grids, leave ne = 0, but will need to edit where the exodus grid file comes from

    2. for non-RRM grids using the older E3SM v1 dycore, add cubed_sphere_map=0 to template.nl

  5. See e3sm/components/homme/test/tool/test.job for examples of how to run the script and then use an NCL utilities to process the tool output into SCRIP and latlon formats.  

Specific details for running at NERSC on Cori(knl):

  1. Create a batch script hange "account" in the sbatch directives at the top of the script. For example, set #SBATCH --account=e3sm

  2. cmake -C /path/to/e3sm/components/homme/cmake/machineFiles/cori-knl.cmake  -DPREQX_NP=4 /path/to/workingdir

  3. Make sure a working NCL is in your PATH. On Cori, add the following to the script: module load ncl.

2C. Atmospheric mesh quality

Atmospheric RRM mesh quality can be measured with the “Max Dinv-based element distortion” metric. This will be printed in the log file for standalone HOMME or EAM simulations (and can be obtained from the log files during the topo generation step). It measures how distorted the elements become in the mesh transition region. It is the ratio of the two singular values of the 2x2 derivative matrix of the element map to the unit square, representing the ration of the largest length scale to the smallest length scale. A grid of perfect quadrilaterals will have a value of 1.0. The equal-angle cubed-sphere grid has a value of 1.7.   A high quality regionally refined grid will have a value less than 4. With a high quality grid, usually one can run with the timesteps used in a uniform grid with matching fine resolution. RRM grids with a value > 4 may require smaller timesteps for stability. Very large values indicate a problem with the grid and it should be redesigned.

3. Generate mapping files

Requirements:

  • TempestRemap

  • ESMF_RegridWeightGen

  • ncremap

  • grid descriptor files for each component that exists on a different grid
    (atmosphere, ocean, possibly land if on a different grid than the atmosphere)

In order to pass data between different components at runtime, a set of mapping files between each component is generated offline. These mapping files will also be used in Step 4 below (generating domain files).

See Transition to TempestRemap for Atmosphere grids for a discussion of different remap algorithms and when to use each.

TempestRemap and ESMF are the backends that generate the mapping weights, but this is all nicely encapsulated using ncremap. Tempest is the preferred method for creating mapping files.     The ncremap function, calling TempestRemap or ESMF, will decide which of these two tools to use based on the atmospheric input file.  If the *.scrip file was used, then ESMF will be called.  If the *.g file was used, then TempestRemap will be called.   The ESMF tools are adequate for making atmosphere-only-type component sets for E3SM, but this tool is less conservative than TempestRemap.   If you are making grids for a coupled run, then TempestRemap should be used.  

The easiest way to make sure ncremap is up to date and will work for this workflow, we can source the ESMF-Unified conda environment. Instructions for supported machines can be found here. For Edison and Cori, it is as simple as:

For unsupported machines you may need to install the conda environment via:

TempestRemap was built in Step #1 above.     We need to tell ncremap where to find our TempestRemap binaries, so we need to append to our PATH:

And now we can use ncremap to generate ALL the needed mapping files between two grids, in this example the ne4 atmosphere and the oQU240 ocean grid (for the moment, we will put the land on the same grid as the atmosphere):

In the above code, the "-P mwf" option triggers a specific procedure that invokes multiple ncremap commands which each invoke multiple Tempest commands.It produces 3 mapping files each for a2o and o2a. This multitude of mapping files for each component→component case are needed because fluxes need conservative mapping and states need non-conservative mapping. Because "-P mwf" results in a great deal of output being flushed to screen, it is mostly suppressed by default. To see all the output or to figure out how to run all the nco and tempest remap commands one-by-one, add "--dbg_lvl=2" to the ncremap command. This will print the commands but not execute them. The user can then use one of the printed commands, again with "--dbg_lvl=2" to see the actual commands being sent to Tempest.

If this command is successful, it should produce many mapping files in ${output_root}, that look something like

where ${ocn_grid}, ${atm_grid}, and ${lnd_grid} are the names of the ocean and atmos grid provided above in the --nm_src and --nm_dst arguments, and ${datestring} is the date string provided above in the --dt_sng argument to ncremap.   The ${method} can be monotr, highorder, mono, intbilin, aave, blin, ndtos, nstod, and patc.  (TODO:  What do all these mean?)   If using the tri-grid option, the land grid files will be created from the ${lnd_grd}.  

For pg2 grids, the ncremap invocations are as follows, using ne30pg2 atmosphere, oEC60to30v3 ocean, and half-degree land grids as examples.

4. Generate domain files

Domain files are needed by the coupler and the land model at runtime.    The land model uses the mask to determine where to run and the coupler use the land fraction to merge fluxes from multiple surface types to the atmosphere above them.  Domain files are created from the mapping files created in the previous step, using a tool provided with CIME in ${e3sm_root}/cime/tools/mapping/gen_domain_files. This directory contains the source code for the tool (in Fortran 90) and a Makefile.    Cloning E3SM is now required to obtain code within the distribution.   To clone E3SM, 

Assuming ${e3sm_root} is set and the location of TempestRemap binaries is added to ${PATH}, the following script should generate the appropriate domain files:

NOTE - on Perlmutter the “OS” environment variable is not set, so to work around this simply set “OS=LINUX“
If this command is successful, it should produce many domain files in ${output_root}/domain_file that look something like



5. Generate topography file 

Generating the topography and related surface roughness data sets is a detailed process that has been moved to it’s own page, with detailed instructions depending on the model version (V1, V2, V3)

https://acme-climate.atlassian.net/wiki/spaces/DOC/pages/2720202817

 

6. Generate and spin-up a new atmosphere initial condition

Generating a new initial condition for the atmosphere is a two-step process. First, an existing initial condition is interpolated to the target resolution, then the interpolated initial condition is used to spin-up a new initial condition that is in balance and consistent with the dynamics (I am probably not explaining this very well; bottom line, without this step the model will probably blow up if you try to run with the interpolated initial condition, at least for RRM grids). The spin-up requires initially lowering the timestep and increasing the hyperviscosity, and then gradually relaxing these back to more reasonable values. Both of these steps are described below.

Step 1:  Generating a "first-guess" initial condition

A starting point for a new initial condition is first interpolated from an existing initial condition. Traditionally, this has been done using the interpic_new tool, which does both horizontal (for new grids) and vertical (for potentially different vertical grids/numbers of levels) interpolation of the fields in the initial condition file. This is a Fortran code that is included in E3SM, within the atmosphere tools directory. Unfortunately, the tool only supports interpolating from lat/lon grids, and cannot interpolate from unstructured to unstructured. So, if you want to use this tool to interpolate an existing initial condition to an SE grid, you will have to start with an older initial condition on an FV grid or similar. Thus, the use of interpic_new is NO LONGER SUPPORTED OR RECOMMENDED. The script below for building and using interpic_new is included only to document the process in case someone wanted to revive this workflow. The script will not work on NERSC, as the paths do not exists:

A more flexible approach is to use TempestRemap to do the horizontal interpolation from an existing SE (or FV) grid. This is straightforward. For example, to interpolate from an existing ne120 grid:

Note that although this approach is flexible in the source grid, it does require an existing initial condition with the same vertical grid. However, NCO will now do vertical interpolation, and will also wrap TempestRemap horizontal remapping. This is well documented on the NCO homepage (http://nco.sourceforge.net/nco.html).



Step 2:   Generate the atmosphere initial condition 

There are two general approaches to generating a new atmospheric initial condition file: remapping an existing initial condition file or remapping reanalysis data. Both of these situations can be addressed with the new HICCUP tool.

Generating atmosphere initial condition data using HICCUP

The procedure to generate new initial conditions (outlined here) has been built into into the HICCUP tool (see ). HICCUP is a set of flexible and robust python routines to automate and streamline the task of generating a new atmospheric initial condition for E3SM. Most of these routines are essentially wrappers to NCO commands, but there are also routines that directly modify the data using xarray. HICCUP is setup to use ERA5 reanlaysis data or regrid an existing atmospheric initial condition file (either horizontal, vertical, or both). HICCUP includes a surface adjustment routine that follows the original adjustment routine published by ECMWF, which was also  reproduced in similar tools by Jerry Olson and Wuyin Lin. HICCUP was built with code readability and documentation as a top priority and includes some unit tests. HICCUP has also successfully been used to generate initial conditions for ne1024 grids using large memory analysis nodes available at HPC centers. Development of HICCUP is ongoing and new features are planned, so any feedback or feature requests are welcome. 

The spin-up procedure described below is often not needed when using an initial condition derived from ERA5 reanalysis. However, for certain situations a series of spin-up simulations is required for stability. An example of this is when an RCE initial condition needs to be produced from an aqua planet initial condition.

Spinning up the atmosphere

The following procedure is copied from the recommendations in Mark and Colin's Google Doc https://docs.google.com/document/d/1ymlTgKz2SIvveRS72roKvNHN6a79B4TLOGrypPjRvg0/edit on running new RRM configurations (TODO: clean this up and update):

Determining stable timestep values is also complicated by spinup.  If the initial condition file is very much out of balance (say it came from real planet or a very different aqua planet simulation), then you may require very small timesteps and larger hyperviscosity coefficients in order to get past an initial adjustment period.  Only perform the tuning of se_nsplit and hypervis_subcycle with a spun up initial condition file.

With a high quality grid, usually one can run with the uniform se_nsplit value and a slight increase in hypervis_subcycle.   Standalone HOMME or EAM simulations using the new grid will compute this number and output “Max Dinv-based element distortion” in the log file. The equal-angle cubed-sphere grid has a value of 1.7.   A high quality regionally refined grid will have a value less than 4.

Recommend procedure:

  • Run 10 days with very large se_nsplit and hypervis_subcycle options and create a new IC file (see INITHIST in CAM).  For aqua planet, this step is often not necessary, but for simulations with topography this step is critical. One may also need to reduce dtime and increase the viscosity coefficients.  In an extreme case, something like this might be needed:

  •  

    1. run 5-10 days with dtime 3x smaller than the default  (and viscosity coefficients 2x larger)

    2. restarting from step a, run 5-10 days with dtime 3x smaller and default viscosity coefficients

    3. restarting from step b, run 5-10 days with dtime 2x smaller and default viscosity coefficients

    4. restarting from step c, run 5-10 days with default dtime and se_nsplit 2x larger  

    5. restarting from step d, run with all default parameters

  • Use this new IC file for all future runs below

  • First we determine a stable value of se_nsplit.  To do this, we first ensure the viscosity timestep is not causing problems (this is especially true if nu_div > nu), so start with a large value of hypervis_subcycle, say hypervis_subcycle=20

  • Find the smallest value of se_nsplit for which the code is stable using 1 month runs.  Start with se_nsplit that is used by the corresponding high resolution uniform grid (assuming they all have the same physics timestep, dtime)

  • One a stable value of se_nsplit has been found, decrease hypervis_subcycle until the smallest stable value is found.  Note that these are not independent. You have to find the stable value of se_nsplit before finding the stable value of hypervis_subcycle.

  • Final step:  The procedure outlined above can find timesteps that are borderline unstable, but don’t blow up do to various dissipation mechanisms in CAM.  Hence it is a good idea to run 3 months, and look at the monthly mean OMEGA500 from the 3rd month. This field will be noisy, but there should not be any obvious grid artifacts.  Weak instabilities can be masked by the large transients in flow snapshots, so it best to look at time averages.

  • Note that for simulations with topography, we often increase nu_div.  This can trigger a restrictive CFL condition which requires reducing hypervis_subcycle.  

During this tuning process, it is useful to compare the smallest ‘dx’ from the atmosphere log file to the smallest ‘dx’ from the global uniform high resolution run.  Use the ‘dx’ based on the singular values of Dinv, not the ‘dx’ based on element area. If the ‘dx’ for newmesh.g is 20% smaller than the value from the global uniform grid, it suggests the advective timesteps might need to be 20% smaller, and the viscous timesteps might need to be 44% smaller (they go like dx^2).  The code prints out CFL estimates that are rough approximation that can be used to check if you are in the correct ballpark.

7. Generate land surface data (fsurdat)

Requirements:

Overview:

A large number of input files need to be interpolated for the land model. This (rather awkward) workflow uses a chain of tools that downloads the inputdata and grid descriptor files for each input dataset, generates mapping files from each input data grid to our target grid, and then applies the mapping weight files to do the interpolation.

Notes:

  • the steps below are for maint1-0 code base. Post-v1 release changes (to add phosphorus) broke existing land initial condition files (finidat) and may require changes to this methodology. 

  • the focus here is on creating an fsurdat file in cases where land use land cover change (LULCC) does NOT change. Additional steps will be needed to create a transient LULCC file.

  • questions for the land team are in red

  • NOTE: The land surface generation tool is currently BROKEN in E3SM master, following addition of FAN and fertilizer mods (PR: 5981). A PR was issued to fix this by removing the unsupported FAN dataset from surfdata files, but was subsequently closed (PR: 6237). For the time being, land surface data can be generated by checking out git hash cd22d14bb8 (master immediately before 5981 was merged).

Steps:

  1. Create mapping files for each land surface type if needed. An (older and deprecated) example of doing this can be found here. Updated instructions follow:


    1. Obtain or generate a target grid file in SCRIP format. For these example, we will use a ne1024pg2 grid file, which we will need to create (note that most np4 grid files can be found within the inputdata repository, for example, the ne1024np4 grid file is at https://web.lcrc.anl.gov/public/e3sm/mapping/grids/ne1024np4_scrip_c20191023.nc). To generate the pg2 SCRIP file: 

    2. Get list of input grid files for each land surface input data file. This is done by running the components/elm/tools/mkmapdata/mkmapdata.sh script in debug mode to output a list of needed files (along with the commands that will be used to generate each map file; also make sure GRIDFILE is set to the SCRIP file from the above step): 

    3. Download needed input grid files. The above command will output a list of needed files to clm.input_data_list. We need to download all of these before calling the script without the debug flag to actually perform the mapping. This is possible using check_input_data in CIME, but needs to be done from a dummy case directory. So, one can create a dummy case, cd to that case, and then call ./check_input_data --data-list-dir <path where mkmapdata was run from> --download. However, this failed to connect to the CESM SVN server for me. So instead, I used the following one-off script: 

    4. Create mapping files. Should just be able to run the above mkmapdata.sh command without the –debug --list flags. We need to append the --outputfile-type 64bit_offset flag for our large files (no reason not to do this by default anyways). NOTE - This step requires NCL, which is no longer part of the E3SM unified environement. If the machine you are using does not have an NCL module, creating a custom environement that includes NCL is an easy work around. Fixing this issue to avoid the NCL dependency will require rewriting the rmdups.ncl and mkunitymap.ncl script in another language (python+xarray would make sense). We will also need to write a version of the gc_qarea() function, unless the geocat project writes a port that we can use (see geocat issue #31).

Notes for Perlmutter (2024) - If you come across an error using ESMF that includes “This functionality requires ESMF to be built with the PIO library enabled“ in the log file, then this means you are using a version of ESMF that is not compiled with MPI support. If using a conda env then you can check the type of build using conda list | grep esmf and look for “mpi” or “nompi” in the build tag. The ESMF library can also be problematic when creating the “HYDRO1K“ map (make sure to use the SCRIP version of that grid!). Recent attempts to create this map resulted in segmentation faults, which seem to be a problem only in recent versions. In particular, version 8.6 was found to be problematic, but rolling back to version 8.0.1 was successful.

  1. Compile surface dataset source code
    NOTE: previously, editing the makefile ${e3sm_root}/components/elm/tools/mksurfdata_map/src/Makefile.common was required on most machines, but this was fixed by https://github.com/E3SM-Project/E3SM/pull/2757):

    Note for Perlmutter (Mar 2024 - Walter Hannah) - the build worked using the command below:



  2. Run the mksurfdata.pl script in "debug" mode to generate the namelist (use year 2010 on ne120np4 grids as an example). 

    (However, ./mksurfdata.pl -h shows -y is by default 2010. When running without "-y" option, standard output says sim_year 2000. I suspect the mksurfdata.pl help information is wrong. To be confirmed.)


  3. Modify namelist file
    (Should the correct namelist settings be automatically picked up if the default land build name list settings are modified accordingly?)

    Time-evolving Land use land cover change (LULCC) data should not be used for fixed-time compsets, but the LULCC information for that particular year should be used (right?)
    Manually change to mksrf_fvegtyp = '/global/cfs/cdirs/e3sm/inputdata/lnd/clm2/rawdata/AA_mksrf_landuse_rc_1850-2015_06062017_LUH2/AA_mksrf_landuse_rc_2010_06062017.nc' for the F2010 ne120 compset.


  4. Create the land surface data by interactive or batch job

    The land surface data in NetCDF format will be created at current directory. (How to verify the file is correct?)



8. Generate a new land initial condition (finidat)

Three options:

  • cold start:  finidat="", no file necessary.  Lets us get up and running, but not suitable for climate science applications

  • Interpolate a spunup state from a previous simulation.  This is reasonable for many applications, but not suitable for official published E3SM simulations.

  • spin-up a new initial condition following best practices from land model developers.  

From @Peter Thornton via email. We may end up changing what we say, but this is a start.

what I would recommend in general, is to start the spin-up process with a land cold-start condition, using reanalysis data atmosphere and having all the other land settings just the way you want them for the eventual coupled simulation (resolution, domain file, land physics and BGC settings). Then run for enough years to get an approximate steady state (the number depends on what land options you are using – no BGC means a lot fewer years). Then use the resulting restart file as the initial condition for a coupled run that has at least the same atmosphere settings you will eventually use for your production run. Save high-frequency output for multiple years (preferably 10 or more, but a high res run might not have that luxury). Use that atm output to drive a second offline land spin-up, to equilibrate the land to the expected initial climate from the atmosphere. Then use the resulting land restart as finidat for the start of your fully-coupled spin-up simulation, and let it run for a while to assess drifts in all coupled components. Only after you are satisfied that everything is stable are you in a safe state to begin a science experiment production run. Off-the-shelf finidat file is not likely to save you much time in this process, because it will not be spun up to your experimental conditions and tolerances.

9. Create a new atmospheric dry deposition file

From the README for mkatmsrffile tool at components/eam/tools/mkatmsrffile:

Atmospheric drydeposition at the surface depends on certain surface
properties including soil and land use properties. In most cases
these calculations can be handled in the land model and passed to he
atmosphere through the coupler. This is the default namelist setting
drydep_method='xactive_lnd'. However with modal areosols this method
is not adequate and we must recalculate these fields in the atmosphere
(see subroutine interp_map in mo_drydep.F90). For unstructured grids
it was determined to create this offline interpolation tool rather
than generalize the subroutine interp_map.

Following are the steps for building and running an executable for this tool:

  1. Change directory to tool root:
    cd components/eam/tools/mkatmsrffile

  2. Create a .env_mach_specific.sh by running
    ../../../../cime/tools/configure --macros-format=Makefile

  3. Get machine-specific environment settings via
    source .env_mach_specific.sh

  4. Make sure NETCDF_ROOT and FC environment variables are set right for your system, and build the executable:

    1. On Cori:  env NETCDF_ROOT=$NETCDF_DIR FC=ifort make

  5. Edit "nml_atmsrf" to update the input file paths

  6. Run the newly built executable





    This will produce a drydep file. Following input files were used for generating a new dry deposition file:



srfFileName: /project/projectdirs/e3sm/mapping/grids/1x1d.nc

atmFileName: /project/projectdirs/e3sm/mapping/grids/ne30np4_pentagons.091226.nc

landFileName: /project/projectdirs/e3sm/inputdata/atm/cam/chem/trop_mozart/dvel/regrid_vegetation.nc

soilwFileName: /project/projectdirs/e3sm/inputdata/atm/cam/chem/trop_mozart/dvel/clim_soilw.nc

srf2atmFmapname:/project/projectdirs/e3sm/bhillma/grids/ne30np4/atmsrffile/map_1x1_to_ne30np4_aave.nc

Note that if using Tempest Remap to provide mapping files, the above mapping file should be replaced with something that looks like map_1x1_to_ne30np4_mono.nc.

Output file produced using the above procedure was compared against an existing file (/project/projectdirs/e3sm/inputdata/atm/cam/chem/trop_mam/atmsrf_ne30np4_110920.nc​) using a script from @Peter Caldwell. Following figures show the comparison:



10. Create a new compset and/or new supported grid by modifying CIME's xml files

???



11. Implement tests for the new grid

(TODO:  develop this section.  One example might be to include a case-building test as done in ${e3sm_root}/cime/config/e3sm/tests.py)

 

12. Adding a new ocean and sea-ice mesh

If you wish to add a new ocean and sea-ice mesh in addition to (or instead of) a new atmosphere grid, you will need to use the compass tool to generate the mesh and dynamically adjusted initial condition. This procedure is detailed in a separate tutorial:

The new ocean/sea ice mesh should be generated before generating mapping and domain files (steps 3. and 4. above).


Summary

After reading through the info above on Jan 8, 2019 , I (@Peter Caldwell) created lists of stuff we should create tests for, merge to master, and revise to avoid dual-grid dependency. @Ben Hillman - am I missing anything?

Tools we should create tests for:

  1. TempestRemap for generating uniform grids
    (in Paul’s external git repo - may have its own tests?)

  2. SQuadGen for generating RRM grids
    (in Paul’s external repo - may have its own tests?)

  3. Generate topography via )

    1. needs utilities: components/eam/tools/topo_tool/cube_to_target and comopnents/homme/test/tool

  4. run ncremap (an NCO command) to generate mapping files

  5. cime/tools/mapping/gen_domain_files

  6. mksurfdata.pl to generate the namelist needed to make fsurdat file

  7. use mksurfdata_map for fsurdat

  8. use the interpic_new tool to regrid atmos state to new grid for initial condition



Stuff on branches that we need to get on master:

  1. branch brhillman/add-grid-scripts for the matlab script used to create the dual grid.

  2. PR #2633 to generate domain files without needing the dual grid?

  3. PR #2706 to add command line interface to topography tool to not have to edit source code by hand and recompile to compute subgrid surface roughness



Stuff requiring ESMF and/or the dual grid:

  1. generate mapping files needs ESM_GegridWeightGen?

  2. generate topography file

Tools that could use some clean-up:

  1. smoothtopo.job script used to run HOMME to apply dycore-specific smoothing to interpolated topography. It would be nice for this to be able to run via command line arguments rather than having to edit the script (which should make this easier to include in an automated workflow), and we should remove dependence on NCL since this is not guaranteed to be available.

    1. Replaced with “homme_tool”, 2020/5. see (see )

  2. makegrid.job script used to run HOMME+NCL to produce the non-optimized dualgrid and latlon descriptions of the SE grid. Again, it would be nice for this to be able to run via command line arguments rather than having to edit the script (which should make this easier to include in an automated workflow), and we should remove dependence on NCL since this is not guaranteed to be available.

    1. TR and PG2 grids make this obsolete - we now longer need the “dualgrid”.

  3. Land surface data scripts (TODO: add specifics about what needs to change here)