...
RESULTS: All tests pass with roundoff level changes to c_k10h, and fail (detect statistically different results) with small changes in c_k10h. With the default thresholds, PGN is the most sensitive (detecting a statistical difference with a 1e-8 change, followed by TSC at 1e-2, and MVK at 2e-2). These results are correlated with the timescale of each test, with PGN looking at physics columns after 1 timsteptimestep, TSC time step convergence with 300 timesteps, and MVK examining 1 year climatologies.
...
2022/6 update: TSC PASS/FAIL criterion still broken. ( https://github.com/E3SM-Project/E3SM/issues/4759 )
MVK
30 member ensemble of ~ 1 year simulations. Takes about 1.3 hours on 18 nodes.
Hack to reuse same “-c” case to run multiple experiments:
rm -f run/*.nc (otherwise we get PIO run time errors)
add “clubb_c_k10h=0.40” to user_nl_eam_???? files
./case.submit
MVK_P24x1.ne4_oQU240.F2010-CICE (Compy, 2011/11)
c_10kh | Test Result | TestStatus.log Metrics threshold=13 |
---|---|---|
0.35 (default) | PASS | |
0.36 | PASS | reject 7/121 |
0.38 | FAIL | reject 21/121 |
0.40 | FAIL | reject 50/121 |
MVK_P36x1.ne4_oQU240.F2010 (Anvil 2022/6)
(note: switch to F2010 case with MPAS sea ice)
c_10kh | Test Result | TestStatus.log Metrics threshold=13 |
---|---|---|
0.35 (default) | PASS | |
0.36 | PASS | reject 9/121 |
0.38 | FAIL | reject 34/121 |
PGN
20 member ensemble of ~ 1 timestep simulations. Takes about 1 min on 16 nodes.
Hack to reuse same “-c” case to run multiple experiments:
rm -f run/*.nc (otherwise we get PIO run time errors)
add “clubb_c_k10h=0.40” to user_nl_eam_???? files
./case.submit
PGN_P32x1.ne4_oQU240.F2010
...
20 member ensemble of ~ 1 timestep simulations. Takes about 1 min on 16 nodes.
(Compy, 2011/11)
c_10kh | Test Result | TestStatus.log Metrics T test (t,p) |
---|---|---|
0.35 (default) | PASS | (0.000, 1.000) |
0.350000001d0 | PASS | (-1.424, 0.169) |
0.35000001 | FAIL | (-2.542, 0.019); |
0.36 | FAIL | (-12.564, 0.000); |
PGN_P32x1.ne4_oQU240.F2010 (Anvil, 2022/6)
c_10kh | Test Result | TestStatus.log Metrics T test (t,p) |
---|---|---|
0.350000001d0 | PASS | (-1.424, 0.169) |
0.350000005d0 | PASS | (-1.549, 0.136) |
0.35000001 | FAIL | (-2.542, 0.019) |
0.35000002 | FAIL | (-2.619, 0.016) |
TSC
12 member ensemble of 5 day simulations. Takes about 10min on 11 nodes.
Hack to reuse same a “-c” case to run multiple experiments :
...
rm -f run/*.nc (otherwise we get PIO run time errors)
...
used in MVK and PGN tests does not work. For TSC, during the RUN phase, the user_nl_eam_???? ensemble member namelists will be created anew by cime/scripts/lib/CIME/SystemTests/tsc.py. This script can be edited to append user_nl_eam to each user_nl_eam_????
...
file ( gist for tsc.py patch file ) , and then one can set parameters in user_nl_eam.
link to PASS/FAIL post processing script: https://github.com/LIVVkit/evv4esm/blob/master/evv4esm/extensions/tsc.py
TSC_P36x1.ne4_ne4.F2010-CICE
...
(Compy, 2011/11)
c_10kh | Test Result (possible bug in scripts? fails for all values except when results are bfb) | Hui Wan 's criterion: PASS, unless all points in p_min plot in [5min,10min] range are FAIL | TestStatus.log Metrics region by region results Global, Land, Ocean |
---|---|---|---|
0.35 (default) | PASS | PASS | PASS, PASS, PASS |
0.350001 | FAIL | PASS | PASS, PASS, PASS |
0.35001 | FAIL | PASS | FAIL, PASS, PASS pmin plot |
0.3501 | FAIL | PASS | PASS, PASS, PASS pmin plot |
0.351 | FAIL | PASS | PASS, FAIL, PASS pmin plot |
0.36 | FAIL | FAIL | FAIL, FAIL, FAIL pmin.36.png |
0.37 | FAIL | FAIL | FAIL, FAIL, FAIL |
...
TSC
...
link to PASS/FAIL post processing script:
...
_P36x1.ne4_ne4.F2010-CICE (Anvil, 2022/6)
note: test aborts in MPAS sea ice if we use F2010
c_10kh | Test Result (possible bug in scripts? ) | Hui Wan 's criterion: PASS, unless all points in p_min plot in [5min,10min] range are FAIL |
---|---|---|
0.35 (default) | PASS | PASS |
0.351 | PASS | PASS |
0.36 | PASS | FAIL |
0.38 | PASS | FAIL |