Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/hafs rtcases #646

Merged
merged 9 commits into from
Oct 27, 2023

Conversation

JingCheng-NOAA
Copy link
Contributor

@JingCheng-NOAA JingCheng-NOAA commented Oct 23, 2023

Add HAFS related regression test into GSI Ctests to solve the issue #600.

Four set of regression tests for the current suite of HAFSv1 GSI:

3DEnvar with GDAS Ensemble plus FGAT capability -- "hafs_3denvar_glbens".
4Denvar with GDAS Ensemble plus FGAT capability -- "hafs_4denvar_glbens".
3Envar with self-cycled HAFS Ensemble -- "hafs_3denvar_hafens".
3Envar with GDAS Ensemble plus self-cycled HAFS Ensemble -- "hafs_3denvar_hybens".

Fixes #600
Partially fixes #647

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

These changes are tested under Orion through GSI Ctests.

Checklist

  • [x ] My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • New and existing tests pass with my changes
  • Any dependent changes have been merged and published

DUE DATE for this PR is 12/4/2023. If this PR is not merged into develop by this date, the PR will be closed and returned to the developer.

@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 23, 2023 via email

@RussTreadon-NOAA
Copy link
Contributor

Ran hafs ctests on Orion after removing lines 7-1 from regression/regression_vars.sh .

Orion-login-3:/work2/noaa/da/rtreadon/git/gsi/pr645/build$ ctest -R hafs_
Test project /work2/noaa/da/rtreadon/git/gsi/pr645/build
    Start  9: hafs_3denvar_glbens
1/4 Test  #9: hafs_3denvar_glbens ..............   Passed  1873.75 sec
    Start 10: hafs_4denvar_glbens
2/4 Test #10: hafs_4denvar_glbens ..............   Passed  2302.39 sec
    Start 11: hafs_3denvar_hafens
3/4 Test #11: hafs_3denvar_hafens ..............***Failed  1758.92 sec
    Start 12: hafs_3denvar_hybens

The hafs_3denvar_hybens test did not finish because the loproc_contrl job hit the specified 15 minute wall clock time. The updat jobs finished within 15 minutes

Orion-login-3:/work/noaa/stmp/rtreadon/pr645/tmpreg_hafs_3denvar_hybens$ grep wall */stdout
hafs_3denvar_hybens_hiproc_updat/stdout:The total amount of wall time                        = 669.299156
hafs_3denvar_hybens_loproc_updat/stdout:The total amount of wall time                        = 526.743247

The slower running contrl job suggests an Orion workload issue since PR #646 does not alter gsi.x. The updat and contrl gsi.x are functionally equivalent.

The hafs_3denvar_hafens failure is due to the scalability test

The case has Failed the scalability test.
The slope for the update (156.030943 seconds per node) is less than that for the control (167.408557 seconds per node).

A check of the updat and contrl wall times shows that the loproc_contrl took noticeably longer time than the loproc_updat.

hafs_3denvar_hafens_hiproc_contrl/stdout:The total amount of wall time                        = 307.140940
hafs_3denvar_hafens_hiproc_updat/stdout:The total amount of wall time                        = 304.474938
hafs_3denvar_hafens_loproc_contrl/stdout:The total amount of wall time                        = 474.549497
hafs_3denvar_hafens_loproc_updat/stdout:The total amount of wall time                        = 429.299693

Since PR #646 does not alter source code, this difference is due to the Orion system load. Hence, this is a non-critical failure.

…e 4denvar and 3denvar_hybens cases.Removed the 3denvar_glbens and 3denvar_hafens cases.
@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 24, 2023

I've reduced the total number of HAFS related regression tests to 2 after discussed with Shun.

@ShunLiu-NOAA
Copy link
Contributor

@yonghuiweng Could you please review this PR and make regression test on WCOSS and other HPCs that HAFS is using. Thank you.

@hu5970
Copy link
Collaborator

hu5970 commented Oct 25, 2023

@JingCheng-NOAA For verification, you can use the same check as rrfs:

In "GSI/regression/regression_test.sh", you can change the following two lines:

351    elif [[ `expr substr $exp1 1 4` = "rrfs" ]]; then
523 elif [[ `expr substr $exp1 1 4` = "rrfs" ]]; then

Add "hafs":

351    elif [[ `expr substr $exp1 1 4` = "rrfs" ]] || [[ `expr substr $exp1 1 4` = "hafs" ]]; then
523 elif [[ `expr substr $exp1 1 4` = "rrfs" ]] || [[ `expr substr $exp1 1 4` = "hafs" ]]; then

Thanks,
Ming

regression/CMakeLists.txt Outdated Show resolved Hide resolved
@RussTreadon-NOAA
Copy link
Contributor

RussTreadon-NOAA commented Oct 25, 2023

Hera ctests

FYI, Hera /scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645 contains a working copy of feature/hafs_rtcases with the following local modifications

        modified:   regression/CMakeLists.txt 
        modified:   regression/multi_regression.sh
        modified:   regression/regression_namelists.sh
        modified:   regression/regression_namelists_db.sh
        modified:   regression/regression_param.sh
        modified:   regression/regression_var.sh
        deleted:    regression/global_3dvar.sh
        deleted:    regression/global_4dvar.sh
        deleted:    regression/hwrf_nmm_d2.sh
        deleted:    regression/hwrf_nmm_d3.sh

These changes remove references to the global_3dvar, global_4dvar, hwrf_nmm_d2, and hwrf_nmm_d3 regression tests.

The data for the hafs tests has been sync'd to Hera. The remaining 7 ctests were run with the following results.

Hera(hfe05):/scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build$ ctest -j 7
Test project /scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build
    Start 1: global_4denvar
    Start 2: rtma
    Start 3: rrfs_3denvar_glbens
    Start 4: netcdf_fv3_regional
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
    Start 7: global_enkf
1/7 Test #3: rrfs_3denvar_glbens ..............   Passed  550.21 sec
2/7 Test #4: netcdf_fv3_regional ..............   Passed  967.87 sec
3/7 Test #7: global_enkf ......................   Passed  1441.42 sec
4/7 Test #2: rtma .............................   Passed  1632.62 sec
5/7 Test #6: hafs_3denvar_hybens ..............***Failed  1936.57 sec
6/7 Test #1: global_4denvar ...................   Passed  2210.41 sec
7/7 Test #5: hafs_4denvar_glbens ..............***Failed  2240.49 sec

71% tests passed, 2 tests failed out of 7

Total Test time (real) = 2240.51 sec

The following tests FAILED:
          5 - hafs_4denvar_glbens (Failed)
          6 - hafs_3denvar_hybens (Failed)

The hafs failures are non-fatal failures. Both tests failed the maxmem check

The memory for hafs_3denvar_hybens_loproc_updat is 2561532 KBs.  This has exceeded maximum allowable hardware memory limit of 2516582 KBs,
resulting in Failure maxmem of the regression test.
The memory for hafs_4denvar_glbens_loproc_updat is 2878960 KBs.  This has exceeded maximum allowable hardware memory limit of 2516582 KBs,
resulting in Failure maxmem of the regression test.

This check is misleading. The threshold is set to the per node memory divided by the number of cores. For Hera this is 96 Gb divided by 40 cores (2516582 Kb or 2.4 Gb). The hafs tests run with 20 tasks per Hera node. As such, neither hafs tests comes close to exceeding the 96 Gb limit for Hera nodes.

As @hu5970 mentioned, some of the current regression test checks should either be removed or revised. The maxmem and timing scalability tests can probably be removed.

@JingCheng-NOAA
Copy link
Contributor Author

FYI, Hera /scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645 contains a working copy of feature/hafs_rtcases with the following local modifications

        modified:   regression/CMakeLists.txt 
        modified:   regression/multi_regression.sh
        modified:   regression/regression_namelists.sh
        modified:   regression/regression_namelists_db.sh
        modified:   regression/regression_param.sh
        modified:   regression/regression_var.sh
        deleted:    regression/global_3dvar.sh
        deleted:    regression/global_4dvar.sh
        deleted:    regression/hwrf_nmm_d2.sh
        deleted:    regression/hwrf_nmm_d3.sh

These changes remove references to the global_3dvar, global_4dvar, hwrf_nmm_d2, and hwrf_nmm_d3 regression tests.

The data for the hafs tests has been sync'd to Hera. The remaining 7 ctests were run with the following results.

Hera(hfe05):/scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build$ ctest -j 7
Test project /scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build
    Start 1: global_4denvar
    Start 2: rtma
    Start 3: rrfs_3denvar_glbens
    Start 4: netcdf_fv3_regional
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
    Start 7: global_enkf
1/7 Test #3: rrfs_3denvar_glbens ..............   Passed  550.21 sec
2/7 Test #4: netcdf_fv3_regional ..............   Passed  967.87 sec
3/7 Test #7: global_enkf ......................   Passed  1441.42 sec
4/7 Test #2: rtma .............................   Passed  1632.62 sec
5/7 Test #6: hafs_3denvar_hybens ..............***Failed  1936.57 sec
6/7 Test #1: global_4denvar ...................   Passed  2210.41 sec
7/7 Test #5: hafs_4denvar_glbens ..............***Failed  2240.49 sec

71% tests passed, 2 tests failed out of 7

Total Test time (real) = 2240.51 sec

The following tests FAILED:
          5 - hafs_4denvar_glbens (Failed)
          6 - hafs_3denvar_hybens (Failed)

The hafs failures are non-fatal failures. Both tests failed the maxmem check

The memory for hafs_3denvar_hybens_loproc_updat is 2561532 KBs.  This has exceeded maximum allowable hardware memory limit of 2516582 KBs,
resulting in Failure maxmem of the regression test.
The memory for hafs_4denvar_glbens_loproc_updat is 2878960 KBs.  This has exceeded maximum allowable hardware memory limit of 2516582 KBs,
resulting in Failure maxmem of the regression test.

This check is misleading. The threshold is set to the per node memory divided by the number of cores. For Hera this is 96 Gb divided by 40 cores (2516582 Kb or 2.4 Gb). The hafs tests run with 20 tasks per Hera node. As such, neither hafs tests comes close to exceeding the 96 Gb limit for Hera nodes.

As @hu5970 mentioned, some of the current regression test checks should either be removed or revised. The maxmem and timing scalability tests can probably be removed.

Previously I only tested it on Orion. So the settings for Hera may be off. Please feel free to change it or I can make changes as well.

@RussTreadon-NOAA
Copy link
Contributor

RussTreadon-NOAA commented Oct 25, 2023

Hera ctests (continued)

Remove maxmem and timing scalability tests from regression_test.sh and regression_test_enkf.sh in the above mentioned Hera working copy of hafs-community:feature/hafs_rtcases. Rerun seven ctests.

Hera(hfe03):/scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build$ ctest -j 7
Test project /scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build
    Start 1: global_4denvar
    Start 2: rtma
    Start 7: global_enkf
    Start 4: netcdf_fv3_regional
    Start 3: rrfs_3denvar_glbens
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
1/7 Test #4: netcdf_fv3_regional ..............   Passed  491.83 sec
2/7 Test #3: rrfs_3denvar_glbens ..............   Passed  675.13 sec
3/7 Test #7: global_enkf ......................   Passed  949.82 sec
4/7 Test #6: hafs_3denvar_hybens ..............   Passed  2182.70 sec
5/7 Test #5: hafs_4denvar_glbens ..............   Passed  2308.16 sec
6/7 Test #2: rtma .............................   Passed  2474.11 sec
7/7 Test #1: global_4denvar ...................   Passed  3053.83 sec

100% tests passed, 0 tests failed out of 7

Total Test time (real) = 3053.84 sec

All tests pass. This is an expected result since this PR does not alter code, namelists, modules, or scripts.. This test is relevant to GSI issue #647 and is cross referenced there.

@hu5970
Copy link
Collaborator

hu5970 commented Oct 25, 2023

Seems Hera and Orion are good for testing. Is any one working on WCOSS2?

@ShunLiu-NOAA
Copy link
Contributor

As suggested on Monday's meeting, someone from HAFS DA might run the test on WCOSS to get familiar with HAFS DA regression test. Yonghui might make a regression test on WCOSS once he gets chance.

@RussTreadon-NOAA
Copy link
Contributor

Seems Hera and Orion are good for testing. Is any one working on WCOSS2?

HAFS data needs to be rsync'd to WCOSS2.

@yonghuiweng
Copy link

yonghuiweng commented Oct 25, 2023 via email

@RussTreadon-NOAA
Copy link
Contributor

WCOSS2 (Dogwood) tests
Do the following

  • Rsync Hera working copy of hafs_rtcases to Dogwood /lfs/h2/emc/da/noscrub/russ.treadon/git/gsi/pr645
  • Update with current head of feature/hafs_rtcases
  • Rsync hafs regression test data to Dogwood
  • Build feature/hafs_rtcases
  • Run seven ctests. As shown below, all tests pass. This is an expected result.
russ.treadon@dlogin07:/lfs/h2/emc/da/noscrub/russ.treadon/git/gsi/pr645/build> ctest -j 7
Test project /lfs/h2/emc/da/noscrub/russ.treadon/git/gsi/pr645/build
    Start 1: global_4denvar
    Start 2: rtma
    Start 3: rrfs_3denvar_glbens
    Start 4: netcdf_fv3_regional
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
    Start 7: global_enkf
1/7 Test #4: netcdf_fv3_regional ..............   Passed  482.59 sec
2/7 Test #7: global_enkf ......................   Passed  610.86 sec
3/7 Test #3: rrfs_3denvar_glbens ..............   Passed  666.14 sec
4/7 Test #2: rtma .............................   Passed  1088.11 sec
5/7 Test #1: global_4denvar ...................   Passed  1381.85 sec
6/7 Test #6: hafs_3denvar_hybens ..............   Passed  1448.17 sec
7/7 Test #5: hafs_4denvar_glbens ..............   Passed  1630.54 sec

100% tests passed, 0 tests failed out of 7

Total Test time (real) = 1630.56 sec

As a reminder, my modified working copy of feature/hafs_rtcases removes the maxmem and timing scalability tests.

The two hafs tests have the longest run time in the ctest suite. Notice that the hafs tests run two outer loops with each inner loop containing 50 iterations. Can we reduce the number of inner iterations? Reducing the inner loop iteration count is an easy way to reduce wall time.

The global_4denvar test runs 2 outer loops with 5 iterations on the first outer loop and 10 on the second outer loop. The global_4denvar test could reduce the second outer loop iterations to 5.

We should check the number of inner loop iterations used in other ctests.

@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 26, 2023 via email

@RussTreadon-NOAA
Copy link
Contributor

Orion ctests

Repeat on Orion the steps followed on Hera and WCOSS2. All seven ctests pass

Orion-login-4:/work2/noaa/da/rtreadon/git/gsi/pr645/build$ ctest -j 7
Test project /work2/noaa/da/rtreadon/git/gsi/pr645/build
    Start 1: global_4denvar
    Start 2: rtma
    Start 3: rrfs_3denvar_glbens
    Start 4: netcdf_fv3_regional
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
    Start 7: global_enkf
1/7 Test #4: netcdf_fv3_regional ..............   Passed  542.43 sec
2/7 Test #7: global_enkf ......................   Passed  547.92 sec
3/7 Test #3: rrfs_3denvar_glbens ..............   Passed  604.99 sec
4/7 Test #2: rtma .............................   Passed  1208.90 sec
5/7 Test #1: global_4denvar ...................   Passed  1561.90 sec
6/7 Test #6: hafs_3denvar_hybens ..............   Passed  1873.67 sec
7/7 Test #5: hafs_4denvar_glbens ..............   Passed  2114.52 sec

100% tests passed, 0 tests failed out of 7

Total Test time (real) = 2114.54 sec

@RussTreadon-NOAA
Copy link
Contributor

Thank you @JingCheng-NOAA for being open to reducing the number of inner loop iterations in the hafs tests. I'll reduce the inner loop iterations in other tests and rerun the ctests to quantify the run time impact.

@RussTreadon-NOAA
Copy link
Contributor

Reduced inner loop iterations

Reduce the number of inner loop iterations for several tests. Rerun ctests on Orion, Hera, and WCOSS2 (Dogwood) with the following results.

Orion

Orion-login-4:/work2/noaa/da/rtreadon/git/gsi/pr645/build$ ctest -j 7
Test project /work2/noaa/da/rtreadon/git/gsi/pr645/build
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
    Start 1: global_4denvar
    Start 2: rtma
    Start 3: rrfs_3denvar_glbens
    Start 7: global_enkf
    Start 4: netcdf_fv3_regional
1/7 Test #4: netcdf_fv3_regional ..............   Passed  482.46 sec
2/7 Test #7: global_enkf ......................   Passed  488.62 sec
3/7 Test #3: rrfs_3denvar_glbens ..............   Passed  545.26 sec
4/7 Test #2: rtma .............................   Passed  969.11 sec
5/7 Test #6: hafs_3denvar_hybens ..............   Passed  1273.80 sec
6/7 Test #5: hafs_4denvar_glbens ..............   Passed  1452.80 sec
7/7 Test #1: global_4denvar ...................   Passed  1562.07 sec

100% tests passed, 0 tests failed out of 7

Total Test time (real) = 1562.08 sec

The previous round of ctests on Orion took 2114.54 sec.

Hera

Hera(hfe10):/scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build$ ctest -j 7
Test project /scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645/build
    Start 1: global_4denvar
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
    Start 2: rtma
    Start 7: global_enkf
    Start 4: netcdf_fv3_regional
    Start 3: rrfs_3denvar_glbens
1/7 Test #4: netcdf_fv3_regional ..............   Passed  676.55 sec
2/7 Test #3: rrfs_3denvar_glbens ..............   Passed  733.49 sec
3/7 Test #7: global_enkf ......................   Passed  985.35 sec
4/7 Test #2: rtma .............................   Passed  1218.54 sec
5/7 Test #6: hafs_3denvar_hybens ..............   Passed  1406.38 sec
6/7 Test #5: hafs_4denvar_glbens ..............   Passed  1464.25 sec
7/7 Test #1: global_4denvar ...................   Passed  2160.74 sec

100% tests passed, 0 tests failed out of 7

Total Test time (real) = 2160.75 sec

The previous round of ctests on Hera took 3053.84 sec.

WCOSS2 (Dogwood)

russ.treadon@dlogin08:/lfs/h2/emc/da/noscrub/russ.treadon/git/gsi/pr645/build> ctest -j 7
Test project /lfs/h2/emc/da/noscrub/russ.treadon/git/gsi/pr645/build
    Start 1: global_4denvar
    Start 5: hafs_4denvar_glbens
    Start 6: hafs_3denvar_hybens
    Start 2: rtma
    Start 7: global_enkf
    Start 3: rrfs_3denvar_glbens
    Start 4: netcdf_fv3_regional
1/7 Test #4: netcdf_fv3_regional ..............   Passed  482.63 sec
2/7 Test #3: rrfs_3denvar_glbens ..............   Passed  485.01 sec
3/7 Test #7: global_enkf ......................   Passed  609.06 sec
4/7 Test #2: rtma .............................   Passed  968.42 sec
5/7 Test #6: hafs_3denvar_hybens ..............   Passed  1208.72 sec
6/7 Test #5: hafs_4denvar_glbens ..............   Passed  1209.08 sec
7/7 Test #1: global_4denvar ...................   Passed  1322.42 sec

100% tests passed, 0 tests failed out of 7

Total Test time (real) = 1322.45 sec

The previous round of ctests on Dogwood took 1630.56 sec.

The modified working directory for hafs_rtcases including changes in the number of inner loop iterations is found in the following locations:

  • Orion: /work2/noaa/da/rtreadon/git/gsi/pr645
  • Hera: /scratch1/NCEPDEV/da/Russ.Treadon/git/gsi/pr645
  • Dogwood: /lfs/h2/emc/da/noscrub/russ.treadon/git/gsi/pr645

@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 26, 2023 via email

@RussTreadon-NOAA
Copy link
Contributor

I updated a working copy of feature/hafs_rtcases on Orion in /work2/noaa/da/rtreadon/git/gsi/update with my changes. I committed the changes to my local working copy.

Orion-login-4:/work2/noaa/da/rtreadon/git/gsi/update$  git log | head
commit 1faf99ab9c40ad40403bda7b0ba0c3eed9457be9
Author: RussTreadon-NOAA <[email protected]>
Date:   Thu Oct 26 13:35:54 2023 +0000

    clean up, optimize, and simplify regression tests (#600, #647)

I can not push the commit to hafs-community:feature/hafs_rtcases. I do not have write permission to this repository.

If the changes in Orion work2/noaa/da/rtreadon/git/gsi/update are acceptable to the team, would someone with write permission please commit these changes to hafs-community:feature/hafs_rtcases.

Thank you.

@JingCheng-NOAA
Copy link
Contributor Author

I updated a working copy of feature/hafs_rtcases on Orion in /work2/noaa/da/rtreadon/git/gsi/update with my changes. I committed the changes to my local working copy.

Orion-login-4:/work2/noaa/da/rtreadon/git/gsi/update$  git log | head
commit 1faf99ab9c40ad40403bda7b0ba0c3eed9457be9
Author: RussTreadon-NOAA <[email protected]>
Date:   Thu Oct 26 13:35:54 2023 +0000

    clean up, optimize, and simplify regression tests (#600, #647)

I can not push the commit to hafs-community:feature/hafs_rtcases. I do not have write permission to this repository.

If the changes in Orion work2/noaa/da/rtreadon/git/gsi/update are acceptable to the team, would someone with write permission please commit these changes to hafs-community:feature/hafs_rtcases.

Thank you.

Hi Russ,
I will work on commit these changes to 'hafs-community:feature/hafs_rtcases'. But right now I don't have permission to access the directory: work2/noaa/da/rtreadon/git/gsi/update. Could you please grant me permission to access that folder?
Jing

@RussTreadon-NOAA
Copy link
Contributor

@JingCheng-NOAA , please try again. I executed a chmod 755 on the directory.

@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 26, 2023 via email

@JingCheng-NOAA
Copy link
Contributor Author

I updated a working copy of feature/hafs_rtcases on Orion in /work2/noaa/da/rtreadon/git/gsi/update with my changes. I committed the changes to my local working copy.

Orion-login-4:/work2/noaa/da/rtreadon/git/gsi/update$  git log | head
commit 1faf99ab9c40ad40403bda7b0ba0c3eed9457be9
Author: RussTreadon-NOAA <[email protected]>
Date:   Thu Oct 26 13:35:54 2023 +0000

    clean up, optimize, and simplify regression tests (#600, #647)

I can not push the commit to hafs-community:feature/hafs_rtcases. I do not have write permission to this repository.

If the changes in Orion work2/noaa/da/rtreadon/git/gsi/update are acceptable to the team, would someone with write permission please commit these changes to hafs-community:feature/hafs_rtcases.

Thank you.

Hi Russ, I know you've removed the the maxmem and timing scalability tests in "regression_test_enkf.sh" and "regression_test.sh" for test purpose. I shouldn't include the changes in these two scripts, correct?

@RussTreadon-NOAA
Copy link
Contributor

@JingCheng-NOAA , after talking with @ShunLiu-NOAA it's my impression that we want to bring all my modifications into your branch. This incudes changes to regression_test.sh and regression_test_enkf.sh.

@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 26, 2023 via email

ShunLiu-NOAA
ShunLiu-NOAA previously approved these changes Oct 26, 2023
@ShunLiu-NOAA
Copy link
Contributor

Thank you @RussTreadon-NOAA and @JingCheng-NOAA.
Jing, could you please run regression test with all modifications in your branch?
@hu5970 For this PR, is there anything more we still should test?

@RussTreadon-NOAA
Copy link
Contributor

@JingCheng-NOAA , we should git rm the following files

regression/global_3dvar.sh
regression/global_4dvar.sh

in hafs-community:feature/hafs_rtcases. global_4denvar is the only global gsi test executed by ctest.

@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 26, 2023 via email

@RussTreadon-NOAA RussTreadon-NOAA self-requested a review October 26, 2023 16:16
Copy link
Contributor

@RussTreadon-NOAA RussTreadon-NOAA left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rerun ctests on Hera, Orion, and Dogwood. All 7 tests pass on each machine.

Approve pending input from others.

  • @hu5970 , do you have any regression test changes you want to commit to hafs-community:feature/hafs_rtcases?
  • @JingCheng-NOAA , do you have additional changes you would like to make to your branch?

@JingCheng-NOAA
Copy link
Contributor Author

JingCheng-NOAA commented Oct 26, 2023 via email

@ShunLiu-NOAA ShunLiu-NOAA merged commit 2cb0f5b into NOAA-EMC:develop Oct 27, 2023
4 checks passed
@BinLiu-NOAA BinLiu-NOAA deleted the feature/hafs_rtcases branch December 12, 2023 02:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

update the regression test checks Add Regression Test For HAFSv1
5 participants