Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wrapper PR for #479, #508, #507: zorl interstitial, ocn -> wat, merra2 threading #520

Merged

Conversation

climbfuji
Copy link
Collaborator

@climbfuji climbfuji commented Apr 14, 2021

PR Checklist

  • Ths PR is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR. Please consult the ufs-weather-model wiki if you are unsure how to do this.

  • This PR has been tested using a branch which is up-to-date with the top of all sub-component repositories except for those sub-components which are the subject of this PR

  • An Issue describing the work contained in this PR has been created either in the subcomponent(s) or in the ufs-weather-model. The Issue should be created in the repository that is most relevant to the changes in contained in the PR. The Issue and the dependent sub-component PR
    are specified below.

  • If new or updated input data is required by this PR, it is clearly stated in the text of the PR.

Description

This PR combines the changes in #479, #508, #507. For details about each of these PRs, please see there. No new input data is required, changes are expected only for coupled model runs that use the wave component. New baseline date tag is 20210414. See below in section testing for more information.

Issue(s) addressed

Fixes NCAR/ccpp-physics#552
Fixes NCAR/ccpp-physics#455
Fixes NCAR/ccpp-physics#608

Testing

Regression tests for each of the PRs listed above were run separately, see information on those PRs.

For the combined PRs, regression tests were run against the existing baseline on Hera using GNU and Intel.

For GNU, all tests passed:

rt_hera_gnu_against_existing.log

For Intel, all tests passed except the following ones:

fv3_gfdlmprad
fv3_gfdlmprad_atmwav
fv3_wrtGauss_nemsio_c768
cpld_bmark_wave
cpld_bmarkfrac_wave
cpld_bmarkfrac_wave_v16
cpld_control_wave

For each of those tests, all files were identical, including all restart files, except RESTART/sfc_data.tile?.nc. The difference for these files is the name of the wave model zorl variable, which has changed from zorlw to zorlwav. This has been accepted by the physics developers and the physics chief, see #508.

rt_hera_intel_against_existing.log
rt_hera_intel_against_existing_fail_test.log
rt_hera_intel_against_existing_fail_test_details.log

Note. In order to retain b4b reproducibility with the existing code for all other tests and files, the surface roughness length variables zorl_{land,ice,water} need to be reset to huge in PR NCAR/ccpp-physics#596. See the description of this PR for more information. An issue has been created (NCAR/ccpp-physics#624) to follow-up on whether this resetting to huge should be done or not. Given the size of this combined PR, it is better to not change the answers this time.

Regression tests for the combined PRs will be run on all tier-1 platforms listed below.

  • hera.intel
  • hera.gnu
  • orion.intel
  • cheyenne.intel
  • cheyenne.gnu
  • gaea.intel
  • jet.intel
  • wcoss_cray
  • wcoss_dell_p3

Dependencies

NCAR/ccpp-physics#623
NOAA-EMC/fv3atm#279
#520

AnningCheng-NOAA and others added 19 commits March 5, 2021 15:18
* upstream/develop:
  update MOM6 to GFDL 20210224 main branch commit (ufs-community#439)
  Add GNU and Cheyenne Support to Automated RT (ufs-community#444)
  Move Noah MP init to CCPP and update Noah MP regression tests, ice flux init bug fix in CCPP (ufs-community#425)
  Feature/rt automation (ufs-community#403)
  Update ccpp-physics. Make RRTMGP thread safe (ufs-community#418)
  Update regression tests from GFSv15+Thompson to GFSv16+Thompson, include "Add one regional regression test in DEBUG mode. (ufs-community#419)" (ufs-community#421)
  UGWP v0 v1 combined (ufs-community#396)
  add optional mesh in MOM6; add dz_min and min_seaice as configurable variables for coupled model (ufs-community#399)
  updates FMS to 2020.04.01 (ufs-community#392)
  Move LSM vegetation lookup tables into CCPP, clean up RUC snow cover on ice initialization (remove IPD step 2)  (ufs-community#407)
  Update CMEPS for HAFS integration; add datm and coupled-model tests on Gaea (ufs-community#401)
  Remove legacy gnumake build from fv3atm and NEMS, remove legacy Python 2.7 support, rename v16beta to v16 and RT updates (ufs-community#384)
  MOM6 bugfixes, GFDL update, update CDMBGWD settings; fix for restart reproducibility (without waves) when USE_LA_LI2016=True, sign error on fprec passed to ocean, GFDL update, resolution dependent cdmbgwd settings (ufs-community#379)
  dycore options to add zero-gradient BC to reconstruct interface u/v and change dz_min as input (ufs-community#369)
  Update develop from NOAA-GSL: RUC ice, MYNN sfclay, stochastic land perturbations (ufs-community#386)
  update cpl gfsv16 tests, rrtmgp fix and bug fixes in cmeps (ufs-community#378)
  point fv3 to EMC develop branch (ufs-community#377)
  Remove IPD steps 3 and 5 (ufs-community#357)
  Update CMEPS  (ufs-community#345)
  Implementation of CCPP timestep_init and timestep_final phases (ufs-community#337)
  Remove unnecessary SIMD instruction sets for Jet, first round of cleanup in rt.conf, initialize cld_amt to zero for regional runs (dycore) (ufs-community#353)
  add frac grid input, update and add additional cpld tests (ufs-community#354)
  Add checkpoint restarts for ufs-cpld (ufs-community#342)
  Update the format of rt.conf (ufs-community#349)
  Remove IPD (step 1) (ufs-community#331)
  Feature/ww3update (ufs-community#334)
  Replace old regional SDF with FV3_GFS_v15_thompson_mynn (ufs-community#333)
  Update modules with hpc-stack v1.1.0 (ufs-community#319)
  Regression test log for PR ufs-community#323 for jet.intel (ufs-community#336)
  RRTMGP and Thompson MP coupling (ufs-community#323)
  Add 2 new tests for DATM-MOM6-CICE6 application (ufs-community#332)
  Add optional bulk flux calculation in ufs-datm (ufs-community#266)
  Final-final GFS v16 updates / restart reproducibility bugfixes (ufs-community#325)
  Updates to build for JEDI linking/control, add wcoss2 (ufs-community#295)
  Update CICE, Move regression test input outside baseline directory (ufs-community#270)
  Feature/update mom6 and retain b4b results for 025x025 resolution (ufs-community#290)
  Update for Jet, bug fixes in running with frac_grid=T and GFDL MP, and in restarting with frac_grid=T  (ufs-community#304)
  Updates to stochastic_physics_wrapper (ufs-community#280)
  Update develop from gsd/develop 2020/11/20: Unified gravity wave drag, updates to other GSL physics (ufs-community#297)
  Fix to allow quilting with non-factors for layout (ufs-community#250)
  rt update (ufs-community#261)
* upstream/develop:
  Updatetemplate; Fix model slowness when using threading; Update WW3 for porting and threading issues (ufs-community#383)
  update MOM6 to GFDL 20210308 main branch commit (ufs-community#458)
  Regional inlinepost (ufs-community#364)
  correct benchmark diag_tables for coupled model configurations; move bm_ic directory out of inputdata directory; use aws ec2 for CI test; auto-rt fixes (ufs-community#426)
@BrianCurtis-NOAA
Copy link
Collaborator

Machine: jet
Compiler: intel
Job: BL
Repo location: /lfs4/HFIP/h-nems/emc.nemspara/autort/pr/615506885/20210414213010/ufs-weather-model
Please manually delete: /lfs4/HFIP/h-nems/emc.nemspara/RT_RUNDIRS/emc.nemspara/FV3_RT/rt_99770
Test fv3_thompson_no_aero 033 failed in run_test failed
Test fv3_gsd 031 failed in run_test failed
Please make changes and add the following label back:
jet-intel-BL

@climbfuji
Copy link
Collaborator Author

climbfuji commented Apr 15, 2021

Notes on regression testing (to be updated):

  • auto-bl/rt passed for: hera.intel, hera.gnu, gaea.intel, orion.intel
  • auto-bl/rt failed for: jet.intel - reason: the usual slurm issues on jet, the two failing tests had the following error:
sbatch: error: Batch job submission failed: Socket timed out on send/recv operation
  • manual tests passed for: wcoss_cray, wcoss_dell_p3 (thanks @DeniseWorthen), cheyenne.gnu, cheyenne.intel, jet.intel
  • ci tests passed for commit d34e398

@DeniseWorthen
Copy link
Collaborator

WCOSS logs are posted: /scratch2/NCEPDEV/stmp1/Denise.Worthen/PR520

@climbfuji climbfuji added the Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked. label Apr 15, 2021
@climbfuji
Copy link
Collaborator Author

@junwang-noaa @DusanJovic-NOAA ready for merging.

@junwang-noaa junwang-noaa merged commit 65f5341 into ufs-community:develop Apr 15, 2021
@junwang-noaa
Copy link
Collaborator

junwang-noaa commented Apr 15, 2021 via email

@climbfuji
Copy link
Collaborator Author

Merged

Thank you, that was a big set of changes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Baseline Updates Current baselines will be updated. Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked. Waiting for Reviews The PR is waiting for reviews from associated component PR's.
Projects
None yet
5 participants