Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New YAKL 2022-02-18 #4804

Merged
merged 6 commits into from
Mar 9, 2022
Merged

New YAKL 2022-02-18 #4804

merged 6 commits into from
Mar 9, 2022

Conversation

mrnorman
Copy link
Contributor

@mrnorman mrnorman commented Feb 18, 2022

Upgrading to the latest YAKL. Includes performance improvements for CUDA and HIP as well as a bug (memory leak) fix for the pool allocator. The new YAKL also has a fully functioning SYCL backend for Intel GPUs.

CUDA has what appears to be a compiler bug in the FFT routines. These have been worked around in the new YAKL. The FFT API changed by no longer requiring a parameter to init(). This has been reflected in pressure.cpp and crm_variance_transport.cpp

An issue with the standalone samxx has been fixed, and new machine files have been added.

This PR passes ./create_test SMS_Ld10_P6x7.ne4pg2_ne4pg2.F-MMFXX.summit_gnugpu.eam-rrtmgpxx

[BFB]

Copy link
Contributor

@whannah1 whannah1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good to me.

@rljacob rljacob added the MMF label Feb 24, 2022
@brhillman
Copy link
Contributor

@mrnorman this one fails:

ERS_Ln9.ne4pg2_ne4pg2.F-MMFXX.mappy_gnu.eam-mmf_use_ESMT

@brhillman brhillman added the BFB PR leaves answers BFB label Mar 3, 2022
brhillman added a commit that referenced this pull request Mar 3, 2022
Upgrading to the latest YAKL. Includes performance improvements for CUDA and HIP as well as a bug (memory leak) fix for the pool allocator. The new YAKL also has a fully functioning SYCL backend for Intel GPUs.

CUDA has what appears to be a compiler bug in the FFT routines. These have been worked around in the new YAKL. The FFT API changed by no longer requiring a parameter to init(). This has been reflected in pressure.cpp and crm_variance_transport.cpp

An issue with the standalone samxx has been fixed, and new machine files have been added.

This PR passes ./create_test SMS_Ld10_P6x7.ne4pg2_ne4pg2.F-MMFXX.summit_gnugpu.eam-rrtmgpxx

[BFB]
@brhillman
Copy link
Contributor

Merged to next.

@amametjanov
Copy link
Member

There are some IBM compiler build (link) errors on next that appear to be coming from this PR:

../atm/samxx/libsamxx.a(scalar_momentum.cpp.o):(.toc+0x70): undefined reference to `yakl::c::Bounds<1, true>::strides'
../atm/samxx/libsamxx.a(shear_prod2D.cpp.o):(.toc+0x20): undefined reference to `yakl::c::Bounds<3, true>::lbounds'
../atm/samxx/libsamxx.a(advect2_mom_xy.cpp.o):(.toc+0x10): undefined reference to `yakl::c::Bounds<3, true>::strides'
...

https://my.cdash.org/viewTest.php?onlydelta&buildid=2136782

Do these (8) tests work on Summit?

@mrnorman
Copy link
Contributor Author

mrnorman commented Mar 4, 2022

I'll look at these on summit when I get the chance.

…nd PGI compilers, but more testing is needed before merging to next again
brhillman added a commit that referenced this pull request Mar 8, 2022
Upgrading to the latest YAKL. Includes performance improvements for CUDA and HIP as well as a bug (memory leak) fix for the pool allocator. The new YAKL also has a fully functioning SYCL backend for Intel GPUs.

CUDA has what appears to be a compiler bug in the FFT routines. These have been worked around in the new YAKL. The FFT API changed by no longer requiring a parameter to init(). This has been reflected in pressure.cpp and crm_variance_transport.cpp

An issue with the standalone samxx has been fixed, and new machine files have been added.

This PR passes ./create_test SMS_Ld10_P6x7.ne4pg2_ne4pg2.F-MMFXX.summit_gnugpu.eam-rrtmgpxx

[BFB]
@brhillman brhillman merged commit 2c69bef into master Mar 9, 2022
@brhillman
Copy link
Contributor

Merged to master.

@brhillman brhillman deleted the mrnorman/new-yakl-2022-02-18 branch March 9, 2022 21:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BFB PR leaves answers BFB MMF
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants