Fixing cuda crashes #483
RecoLocalCalo/EcalRecProducers/plugins/AmplitudeComputationKernels.h
Both CPU fixes are in the main release.
@mariadalfonso perfect! :)
For the HLT menu to use
With the 3 points above you get correct results in 2018 data, and the two floats energy-cpu and energy-gpu should be identical.
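The check described above (the CPU and GPU energies should agree) can be sketched as a small standalone helper. This is only an illustration, not code from the PR: the function name `countEnergyMismatches`, the use of `std::vector<float>` as the container, and the optional tolerance are all assumptions made for the example.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of CMSSW): counts the entries where the CPU
// and GPU energies differ by more than `tolerance`. With the default
// tolerance of 0 the values must be exactly equal. NaN values always count
// as mismatches, and so do entries present in only one of the two inputs.
inline std::size_t countEnergyMismatches(const std::vector<float>& energyCpu,
                                         const std::vector<float>& energyGpu,
                                         float tolerance = 0.0f) {
  const std::size_t common = std::min(energyCpu.size(), energyGpu.size());
  // Entries without a partner in the other collection are mismatches.
  std::size_t mismatches = std::max(energyCpu.size(), energyGpu.size()) - common;
  for (std::size_t i = 0; i < common; ++i) {
    // Written as !(diff <= tolerance) so that a NaN difference is counted.
    if (!(std::fabs(energyCpu[i] - energyGpu[i]) <= tolerance)) {
      ++mismatches;
    }
  }
  return mismatches;
}
```

A return value of 0 means the two collections agree within the chosen tolerance; any other value is the number of rechits worth inspecting.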
@vkhristenko I have a few fixes:
Could you apply them with curl -L https://github.com/cms-patatrack/cmssw/files/4774221/diff.txt | patch -p1? The diff is attached: diff.
@fwyzard applied
- recHitsM0TokenOut_{produces<OProductType>("recHitsM0LabelOut")},
- recHitsLegacyTokenOut_{produces<HBHERecHitCollection>("recHitsLegacyLabelOut")} {}
+ recHitsM0TokenOut_{produces<OProductType>(ps.getParameter<std::string>("recHitsM0LabelOut"))},
+ recHitsLegacyTokenOut_{produces<HBHERecHitCollection>(ps.getParameter<std::string>("recHitsLegacyLabelOut"))} {}
With these changes, I finally have the full HLT running 👍
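The label fix in the diff above comes down to the difference between using a parameter's name as the product label and using the parameter's configured value. A minimal standalone sketch of that difference follows. It does not use the real framework: `MockParameterSet` is a hypothetical stand-in for `edm::ParameterSet` that only stores strings, and `labelBeforeFix` / `labelAfterFix` are illustrative names, not functions from the PR.

```cpp
#include <map>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for edm::ParameterSet (assumption for this sketch):
// it only stores string parameters and throws if one is missing.
class MockParameterSet {
public:
  void addParameter(const std::string& name, const std::string& value) {
    values_[name] = value;
  }
  std::string getParameter(const std::string& name) const {
    const auto it = values_.find(name);
    if (it == values_.end()) {
      throw std::out_of_range("missing parameter: " + name);
    }
    return it->second;
  }

private:
  std::map<std::string, std::string> values_;
};

// Before the fix: the parameter *name* is used as the product label, so the
// configured value is ignored.
inline std::string labelBeforeFix(const MockParameterSet&) {
  return "recHitsM0LabelOut";
}

// After the fix: the label is whatever value the configuration assigns to
// the parameter.
inline std::string labelAfterFix(const MockParameterSet& ps) {
  return ps.getParameter("recHitsM0LabelOut");
}
```

With a configuration that sets `recHitsM0LabelOut` to some other string, only `labelAfterFix` returns the label that downstream consumers configured with the same value would look for.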
Move ECAL and HCAL CUDA code to plugins. General cleanup: remove unused code, apply clang-format and various include changes. Fix product labels for HCAL rechits on CPU. Co-authored-by: Andrea Bocci <andrea.bocci@cern.ch>
PR description:
Couple of things:
- EventFilter/EcalRawToDigi depended on RecoLocalCalo/EcalRecAlgos and Producers... directly and indirectly -> removed
- EventFilter/EcalRawToDigi: all cuda kernels/device functions moved to be part of plugins
- RecoLocalCalo/EcalRecAlgos and Producers: moved all cuda kernels/device functions to be part of plugins
- RecoLocalCalo/HcalRecAlgos and Producers: moved all cuda kernels/device functions to be part of plugins

The problem was due to device-side linking across packages when it was not needed, and symbols were present in the .so that were not supposed to be there. E.g. running w/o the current PR just ecal digi to raw + full hcal, you get the same issue. Doing nm on pluginEventFilterEcalRawToDigi.so will show symbols from RecoLocalCalo/EcalRecAlgos.

Furthermore, I think the following situation will fail:
PR validation:
Using the exes provided as part of the release.
@mariadalfonso do not know if you remember...
One thing to note for hcal: I do not know what was done for the cpu, because whenever I validated cpu vs gpu, I used a different branch (w/ a couple of fixes applied). Note that this was documented as part of previous PRs.