-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Number of wiregroups is incorrectly overwritten in CMSSW_11_1_X_2020-07-25-1100 #30921
Comments
A new Issue was created by @dildick Sven Dildick. @Dr15Jones, @dpiparo, @silviodonato, @smuzaffar, @makortel, @qliphy can you please review it and eventually sign/assign? Thanks. cms-bot commands are listed here |
assign geometry |
New categories assigned: geometry @Dr15Jones,@cvuosalo,@mdhildreth,@makortel,@ianna,@civanch you have been requested to review this Pull request/Issue and eventually sign? Thanks |
@slomeo Could you please look at this bug? Or maybe mention it to your Muon colleagues? |
Deeper investigation reveals it's not the Rather the |
Weird. The number of wiregroups also gets corrupted in CMSSW_11_1_X_2020-07-27-1100 on |
@dildick If there is incorrect overwriting of memory, it can have seemingly random and unpredictable effects. Trying to find the instance of overwriting can be very difficult because the bad effect may be far separated from the cause. |
The best tool to look for memory overwrite is ASAN. My suggestion is to find a recent ASAN IB and run the job using that release. |
@dildick Can you continue to debug this problem? You know most about it. It is looking less like a geometry issue. |
@cvuosalo I found a simple way around the problem, namely remove this line: https://github.com/cms-sw/cmssw/blob/master/L1Trigger/CSCTriggerPrimitives/src/CSCAnodeLCTProcessor.cc#L223. Basically, call the geometry functions for every ALCT processor, for every event. Now, the CLCT processor code has a similar block, https://github.com/cms-sw/cmssw/blob/master/L1Trigger/CSCTriggerPrimitives/src/CSCCathodeLCTProcessor.cc#L223. Should be I concerned that the number of strips may be incorrectly overwritten? I don't know. |
assign l1 |
New categories assigned: l1 @benkrikler,@rekovic you have been requested to review this Pull request/Issue and eventually sign? Thanks |
I think this issue also involves the L1 code. |
@dildick You could change that
to
This way you avoid unnecessary calls for every event but prevent the bad values from being used. |
@Dr15Jones @smuzaffar Is there a recent 11_1_X ASAN build I can test? |
I don't think we have ASAN builds in 11_1_X, only in 11_2_X. |
Ok. I squashed the 11_2_X version in a single commit (dildick@1eefd3e); cherry-picked it into the [1]
|
@dildick I've found I can't use the FNAL machines for this because of their restriction on virtual memory. Try using a CERN development machine. |
+1 |
@slomeo It looks like this issue is resolved, so nothing more needs to be done. |
I tested the CSC trigger emulator for #30914 (CMSSW_11_1_X_2020-07-25-1100) on a 11_1_0_pre8 relval sample ( root://cmsxrootd-site.fnal.gov//store/relval/CMSSW_11_1_0_pre8/RelValZMM_14/GEN-SIM-DIGI-RAW/111X_mcRun3_2021_realistic_v4-v1/20000/E1B44039-321A-284E-91FB-97CD10F43A48.root). I noticed that the preTrigger() function was crashing in the CSCAnodeLCTProcessor, because of an invalid number of wiregroups. I added a few more print-outs and it seems that the line
numWireGroups = cscChamber_->layer(1)->geometry()->numberOfWireGroups()
(https://github.com/cms-sw/cmssw/blob/CMSSW_11_1_X/L1Trigger/CSCTriggerPrimitives/src/CSCAnodeLCTProcessor.cc#L227) returns the nominal wiregroup values, except for chamber ME-1/1/11. numberOfWireGroups returned 196609 in Run 1, Event 800, LumiSection 8. Perhaps something in the event setup was corrupted?This results in an error
The text was updated successfully, but these errors were encountered: