Make sure that the ROOT writers enforce consistency for the Frame contents they write #513

tmadlener · 2023-11-08T15:14:20Z

BEGINRELEASENOTES

Introduce checks in ROOTFrameWriter::writeFrame and ROOTNTupleWriter::writeFrame that ensure consistent contents for all Frames of a given category. If inconsistent contents are found an exception is thrown. Before these changes this might lead to a crash or to unreadable files. Fixes ROOTFrameWriter can produce unreadable files without warning #382
Refactor ROOTNTupleWriter internals to have only one map that keeps track of categories instead of two maps and a set that need to be kept consistent.

ENDRELEASENOTES

tmadlener · 2023-11-09T08:55:06Z

@andresailer @Zehvogel any thoughts about the contents of the exception messages? In principle the only exception that we can trigger from the k4FWCore PodioOutput is the one where a collection is not available. The reason is that there we are guaranteed to have a consistent set of collsToWrite as that is dictated by the framework.

src/rootUtils.h

tests/unittests/unittest.cpp

src/ROOTNTupleWriter.cc

Co-authored-by: Andre Sailer <andre.philippe.sailer@cern.ch>

jmcarcell · 2023-11-13T08:19:37Z

src/rootUtils.h

+  // Since we are guaranteed to have unique names here, we can just look for
+  // collisions brute force, which seems to be quickest approach for vector
+  // sizes we typically have here (few hundred)
+  for (const auto& id : candidateColls) {
+    if (std::find(existingColls.begin(), existingColls.end(), id) == existingColls.end()) {
+      return false;
+    }
+  }


Suggested change

// Since we are guaranteed to have unique names here, we can just look for

// collisions brute force, which seems to be quickest approach for vector

// sizes we typically have here (few hundred)

for (const auto& id : candidateColls) {

if (std::find(existingColls.begin(), existingColls.end(), id) == existingColls.end()) {

return false;

}

}

// Since we are guaranteed to have unique names here, we can just look for

// collisions brute force, which seems to be quickest approach for vector

// sizes we typically have here (few hundred)

for (const auto& id : candidateColls) {

if (!std::binary_search(existingColls.begin(), existingColls.end(), id, [](const auto& lhs, const auto& rhs) { std::lexicographical_compare(

lhs.begin(), lhs.end(), rhs.begin(), rhs.end(),

[](const auto& cl, const auto& cr) { return std::tolower(cl) < std::tolower(cr); }))) {

return false;

}

}

We can use that existingColls is ordered, and maybe make a helper function for the comparison lambda, looks like it should be faster.

Edit: Actually they are not sorted for the rntuple so this wouldn't work then without sorting them first

I added the binary_search version to benchmarks¹ I did before to see if a more involved approach works better for us. It looks like a binary_search approach without lower-casing things is the best (depending on the number of collections). On the other hand, in most cases this check will probably exit simply because the two vectors have unequal size.

Results with one vector orderd as we do in the ROOTFrameWriter (and the second one unordered). The numbers are the sizes of the vectors. CompareLinear is the current implementation, CompareBinary and CompareBinaryLowerCase use binary_search. The former one uses binary_search(existingColls.begin(), existingColls.end(), id), the second one the proposal from above.

--------------------------------------------------------------------- Benchmark Time CPU Iterations --------------------------------------------------------------------- CompareLinear/2 11.8 ns 11.8 ns 57782514 CompareLinear/8 50.6 ns 50.5 ns 10000000 CompareLinear/64 1153 ns 1151 ns 603770 CompareLinear/512 61044 ns 60996 ns 11362 CompareBinary/2 27.8 ns 27.8 ns 24459705 CompareBinary/8 142 ns 141 ns 4901529 CompareBinary/64 525 ns 525 ns 1375137 CompareBinary/512 706 ns 705 ns 983974 CompareBinaryLowerCase/2 251 ns 251 ns 2779374 CompareBinaryLowerCase/8 1646 ns 1644 ns 422281 CompareBinaryLowerCase/64 19288 ns 19274 ns 36780 CompareBinaryLowerCase/512 173434 ns 173318 ns 4026

Footnotes

Basically what the benchmark does is to select N collection names randomly from all the collection names that we collected during Make CollectionIDs a 32bit hash value of the collection name #412. This is done twice, in order to have vectors to compare. In this way the cheap check for the sizes is effectively always skipped and depending on N the actual loop is executed at least a few times. The biggest N=512 in this case represents the worst case for the runtime as it will do the full N^2 of work. This is not entirely our use case, but it should represent sort of the worst case scenario. ↩

After some further checking it turns out that binary_search without taking into account that existingColls are lexicographically sorted case insensitve doesn't work because the pre-conditions of binary_search are broken. Hence, it looks like std::find and linear searching is still the way to go.

Just for documentation of our short internal discussion. Making the binary_search approach exit early in case the strings are of unequal sizes, speeds that up quite a bit and it becomes comparable in speed at 64 elements already and scales much better:

----------------------------------------------------------------------------- Benchmark Time CPU Iterations ----------------------------------------------------------------------------- CompareLinear/2 11.3 ns 11.3 ns 58053921 CompareLinear/8 48.3 ns 48.3 ns 14333123 CompareLinear/64 1130 ns 1130 ns 618685 CompareLinear/512 60220 ns 60216 ns 11686 CompareBinaryLowerCaseShortCut/2 150 ns 150 ns 4636032 CompareBinaryLowerCaseShortCut/8 498 ns 498 ns 1373998 CompareBinaryLowerCaseShortCut/64 1159 ns 1159 ns 599214 CompareBinaryLowerCaseShortCut/512 6149 ns 6149 ns 111620

Given that our happy path will always have to go through the complete vector to see whether the contents are the same, I have switched to this approach and also made sure that the collection names are sorted accordingly for the RNTuple writer.

src/rootUtils.h

tmadlener added 2 commits November 8, 2023 15:37

Only record the datamodel definition once

fa7e96f

Make sure the contents of Frames in one category are consistent

8fc5d81

tmadlener force-pushed the consistent-events branch from b3ecd30 to 8fc5d81 Compare November 8, 2023 15:22

tmadlener mentioned this pull request Nov 8, 2023

Update ROOTFrameWriter.cc #494

Closed

tmadlener added 5 commits November 8, 2023 16:37

Ignore performance concerns on the path to an exception

9ecce26

Refactor RNTupleWriter internals in preparation

ed99889

Minimize number of internal maps in RNTuple writer

d470afb

Finish refactoring to make consistent Frame writing possible

e70375b

Enforce consistency also for RNTuple writer

8c7f0ee

tmadlener changed the title ~~Make sure that the ROOTFrameWriter writes consistent frames in each category~~ Make sure that the ROOT writers enforce consistency for the Frame contents they write Nov 8, 2023

tmadlener added 2 commits November 9, 2023 09:25

Add RNTuple writer unittests

d0a87b5

Correct docstring to match implementation

38233df

tmadlener force-pushed the consistent-events branch from 9321d90 to c82ca5f Compare November 9, 2023 08:32

Split test cases into two to tag them differently

9dbd135

tmadlener force-pushed the consistent-events branch from c82ca5f to 9dbd135 Compare November 9, 2023 08:47

tmadlener mentioned this pull request Nov 9, 2023

Make PodioOutput exit gracefully instead of crashing key4hep/k4FWCore#159

Merged

1 task

andresailer reviewed Nov 9, 2023

View reviewed changes

src/rootUtils.h Show resolved Hide resolved

andresailer reviewed Nov 9, 2023

View reviewed changes

tests/unittests/unittest.cpp Show resolved Hide resolved

tmadlener added 3 commits November 9, 2023 10:36

Always define PODIO_ENABLE_RNTUPLE with appropriate value

7719acf

Include more information in exception message

1d2766f

Add possibility to check consistency of frames before writing

2342300

andresailer reviewed Nov 9, 2023

View reviewed changes

src/ROOTNTupleWriter.cc Outdated Show resolved Hide resolved

tmadlener and others added 3 commits November 9, 2023 13:15

Add more labels to exclude tests from running in sanitizer CI

8be26e1

Make exception messages consistent

5d85991

Co-authored-by: Andre Sailer <andre.philippe.sailer@cern.ch>

Use correct label for excluding Thread sanitizer

af405fe

jmcarcell reviewed Nov 13, 2023

View reviewed changes

Make exception message more succinct

3cbcd14

tmadlener mentioned this pull request Nov 13, 2023

[WIP] Simplify python util unittests key4hep/EDM4hep#238

Closed

1 task

Remove unnecessary header

6abff1d

Switch to improved binary_search

c480c02

tmadlener merged commit 8ebec76 into AIDASoft:master Nov 14, 2023
17 checks passed

tmadlener deleted the consistent-events branch November 14, 2023 11:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make sure that the ROOT writers enforce consistency for the Frame contents they write #513

Make sure that the ROOT writers enforce consistency for the Frame contents they write #513

tmadlener commented Nov 8, 2023 •

edited

Loading

tmadlener commented Nov 9, 2023

jmcarcell Nov 13, 2023 •

edited

Loading

tmadlener Nov 13, 2023

tmadlener Nov 13, 2023

tmadlener Nov 14, 2023

Make sure that the ROOT writers enforce consistency for the Frame contents they write #513

Make sure that the ROOT writers enforce consistency for the Frame contents they write #513

Conversation

tmadlener commented Nov 8, 2023 • edited Loading

tmadlener commented Nov 9, 2023

jmcarcell Nov 13, 2023 • edited Loading

Choose a reason for hiding this comment

tmadlener Nov 13, 2023

Choose a reason for hiding this comment

Footnotes

tmadlener Nov 13, 2023

Choose a reason for hiding this comment

tmadlener Nov 14, 2023

Choose a reason for hiding this comment

tmadlener commented Nov 8, 2023 •

edited

Loading

jmcarcell Nov 13, 2023 •

edited

Loading