Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

integrate hep_tfds, September 2022 benchmark training #136

Merged
merged 47 commits into from
Sep 2, 2022

Conversation

jpata
Copy link
Owner

@jpata jpata commented Aug 31, 2022

@jpata jpata added the CMS Concerns the CMS MLPF model label Aug 31, 2022
@jpata jpata changed the title September 2022 benchmark training, integrate hep_tfds integrate hep_tfds, September 2022 benchmark training Aug 31, 2022
@jpata jpata merged commit fb89d79 into main Sep 2, 2022
@jpata jpata deleted the sep22_benchmarktraining branch December 22, 2022 14:09
jpata added a commit that referenced this pull request Sep 15, 2023
* Initial commit

* add template dataset definitions

* Add initial CMS particle-flow dataset implementation

Also changed to a new tensorflow dataset template

* add test scripts

* Run black formatting on python files

* Add instructions to cms_pf, use manual_dir for preprocessing

* fix: ability to choose data directory for the tfrecords files

* feat: Add Delphes dataset

* fix: support loading both .pkl.bz2 and .pkl

* fix: remove extra dimension in cms_pf data items

* fix cms

* fixes for delphes

* ensure dir exists

* separate cms datasets

* clarify manual dir

* cleanup print

* added singleele and singlemu

* update 1.1

* cleanup cms datasets

* update datamodel

* added new datasets

* gen/sim 12_3_0_pre6 generation (#1)

* 1.2 format, ztt dataset

* version 1.3.0 with new gensim truth

* new dataset

* add qcd

* add some asserts

* add new features

* keep PS

* add tau as pf target

* 1.3.1 remove ps and brem (#2)

* fix HF labeling (#3)

* add new high-PU QCD dataset, update energy

* up

* fix

* Add gen jet index (#4)

* first attempt at gen jet clustering

* add other reqs

* revert test

* fix mapping to before masking particles

* fix out of index bufg

* benchmark training for CMS

* move path

* move path

* remove submodule

* remove

* move

* fix import

* format

* format

* remove some dummy files

* up

* try with masking

* use a different dataset for logging the jet/met distributions

* clean

* added clic ttbar

Co-authored-by: Eric Wulff <eric.g.t.wulff@gmail.com>
Co-authored-by: Eric Wulff <eric.wulff@cern.ch>
Co-authored-by: Javier Duarte <jduarte@ucsd.edu>
Former-commit-id: fb89d79
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CMS Concerns the CMS MLPF model
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants