Add a fingerprint for each EvaluationModule #206

mathemakitten · 2022-07-26T19:02:19Z

To support #126, we need to fingerprint whichever EvaluationModule (metric, measurement, etc.) we're using, for later reproducibility.

This extracts the already-computed hash from each EvaluationModule and makes it easy to access in the evaluation_cls via module._fingerprint, similar to how in datasets you can do ds._fingerprint.

Test via

import evaluate

module = evaluate.load("lvwerra/element_count", module_type="measurement")
print(f"Module fingerprint: {module._fingerprint}")

HuggingFaceDocBuilderDev · 2022-07-26T19:06:44Z

The documentation is not available anymore as the PR was closed or merged.

lvwerra

LGTM! Also interested to hear what @lhoestq thinks. We plan to use this to cache evaluator computations.

Please hold off on merging - I want to do a minor release first.

lhoestq

In datasets we use fingerprint to identify data, while we use hash to identify dataset scripts. For example a DatasetBuilder has a hash that identifies the code of the dataset script it is going to run.

Not a strong opinion, but I think you could also name it hash for consistency.
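The distinction above can be sketched roughly as follows. This is an illustration only, not the code datasets uses: `script_hash` and `update_fingerprint` are hypothetical helpers, and SHA-256 is an assumed choice of hash function.

```python
import hashlib


def script_hash(source: str) -> str:
    """Hash that identifies code: depends only on the script's source text."""
    return hashlib.sha256(source.encode("utf-8")).hexdigest()


def update_fingerprint(previous: str, transform: str, args: dict) -> str:
    """Fingerprint that identifies data: changes with every transform applied."""
    payload = repr((previous, transform, sorted(args.items())))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


source = "def _generate_examples(self): ..."  # hypothetical script contents
builder_hash = script_hash(source)

# The data starts with some fingerprint and gets a new one per transform.
fp0 = update_fingerprint("", "load", {"split": "train"})
fp1 = update_fingerprint(fp0, "shuffle", {"seed": 42})

print(builder_hash == script_hash(source))  # same code, same hash
print(fp0 != fp1)  # transformed data, new fingerprint
```

Under this reading, an EvaluationModule is closer to a script than to data, which is why naming the attribute hash matches the existing convention.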

lvwerra · 2022-07-28T13:15:55Z

Sounds good to me - also have no strong opinion :)

mathemakitten · 2022-07-28T21:59:41Z

Renamed, thanks for the clarification on hash vs. fingerprint!

* Add fingerprint for Hub modules
* Rename evaluation module fingerprint to _hash
* fix typo

Co-authored-by: helen <helen@huggingface.co>

Add fingerprint for Hub modules

d58689d

lvwerra requested a review from lhoestq July 28, 2022 09:31

lvwerra approved these changes Jul 28, 2022

lhoestq reviewed Jul 28, 2022

helen added 2 commits July 28, 2022 14:55

Merge branch 'main' into hn-fingerprint-evalmodule

98d3844

Rename evaluation module fingerprint to _hash

00999b8

fix typo

28c6bea

lvwerra merged commit 9a10e58 into main Jul 29, 2022

lvwerra deleted the hn-fingerprint-evalmodule branch July 29, 2022 09:30

mathemakitten added a commit that referenced this pull request Aug 3, 2022

Add a fingerprint for each EvaluationModule (#206)

c2a8c43
