FEAT: add support for question answering benchmark #94

dlmgary · 2024-03-12T13:21:57Z

Description

This PR:

Adds support for a question answering benchmark for language models.
Implements a new QuestionAnsweringBenchmarkOrchestrator to evaluate different benchmarks.
Implements a new QuestionAnswerScorer to handle question answering logic post model inference.
Adds the WMDP dataset for bio, chem, and cyber from "The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning".
Implements a new QuestionAnsweringDataset model to store and distribute question answering datasets in the future.
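For readers skimming the description, here is a minimal sketch of how a multiple-choice entry and a post-inference scoring step could fit together. It is not the code in this PR: `ChoiceSketch`, `QuestionSketch`, `score_answer`, and `accuracy` are hypothetical names, and the assumption that the model replies with a bare choice index is made only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class ChoiceSketch:
    """One selectable answer (hypothetical model, not the PyRIT class)."""
    index: int
    text: str


@dataclass(frozen=True)
class QuestionSketch:
    """One multiple-choice question with the index of its correct choice."""
    question: str
    choices: tuple[ChoiceSketch, ...]
    correct_index: int


def score_answer(entry: QuestionSketch, model_output: str) -> bool:
    """Return True when the model output is exactly the correct choice index.

    Assumption: the model was prompted to reply with the index only.
    """
    cleaned = model_output.strip()
    return cleaned.isdecimal() and int(cleaned) == entry.correct_index


def accuracy(entries: list[QuestionSketch], outputs: list[str]) -> float:
    """Fraction of entries answered correctly; 0.0 for an empty benchmark."""
    if not entries:
        return 0.0
    correct = sum(score_answer(e, o) for e, o in zip(entries, outputs))
    return correct / len(entries)
```

Keeping the scoring step separate from inference, as the description suggests, means the same model responses can be re-scored without calling the model again.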

Tests

no new tests required
new tests added
existing tests adjusted

Documentation

no documentation changes needed
documentation added or edited
example notebook added or updated

pyrit/score/question_answering_score_engine.py

The Weapons of Mass Destruction Proxy (WMDP) benchmark is a publicly available dataset published by Li, Nathaniel, et al. "The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning." arXiv preprint arXiv:2403.03218 (2024). The raw dataset is available at https://github.com/centerforaisafety/wmdp. The format of the dataset has been updated to comply with the `QuestionAnsweringDataset` PyRIT model so it can be used for Q&A evaluations. The content of the dataset has not been changed.
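A format-only conversion of this kind might look like the sketch below. Both the input keys (`question`, `choices`, `answer` as a zero-based index) and the output keys are assumptions made for illustration; neither the raw WMDP schema nor the `QuestionAnsweringDataset` fields are taken from this PR. `convert_record` is a hypothetical helper.

```python
def convert_record(raw: dict) -> dict:
    """Reshape one raw multiple-choice record without altering its content.

    Assumed input:  {"question": str, "choices": list[str], "answer": int}
    Assumed output: a dict with indexed choices and the correct index.
    """
    choices = list(raw["choices"])
    answer = int(raw["answer"])
    # Reject records whose answer does not point at an existing choice.
    if not 0 <= answer < len(choices):
        raise ValueError(f"answer index {answer} is out of range")
    return {
        "question": raw["question"],
        "answer_type": "int",
        "correct_answer": answer,
        "choices": [
            {"index": i, "text": text} for i, text in enumerate(choices)
        ],
    }
```

The question and choice text are copied through untouched, which matches the note that only the format changed.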

pyrit/models.py

pyrit/orchestrator/benchmark_orchestrator.py

pyrit/models.py

pyrit/orchestrator/benchmark_orchestrator.py

- create a new `QuestionAnswerScorer`
- move default system prompt to new YAML file
- refactor `QuestionAnsweringBenchmarkOrchestrator` to accept scorer during initialization.
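The refactor to pass the scorer in at initialization can be illustrated with a small dependency-injection sketch. Every name here (`AnswerScorer`, `ExactMatchScorer`, `BenchmarkRunnerSketch`, `ask_model`) is hypothetical and stands in for the real classes, whose signatures are not shown in this PR.

```python
from __future__ import annotations

from typing import Callable, Protocol


class AnswerScorer(Protocol):
    """Anything that can judge a model answer for a given question."""

    def score(self, question: str, answer: str) -> bool: ...


class ExactMatchScorer:
    """Hypothetical scorer: case-insensitive exact match against a key."""

    def __init__(self, answer_key: dict[str, str]) -> None:
        self._answer_key = answer_key

    def score(self, question: str, answer: str) -> bool:
        expected = self._answer_key[question]
        return answer.strip().lower() == expected.strip().lower()


class BenchmarkRunnerSketch:
    """Hypothetical orchestrator that receives its scorer at construction."""

    def __init__(self, ask_model: Callable[[str], str], scorer: AnswerScorer) -> None:
        self._ask_model = ask_model
        self._scorer = scorer

    def evaluate(self, questions: list[str]) -> float:
        """Ask every question, score each reply, and return the accuracy."""
        if not questions:
            return 0.0
        hits = sum(self._scorer.score(q, self._ask_model(q)) for q in questions)
        return hits / len(questions)
```

Because the runner depends only on the `score` method, a different scoring strategy can be swapped in without touching the orchestration logic.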

pyrit/orchestrator/benchmark_orchestrator.py

pyrit/models.py

pyrit/score/question_answer_scorer.py

doc/code/orchestrator.py

pyrit/orchestrator/benchmark_orchestrator.py

pyrit/score/scorer.py

pyrit/datasets/question_answering_dataset/README.md

dlmgary added 2 commits March 12, 2024 09:20

feat: add support for proof-of-concept question answering score bench…

26bb481

…mark and models associated with it.

refactor: fix comments

ffc2b89

dlmgary commented Mar 12, 2024

pyrit/score/question_answering_score_engine.py (outdated)

dlmgary requested review from romanlutz, rdheekonda, rlundeen2 and nina-msft March 12, 2024 19:17

nina-msft reviewed Mar 12, 2024

pyrit/models.py (outdated)

dlmgary added 7 commits March 14, 2024 14:10

Merge branch 'main' into dlmgary_benchmarking

d63b3ee

feat: add benchmark orchestrator

89516e8

refactor: fix JSON spacing.

263c5ed

refactor: fix imports

5891c53

Merge branch 'main' into dlmgary_benchmarking

688f614

refactor: remove unused import

e5f7f3f

refactor: remove magic line in Jupyter notebook that is breaking pre-…

06c3e8f

…commit hooks

dlmgary requested a review from nina-msft March 18, 2024 22:48