[Feature Branch][LLM Testing] Full Testing Harness for LLMs #1216

dbogunowicz · 2023-08-29T14:05:01Z

The implementation of the test harness for LLMs. By default, the tests are turned off so that we do not choke GHA.
To enable tests: remove @pytest.mark.skip(reason="Those tests are too heavy to run as a normal part of the CI.")
@pytest.mark.skip(reason="Those tests are too heavy to run as a normal part of the CI.")
and run pytest tests/deepsparse/transformers/pipelines/test_text_generation.py
Future consideration: adding config and utilizing small toy models to make tests extremely lightweight.

Includes PRs:

… main

* initial commit * finish creation of helper objects * Update tests/conftest.py * small refactor * [Feature Branch][LLM Testing] LLM Testing Suite (#1227) * Update README.md * Update src/deepsparse/yolov8/README.md * Update text_generation.py * quality * readability * all tests passing * added some full kv cache tests * initial commit * ready for review * Delete tests/deepsparse/transformers/pipelines/proposal_text_generation_tests.md

tests/deepsparse/transformers/pipelines/test_text_generation.py

tests/deepsparse/transformers/pipelines/helpers.py

tests/deepsparse/transformers/pipelines/test_text_generation.py

dsikka

How confident are we in our test coverage? Possibly add tests when running with deterministic off or multiple input sequences?

…ithub.com/neuralmagic/deepsparse into feature/damian/llm_testing_feature_branch

tests/deepsparse/transformers/pipelines/helpers.py

dsikka

Remove ORT ground truth class and use deepsparse pipeline instead

…sting_feature_branch

bfineran

LGTM overall. as discussed offline - will need some refactors to move cleanly to a config based method

dsikka

LGTM. Support for getting this to run on a nightly basis is still pending?

dbogunowicz and others added 8 commits August 28, 2023 08:54

initial commit

b7133a0

Merge branch 'main' of https://github.com/neuralmagic/deepsparse into…

6c9ab1d

… main

initial commit

a47f977

Merge branch 'main' into feature/damian/llm_testing_feature_branch

90382bd

Merge branch 'main' into feature/damian/llm_testing_feature_branch

37116cb

Merge branch 'main' into feature/damian/llm_testing_feature_branch

87fe78b

Merge branch 'main' into feature/damian/llm_testing_feature_branch

afb2a2a

dbogunowicz mentioned this pull request Sep 7, 2023

[TextGeneration] Samling arguments for generation #1225

Merged

dbogunowicz requested review from bfineran, dsikka and Satrat September 7, 2023 14:01

dbogunowicz and others added 3 commits September 8, 2023 12:40

Merge branch 'main' into feature/damian/llm_testing_feature_branch

a13d7a8

fix tests

92163b9

Merge branch 'main' into feature/damian/llm_testing_feature_branch

567337f

dsikka reviewed Sep 8, 2023

View reviewed changes

tests/deepsparse/transformers/pipelines/test_text_generation.py Outdated Show resolved Hide resolved

dsikka reviewed Sep 8, 2023

View reviewed changes

tests/deepsparse/transformers/pipelines/test_text_generation.py Show resolved Hide resolved

dsikka reviewed Sep 8, 2023

View reviewed changes

dbogunowicz added 2 commits September 11, 2023 11:58

Dipika's comments plus adjusting the script to renamed variables

bfe1b62

Merge branch 'feature/damian/llm_testing_feature_branch' of https://g…

27b45e6

…ithub.com/neuralmagic/deepsparse into feature/damian/llm_testing_feature_branch

bfineran reviewed Sep 11, 2023

View reviewed changes

tests/deepsparse/transformers/pipelines/helpers.py Outdated Show resolved Hide resolved

dsikka requested changes Sep 11, 2023

View reviewed changes

remove ORT ground truth

f2693ff

dbogunowicz requested review from bfineran and dsikka September 12, 2023 06:03

dbogunowicz and others added 4 commits September 12, 2023 09:32

add OPT tests

dd270a2

Merge branch 'main' into feature/damian/llm_testing_feature_branch

c22f5f2

Merge remote-tracking branch 'origin/main' into feature/damian/llm_te…

a975402

…sting_feature_branch

rebase and disable tests in GHA

646d6f5

dbogunowicz and others added 2 commits September 13, 2023 09:05

quality

17e63ee

Merge branch 'main' into feature/damian/llm_testing_feature_branch

d1379b7

bfineran approved these changes Sep 13, 2023

View reviewed changes

dsikka approved these changes Sep 13, 2023

View reviewed changes

dbogunowicz merged commit 907ea83 into main Sep 13, 2023
7 of 13 checks passed

dbogunowicz deleted the feature/damian/llm_testing_feature_branch branch September 13, 2023 14:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Branch][LLM Testing] Full Testing Harness for LLMs #1216

[Feature Branch][LLM Testing] Full Testing Harness for LLMs #1216

dbogunowicz commented Aug 29, 2023 •

edited

Loading

dsikka left a comment

dsikka left a comment

bfineran left a comment

dsikka left a comment

[Feature Branch][LLM Testing] Full Testing Harness for LLMs #1216

[Feature Branch][LLM Testing] Full Testing Harness for LLMs #1216

Conversation

dbogunowicz commented Aug 29, 2023 • edited Loading

dsikka left a comment

Choose a reason for hiding this comment

dsikka left a comment

Choose a reason for hiding this comment

bfineran left a comment

Choose a reason for hiding this comment

dsikka left a comment

Choose a reason for hiding this comment

dbogunowicz commented Aug 29, 2023 •

edited

Loading