Include Sparsity in Size Calculation #463

Satrat · 2024-02-21T21:44:22Z

IMPORTANT: merge #462 first, the quantization fix is also needed for the size calculation to be correct

Asana ticket: https://app.asana.com/0/1206109050183159/1206630352611634/f

Before

Size calculation was: total_params * (8 * quant_percent + 32 * (1 - quant_percent)

After

Size calculation is: total_params * (1 - sparsity_percent) * (8 * quant_percent + 32 * (1 - quant_percent)

Testing

from sparsezoo.analyze_v2 import analyze

MODEL_PATH = "zoo:codellama-7b-evolcodealpaca_codellama_pretrain-pruned60_quantized"
analysis = analyze(MODEL_PATH)
print(analysis)

Output:

Params:
	total 	        : 6738149376
	sparsity%	: 57.53771573852388
	size [bits]	: 24257284280.0
	quantized %	: 97.99253034547152

expected_size = 6738149376 * ( 1 - 0.5753771573852388) * (8 * 0.9799253034547152 + 32 * (1 - 0.9799253034547152)) = 24267869036.34734
reported_size = 24257284280.0
expected_size / reported_size = 1.000436353724727

bfineran

LGTM - nice. Only taking the last commit into account

* initial fix * style * incorporate sparsity into size calculation * quality

* `RegistryMixin` improved alias management (#404) * initial commit * add docstrings * simplify * hardening * refactor * format registry lookup strings to be lowercases * standardise aliases * Move evaluator registry (#411) * More control over external data size (#412) * When splitting external data, avoid renaming `model.data` to `model.data.1` if only one external data file gets eventually saved (#414) * [model.download] fix function returning nothing (#420) * [BugFix] Path not expanded (#418) * [Fix] Allow for processing Path in the sparsezoo analysis (#417) * Raise TypeError instead of ValueError (#426) * Fix misleading docstring (#416) Add test * add support for benchmark.yaml (#415) * add support for benchmark.yaml recent zoo models use `benchmark.yaml` instead of `benchmarks.yaml`. adding this additional pathway so `benchmark.yaml` is downloaded in the bulk model download * update files filter * fix tests --------- Co-authored-by: dbogunowicz <damian@neuralmagic.com> * [BugFix] Add analyze to init (#421) * Add analyze to init * Move onnxruntime to deps * Print model analysis (#423) * [model.download] fix function returning nothing (#420) * [BugFix] Path not expanded (#418) * print model-analysis * [Fix] Allow for processing Path in the sparsezoo analysis (#417) * add print statement at the end of cli run --------- Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by: Rahul Tuli <rahul@neuralmagic.com> Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com> * Omit scalar weight (#424) * ommit scalar weights: * remove unwanted files * comment * Update src/sparsezoo/utils/onnx/analysis.py Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> --------- Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> --------- Co-authored-by: George <george@neuralmagic.com> Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com> Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> * update analyze help message for correctness (#432) * initial commit (#430) * [sparsezoo.analyze] Fix pathway such that it works for larger models (#437) * fix analyze to work with larger models * update for failing tests; add comments * Update src/sparsezoo/utils/onnx/external_data.py Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com> --------- Co-authored-by: Dipika Sikka <dipikasikka1@gmail.coom> Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com> * Delete hehe.py (#439) * Download deployment dir for llms (#435) * Download deployment dir for llms * Use path instead of download * only set save_as_external_data to true if the model originally had external data (#442) * Add Channel Wise Quantization Support (#441) * Chunk download (#429) * chunk download, break down into 10 * lint * threads download * draft * chunk download draft * job based download and combining/deleteing chunks * delete old code * lint * fix num jobs if file_size is less than the chunk size * doc string and return types * test * lint * fix type hints (#445) * fix bug if the value is a dict (#447) * [deepsparse.analyze] Fix v1 functionality to work with llms (#451) * fix equivalent changes made to analyze_v2 such that inference session works for llms; update wanrings to be debug printouts * typo * overwrite file (#450) Co-authored-by: 21 <a21@21s-MacBook-Pro.local> * Adds a `numpy_array_representer` to yaml (#454) on runtime, to avoid serialization issues * Avoid division by zero (#457) Avoid log of zero * op analysis total counts had double sparse counts (#461) * Rename legacy analyze to analyze_v1 (#459) * Fixing Quant % Calcuation (#462) * initial fix * style * Include Sparsity in Size Calculation (#463) * initial fix * style * incorporate sparsity into size calculation * quality * op analysis total counts had double sparse counts (#461) * Fixing Quant % Calcuation (#462) * initial fix * style * Include Sparsity in Size Calculation (#463) * initial fix * style * incorporate sparsity into size calculation * quality * Revert "Merge branch 'main' into analyze_cherry_picks" This reverts commit 509fa1a, reversing changes made to 08f94c4. --------- Co-authored-by: dbogunowicz <97082108+dbogunowicz@users.noreply.github.com> Co-authored-by: Rahul Tuli <rahul@neuralmagic.com> Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> Co-authored-by: dbogunowicz <damian@neuralmagic.com> Co-authored-by: George <george@neuralmagic.com> Co-authored-by: Dipika Sikka <dipikasikka1@gmail.coom> Co-authored-by: 21 <a21@21s-MacBook-Pro.local>

Satrat added 4 commits February 21, 2024 17:23

initial fix

2e7b8bf

style

0b888b1

Merge branch 'main' into fix_quant_calculation

eb0d561

incorporate sparsity into size calculation

8f33437

Satrat requested review from horheynm, rahul-tuli, bfineran and anmarques and removed request for horheynm February 21, 2024 21:46

bfineran previously approved these changes Feb 21, 2024

View reviewed changes

rahul-tuli previously approved these changes Feb 21, 2024

View reviewed changes

Merge branch 'main' into fix_size_calcuation

3c04596

Satrat dismissed stale reviews from rahul-tuli and bfineran via 3c04596 February 22, 2024 14:23

quality

7c3e647

Satrat requested review from bfineran and rahul-tuli February 22, 2024 14:30

rahul-tuli approved these changes Feb 22, 2024

View reviewed changes

bfineran approved these changes Feb 22, 2024

View reviewed changes

Satrat merged commit a00ca1e into main Feb 22, 2024
4 checks passed

Satrat deleted the fix_size_calcuation branch February 22, 2024 14:57

Satrat added a commit that referenced this pull request Feb 22, 2024

Include Sparsity in Size Calculation (#463)

08f94c4

* initial fix * style * incorporate sparsity into size calculation * quality

This was referenced Feb 22, 2024

[Cherry Picks] Analyze Bug Fixes #464

Closed

[Cherry Picks] Analyze Bug Fixes (Updated) #465

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include Sparsity in Size Calculation #463

Include Sparsity in Size Calculation #463

Satrat commented Feb 21, 2024

bfineran left a comment

Include Sparsity in Size Calculation #463

Include Sparsity in Size Calculation #463

Conversation

Satrat commented Feb 21, 2024

Before

After

Testing

bfineran left a comment

Choose a reason for hiding this comment