
[GPTQ Modifier UX] Update tests to use GPTQModifier for obcq style quantization #2294

Merged: rahul-tuli merged 3 commits into quant-modifier-ux from update-tests on May 20, 2024

Conversation

rahul-tuli
Member

This PR updates the test recipes and README to use the new GPTQModifier for quantization.
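For context, a test recipe after this change would declare the quantization step with GPTQModifier rather than the older OBCQ/SparseGPT-style quantization block. The sketch below is illustrative only: the recipe keys (`config_groups`, `ignore`, `sequential_update`), the `oneshot` entrypoint, and the model/dataset names are assumptions based on the PR titles in this thread, not copied from the changed files.

```python
# Illustrative sketch only -- not taken from the PR diff.
# Assumes sparseml's one-shot entrypoint and a GPTQModifier recipe layout;
# exact keys and argument names may differ in the actual test recipes.
from sparseml.transformers import oneshot  # assumed entrypoint

RECIPE = """
test_stage:
  quant_modifiers:
    GPTQModifier:
      sequential_update: false        # assumed flag
      ignore: ["lm_head"]             # assumed: skip the output head
      config_groups:                  # config_groups support added in #2273
        group_0:
          targets: ["Linear"]
          weights:
            num_bits: 4
            type: "int"
            symmetric: true
            strategy: "channel"
"""

# One-shot application of the recipe to a small model, as a test might do.
oneshot(
    model="Xenova/llama2.c-stories15M",   # hypothetical tiny test model
    dataset="open_platypus",              # hypothetical calibration dataset
    recipe=RECIPE,
    num_calibration_samples=64,
    output_dir="./obcq_gptq_out",
)
```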

Base automatically changed from create-gptq-modifier to quant-modifier-ux May 20, 2024 18:56
@rahul-tuli rahul-tuli merged commit c695567 into quant-modifier-ux May 20, 2024
@rahul-tuli rahul-tuli deleted the update-tests branch May 20, 2024 18:56
bfineran pushed a commit that referenced this pull request on May 22, 2024
* Split WandaPruningModifier and SparseGPTModifier
Make SparseGPT not inherit from the Wanda modifier
Decouple SparseGPTModifierPyTorch from WandaPruningModifier
Fix docstrings

* Split SparseGPT and GPTQ modifiers (#2272)

* Update OBCQ

* Extract GPTQ Modifier

* [GPTQ Modifier UX] Update tests to use GPTQModifier for obcq style quantization (#2294)

* Update OBCQ

* Extract GPTQ Modifier

* Update test recipes

* GPTQ UX config groups support (#2273)

* Update OBCQ

* Extract GPTQ Modifier

* Update test recipes

* Add config_groups support to GPTQModifier

* mask_structure preservation test (#2284)

* test

* Preserve weight sparsity if greater than threshold

* Add argument to preserve sparsity mask in SparseGPT

* Fix case when mask is None

* Add test to check mask_structure
- the initial mask structure should be preserved
between consecutive runs; added test to check this

* Update tensor_follows_mask_structure to check for at least n zeros

---------

Co-authored-by: Sara Adkins <sara@neuralmagic.com>

* PR comments

---------

Co-authored-by: Sara Adkins <sara@neuralmagic.com>

* Fix default case

* Update test to use new vLLMQuantizationModifier

* Style

---------

Co-authored-by: Sara Adkins <sara@neuralmagic.com>
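The mask_structure commits above describe a check that a pruned weight tensor still follows an N:M sparsity pattern between consecutive runs. Below is a minimal sketch of such a check; the helper name `tensor_follows_mask_structure` appears in the commit log, but this body is a guess at the behavior ("at least n zeros" per group of m), not the repository's implementation.

```python
import torch

def tensor_follows_mask_structure(tensor: torch.Tensor, mask: str = "2:4") -> bool:
    """Check that every group of m consecutive weights has at least n zeros.

    Sketch based on the commit message "check for at least n zeros"; the real
    helper in the repository may differ in signature and grouping details.
    """
    n, m = (int(x) for x in mask.split(":"))
    # Flatten the tensor and split it into consecutive groups of m values.
    grouped = tensor.reshape(-1, m)
    zeros_per_group = (grouped == 0).sum(dim=1)
    # "At least n zeros" per group, rather than exactly n, per the final commit.
    return bool((zeros_per_group >= n).all())

# Example: a tensor pruned with 2:4 structure keeps at least 2 zeros per group of 4.
w = torch.tensor([[0.0, 0.0, 1.3, -0.7, 0.0, 2.1, 0.0, 0.4]])
assert tensor_follows_mask_structure(w, "2:4")
```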