Preserve sparsity GPTQ #2281

rahul-tuli · 2024-05-13T13:44:38Z

A bug was recently found: when the GPTQ modifier was applied directly after SparseGPT, the weight sparsity mask was not respected. This PR fixes that by preserving the mask during GPTQ. The mask is preserved automatically when the weight sparsity is greater than SPARSITY_THRESHOLD, which is set to 5% for now.
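For reviewers who want the idea in isolation, here is a minimal, dependency-free sketch of the behaviour described above. It is not the code in this PR: `weight_sparsity`, `toy_quantize`, and `quantize_preserving_sparsity` are hypothetical names, plain nested lists stand in for weight tensors, and the quantizer is a toy rounding grid (deliberately offset so that zero is not on the grid) rather than the real GPTQ update. Only the 5% value of `SPARSITY_THRESHOLD` comes from the description.

```python
# Threshold from the PR description: preserve the mask above 5% sparsity.
SPARSITY_THRESHOLD = 0.05


def weight_sparsity(weights):
    """Return the fraction of entries that are exactly zero."""
    total = sum(len(row) for row in weights)
    zeros = sum(1 for row in weights for w in row if w == 0.0)
    return zeros / total if total else 0.0


def toy_quantize(w, scale=0.25, offset=0.1):
    """Toy stand-in for the GPTQ weight update (assumption, not GPTQ).

    The grid is shifted by `offset`, so a zero weight is moved off zero,
    which mimics how an update can destroy an existing sparsity mask.
    """
    return round((w - offset) / scale) * scale + offset


def quantize_preserving_sparsity(weights, preserve_sparsity=None):
    """Quantize `weights`, keeping pruned (zero) entries at zero if requested.

    When `preserve_sparsity` is None, it is decided automatically by
    comparing the measured sparsity against SPARSITY_THRESHOLD.
    """
    if preserve_sparsity is None:
        preserve_sparsity = weight_sparsity(weights) > SPARSITY_THRESHOLD

    result = []
    for row in weights:
        new_row = []
        for w in row:
            if preserve_sparsity and w == 0.0:
                # Entry was pruned earlier: leave it at zero.
                new_row.append(0.0)
            else:
                new_row.append(toy_quantize(w))
        result.append(new_row)
    return result
```

Calling `quantize_preserving_sparsity(weights)` on a half-sparse matrix leaves every pruned entry at zero, while passing `preserve_sparsity=False` reproduces the behaviour of the bug, where the update overwrites the zeros.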

Credits to @Satrat and @abhinavnmagic for proposing the fix

The unit test for consecutive application now runs without the increased relative tolerance that was introduced as part of #2272.

test

9749362

This was referenced May 13, 2024

test #2278

Closed

[Feature Branch] Quant modifier UX #2263

Merged

rahul-tuli requested review from Satrat, bfineran, dsikka, horheynm, dbogunowicz and abhinavnmagic May 13, 2024 13:46

Preserve weight sparsity if greater than threshold

77ad1a2

rahul-tuli force-pushed the preserve-sparsity-gptq branch from c220ab9 to 77ad1a2 Compare May 13, 2024 13:59

bfineran approved these changes May 13, 2024

Satrat approved these changes May 15, 2024

rahul-tuli merged commit 14a1b08 into gptq-ux-config-groups May 17, 2024

rahul-tuli deleted the preserve-sparsity-gptq branch May 17, 2024 16:15
