Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Split Wanda and SparseGPT #2266

Merged
merged 1 commit into from
May 6, 2024
Merged

Conversation

rahul-tuli
Copy link
Member

@rahul-tuli rahul-tuli commented May 3, 2024

This PR refactors SparseGPT modifier(s) to remove their inheritance from Wanda, making SparseGPT fully independent. The change improves the modularity of the SparseGPT architecture, making it more straightforward to maintain and extend/modify

Test plan: Ran all the tests, + a manual one shot script on HuggingFaceM4/tiny-random-LlamaForCausalLM

Make sparsegpt not inherit from wanda modifier
Decouple SparseGPTModifierPyTorch from WandaPruningModifier
Fix docstrings
@rahul-tuli rahul-tuli changed the title [WIP] Split Wanda and SparseGPT Split Wanda and SparseGPT May 6, 2024
@rahul-tuli rahul-tuli marked this pull request as ready for review May 6, 2024 14:13
@dbogunowicz dbogunowicz merged commit e7fa21c into quant-modifier-ux May 6, 2024
@dbogunowicz dbogunowicz deleted the split-wanda-and-sparsegpt branch May 6, 2024 14:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants