MVP for Alternating Flow #1912

Merged: 44 commits into main from alternating_flow_pt2 on Jan 9, 2024

Conversation

@Satrat Satrat (Contributor) commented Dec 15, 2023

Initial implementation for alternating between oneshot and finetuning stages. This branch is based on two active PRs, which should be merged first.

Testing

test_multi_recipe.yaml

test_oneshot_stage:
  obcq_modifiers:
    SparseGPTModifier:
      sparsity: 0.5
      block_size: 128
      sequential_update: False
      quantize: False
      percdamp: 0.01
      prunen: 0
      prunem: 0
      targets: [
        "re:model.layers.\\d+$"
      ]
      target_ids: ["attention_mask", "position_ids"]  
test_finetune_stage:
  pruning_modifiers:
    ConstantPruningModifier:
      targets: [
        "re:.*self_attn.q_proj",
        "re:.*self_attn.k_proj",
        "re:.*self_attn.v_proj",
        "re:.*self_attn.o_proj",
        "re:.*mlp.gate_proj",
        "re:.*mlp.up_proj"
      ]
      start: 0
test_second_oneshot_stage:
  obcq_modifiers:
    SparseGPTModifier:
      sparsity: 0.7
      block_size: 128
      sequential_update: False
      quantize: False
      percdamp: 0.01
      prunen: 0
      prunem: 0
      targets: [
        "re:model.layers.\\d+$"
      ]
      target_ids: ["attention_mask", "position_ids"]  
test_second_finetune_stage:
  pruning_modifiers:
    ConstantPruningModifier:
      targets: [
        "re:.*self_attn.q_proj",
        "re:.*self_attn.k_proj",
        "re:.*self_attn.v_proj",
        "re:.*self_attn.o_proj",
        "re:.*mlp.gate_proj",
        "re:.*mlp.up_proj"
      ]
      start: 0
test_quantization_oneshot_stage:
  obcq_modifiers:
    QuantizationModifier:
      ignore:
        - LlamaRotaryEmbedding
        - LlamaRMSNorm
        - SiLUActivation
        - model.layers.0.mlp.down_proj
        - model.layers.1.mlp.down_proj
        - model.layers.2.mlp.down_proj
        - model.layers.3.mlp.down_proj
        - model.layers.4.mlp.down_proj
        - model.layers.5.mlp.down_proj
      post_oneshot_calibration: False
      scheme_overrides:
        Embedding:
          input_activations: null
          weights:
            num_bits: 8
            symmetric: False
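
The recipe above defines five stages that alternate oneshot (obcq_modifiers) and finetuning (pruning_modifiers) work, ending with a quantization oneshot stage. As a rough illustration of the alternating flow (not the SparseML implementation), a driver only needs to walk the stages in order and dispatch on the modifier group each stage declares; the stage-runner calls below are hypothetical placeholders:

import yaml  # requires PyYAML

with open("test_multi_recipe.yaml") as f:
    recipe = yaml.safe_load(f)  # dict preserves the stage order from the file

for stage_name, groups in recipe.items():
    if "obcq_modifiers" in groups:
        print(f"{stage_name}: oneshot stage")
        # run_oneshot_stage(stage_name, groups["obcq_modifiers"])  # hypothetical
    elif "pruning_modifiers" in groups:
        print(f"{stage_name}: finetuning stage")
        # run_finetune_stage(stage_name, groups["pruning_modifiers"])  # hypothetical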

Test script:

def run():
    from sparseml.transformers.finetune.text_generation import run_general

    model = "Xenova/llama2.c-stories15M"
    dataset_name = "open_platypus"
    concatenate_data = False
    run_stages = True  # run the stages defined in the recipe, in order
    output_dir = "./output_oneshot"
    overwrite_output_dir = True
    recipe = "test_multi_recipe.yaml"
    # split the source train set: first half for calibration, second half for training
    splits = {
        "calibration": "train[:50%]",
        "train": "train[50%:]"
    }

    run_general(
        model_name_or_path=model,
        dataset_name=dataset_name,
        run_stages=run_stages,
        output_dir=output_dir,
        overwrite_output_dir=overwrite_output_dir,
        recipe=recipe,
        concatenate_data=concatenate_data,
        splits=splits,
    )

if __name__ == "__main__":
    run()
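
After the run completes, one hypothetical sanity check is to load the saved model and measure how sparse the pruned projection weights actually are (the second SparseGPT stage targets 0.7 sparsity). This sketch assumes the final checkpoint lands directly in output_dir; the exact subdirectory layout may differ (see the checkpoint-overwriting issue below):

import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("./output_oneshot")

# only count the projections kept sparse by ConstantPruningModifier
targets = ("q_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "up_proj")

total = zeros = 0
for name, module in model.named_modules():
    if isinstance(module, torch.nn.Linear) and name.endswith(targets):
        weight = module.weight.detach()
        total += weight.numel()
        zeros += (weight == 0).sum().item()

print(f"measured sparsity: {zeros / total:.3f}")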

Known Issues / Shortcomings

  • FSDP hasn't been tested yet
  • Training checkpoints get overwritten during subsequent finetuning runs
  • No way to specify a different number of epochs for each finetune stage
  • No way to specify different dataset splits for different finetuning stages
  • Checkpoint loading between stages is not implemented
  • The output recipe doesn't indicate which stages have been run and which haven't
  • No unit or integration tests!

bfineran previously approved these changes Dec 28, 2023
@Satrat Satrat marked this pull request as ready for review January 2, 2024 21:31
@Satrat Satrat requested a review from bfineran January 4, 2024 23:31
@Satrat Satrat mentioned this pull request Jan 4, 2024
@rahul-tuli rahul-tuli (Member) left a comment

LGTM! Good tests

@Satrat Satrat merged commit f592037 into main Jan 9, 2024
11 of 12 checks passed
@Satrat Satrat deleted the alternating_flow_pt2 branch January 9, 2024 14:27