
[Feature] Support LoRA #1687

Merged · 37 commits merged into open-mmlab:dev on Jul 24, 2023

Conversation

@fanqiNO1 (Contributor) commented on Jul 4, 2023

Motivation

Support LoRA (Low-Rank Adaptation) for parameter-efficient fine-tuning: adapt large pretrained backbones by training only small low-rank adapter weights while keeping the pretrained weights frozen.

Modification

Add a LoRAModel wrapper (mmpretrain/models/peft/lora.py) that wraps an existing backbone and injects trainable low-rank adapters into the target linear layers (e.g. the qkv projections of a ViT), plus a script to merge the trained LoRA weights back into the original model (tools/model_converters/merge_lora_weight.py).
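For reference, this follows the update rule from the original LoRA paper (Hu et al., 2021): a frozen pretrained weight $W_0$ is augmented with a trainable low-rank product, so the adapted forward pass is

$$
h = W_0 x + \frac{\alpha}{r} B A x, \qquad B \in \mathbb{R}^{d \times r},\ A \in \mathbb{R}^{r \times k},\ r \ll \min(d, k),
$$

where only $A$ and $B$ are trained; $r$ and $\alpha$ correspond to the `rank` and `alpha` options in the config below.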

Use cases

```python
model = dict(
    type='ImageClassifier',
    backbone=dict(
        # Wrap the frozen backbone with LoRA adapters.
        type='LoRAModel',
        module=dict(
            type='VisionTransformer',
            arch='b',
            img_size=384,
            patch_size=16,
            drop_rate=0.1,
            init_cfg=dict(
                type='Pretrained', checkpoint='', prefix='backbone')),
        alpha=16,   # scaling factor of the low-rank update
        rank=16,    # rank of the adapter matrices
        drop_rate=0.1,
        targets=[dict(type='qkv')]),  # inject LoRA into the qkv projections
    neck=None,
    head=dict(
        type='VisionTransformerClsHead',
        num_classes=1000,
        in_channels=768,
        loss=dict(
            type='LabelSmoothLoss', label_smooth_val=0.1,
            mode='classy_vision'),
        init_cfg=[dict(type='TruncNormal', layer='Linear', std=2e-5)],
    ))
```
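To make the wrapper's behavior concrete, here is a minimal, illustrative sketch of the LoRA idea. It is not the actual implementation in mmpretrain/models/peft/lora.py; the class and attribute names are assumptions:

```python
# Minimal sketch of the LoRA idea (illustrative, not the mmpretrain
# implementation): a frozen linear layer plus a trainable low-rank
# update scaled by alpha / rank.
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    def __init__(self, original: nn.Linear, rank: int = 16,
                 alpha: int = 16, drop_rate: float = 0.1):
        super().__init__()
        self.original = original
        for p in self.original.parameters():
            p.requires_grad = False  # freeze the pretrained weights
        self.lora_down = nn.Linear(original.in_features, rank, bias=False)
        self.lora_up = nn.Linear(rank, original.out_features, bias=False)
        nn.init.zeros_(self.lora_up.weight)  # update starts at zero
        self.dropout = nn.Dropout(drop_rate)
        self.scaling = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # y = W0 x + (alpha / rank) * B(A(x))
        return self.original(x) + self.scaling * self.lora_up(
            self.lora_down(self.dropout(x)))
```

Because the up-projection is zero-initialized, the wrapped layer initially behaves exactly like the pretrained one, and only the two small adapter matrices receive gradients.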

Experiments

  1. Fine-tune a ViT-B/16 model (pretrained on ImageNet-21k at 224px) on ImageNet-1k at 384px.
    The GPU memory consumption is reduced from 24 GB to 17 GB.
    The number of trainable parameters is reduced from 88M to 1.2M (a quick way to verify this count is sketched after this list).
    The accuracy is accuracy/top1: 84.1000, accuracy/top5: 97.1200.
    The accuracy/top1 reported in the original paper, "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale", is 83.97.

  2. Fine-tune a DINOv2-small model on ImageNet-1k.
    The accuracy is accuracy/top1: 81.4360, accuracy/top5: 95.9140.
    The accuracy/top1 reported in the original paper, "DINOv2: Learning Robust Visual Features without Supervision", is 81.1 (linear evaluation of frozen pretrained features on ImageNet-1k).

  3. Fine-tune BLIP-2 on COCO Caption.
    The GPU memory consumption is reduced from OOM (>80 GB) to 60 GB.
    The result is BLEU@4: 42.65, CIDEr: 143.84.
    The result reported in the original paper, "BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models", is BLEU@4: 43.7, CIDEr: 145.8. (The original paper fine-tunes the whole vision backbone, whereas this PR only fine-tunes the attention layers of the vision backbone.)
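As a quick sanity check, the trainable-parameter count can be verified on any built model. This is a minimal sketch, not part of the PR; `model` is assumed to be an already-constructed mmpretrain model instance:

```python
# Count the parameters left trainable after LoRA wrapping.
# `model` is an assumed, already-built mmpretrain model instance.
num_trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
num_total = sum(p.numel() for p in model.parameters())
print(f'trainable: {num_trainable / 1e6:.1f}M / total: {num_total / 1e6:.1f}M')
```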

Checklist

Before PR:

  • Pre-commit or other linting tools are used to fix the potential lint issues.
  • Bug fixes are fully covered by unit tests, and the case that caused the bug is added to the unit tests.
  • The modification is covered by complete unit tests. If not, please add more unit tests to ensure correctness.
  • The documentation has been modified accordingly, like docstring or example tutorials.

After PR:

  • If the modification has potential influence on downstream or other related projects, this PR should be tested with those projects, like MMDet or MMSeg.
  • CLA has been signed and all committers have signed the CLA in this PR.

@codecov (bot) commented on Jul 4, 2023

Codecov Report

Patch coverage: 35.86%; project coverage change: -2.93% ⚠️

Comparison is base (f9dcae2) 68.16% compared to head (476c073) 65.24%.

❗ Current head 476c073 differs from the pull request's most recent head 8d69857. Consider uploading reports for commit 8d69857 to get more accurate results.

Additional details and impacted files
@@            Coverage Diff             @@
##              dev    #1687      +/-   ##
==========================================
- Coverage   68.16%   65.24%   -2.93%     
==========================================
  Files         295      332      +37     
  Lines       23372    25839    +2467     
  Branches     3713     4127     +414     
==========================================
+ Hits        15932    16859     +927     
- Misses       6880     8362    +1482     
- Partials      560      618      +58     
Flag Coverage Δ
unittests 65.24% <35.86%> (-2.93%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
mmpretrain/apis/feature_extractor.py 37.50% <0.00%> (ø)
mmpretrain/apis/image_caption.py 30.64% <0.00%> (ø)
mmpretrain/apis/image_retrieval.py 21.42% <0.00%> (ø)
mmpretrain/apis/visual_grounding.py 27.53% <0.00%> (ø)
mmpretrain/apis/visual_question_answering.py 25.67% <0.00%> (ø)
mmpretrain/datasets/__init__.py 60.46% <0.00%> (-13.83%) ⬇️
mmpretrain/datasets/flickr30k_caption.py 0.00% <0.00%> (ø)
mmpretrain/datasets/flickr30k_retrieval.py 0.00% <0.00%> (ø)
mmpretrain/datasets/gqa_dataset.py 0.00% <0.00%> (ø)
mmpretrain/datasets/nocaps.py 0.00% <0.00%> (ø)
... and 67 more

... and 8 files with indirect coverage changes


@fangyixiao18 (Collaborator) left a comment

  1. provide an example lora config of ViT
  2. add a lora weights merge script for users (the core arithmetic of such a merge is sketched below)
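For context, here is a hedged sketch of what a LoRA weight merge does. The shipped script lives in tools/model_converters/merge_lora_weight.py; this shows only the core arithmetic, with illustrative tensor names:

```python
import torch


def merge_lora(weight: torch.Tensor, lora_down: torch.Tensor,
               lora_up: torch.Tensor, alpha: int, rank: int) -> torch.Tensor:
    """Fold a trained LoRA update into the frozen weight.

    weight:    (out_features, in_features) pretrained matrix W0
    lora_down: (rank, in_features) matrix A
    lora_up:   (out_features, rank) matrix B
    Returns W' = W0 + (alpha / rank) * B @ A, so inference needs no adapters.
    """
    return weight + (alpha / rank) * (lora_up @ lora_down)
```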

Review threads:
  • mmpretrain/models/peft/lora.py (resolved)
  • tools/model_converters/merge_lora_weight.py (outdated, resolved)
  • tests/test_models/test_peft/test_lora.py (outdated, resolved)
@mzr1996 merged commit 64c446d into open-mmlab:dev on Jul 24, 2023
9 of 10 checks passed
@fanqiNO1 deleted the lora branch on Jul 24, 2023 at 03:39