LoRA-ViT

Low-rank adaptation (LoRA) for Vision Transformers, with support for classification and segmentation.
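
The idea behind LoRA: freeze the pretrained weight W and learn a low-rank update, y = Wx + (alpha / r) * B(Ax), so only two small matrices per adapted layer are trained. Here is a minimal sketch of such a layer; it illustrates the technique, not this repo's exact implementation (which injects the update into the attention projections):

import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Minimal LoRA layer: y = Wx + (alpha / r) * B(Ax), with W frozen."""
    def __init__(self, base: nn.Linear, r: int = 4, alpha: int = 4):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # freeze the pretrained weight (and bias)
        self.lora_a = nn.Linear(base.in_features, r, bias=False)   # A: d_in -> r
        self.lora_b = nn.Linear(r, base.out_features, bias=False)  # B: r -> d_out
        nn.init.zeros_(self.lora_b.weight)  # the low-rank update starts at zero
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * self.lora_b(self.lora_a(x))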

Features

  • Supported DeepLab segmentation for lukemelas/PyTorch-Pretrained-ViT. (2023-03-15)
  • Supported timm models. (2023-03-16)
  • Repo cleanup.

Installation

Git clone this repo (https://github.com/zhaozh10/LoRA-ViT). Tested with torch.__version__==1.13.0; any version newer than torch.__version__==1.10.0 should also work. You also need the safetensors package from Hugging Face (pip install safetensors) to save and load the LoRA weights.

Usage

You can use a Vision Transformer from timm:

import timm
import torch
from lora import LoRA_ViT_timm

img = torch.randn(2, 3, 224, 224)  # a toy batch of two 224x224 RGB images
model = timm.create_model('vit_base_patch16_224', pretrained=True)
lora_vit = LoRA_ViT_timm(vit_model=model, r=4, num_classes=10)  # rank-4 LoRA + 10-class head
pred = lora_vit(img)
print(pred.shape)  # torch.Size([2, 10])
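
During training, only the LoRA matrices and the new classification head receive gradients, so the optimizer state stays small. A minimal fine-tuning sketch, reusing lora_vit and img from above with made-up labels:

import torch.nn.functional as F

optimizer = torch.optim.AdamW(
    [p for p in lora_vit.parameters() if p.requires_grad], lr=1e-4)
labels = torch.randint(0, 10, (2,))  # dummy targets for the toy batch
loss = F.cross_entropy(lora_vit(img), labels)
loss.backward()
optimizer.step()
optimizer.zero_grad()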

If timm is too complicated, you can use a simpler ViT implementation from lukemelas/PyTorch-Pretrained-ViT. Wrap your ViT with LoRA_ViT; here is a simple classification example:

from base_vit import ViT
import torch
from lora import LoRA_ViT

model = ViT('B_16_imagenet1k')
model.load_state_dict(torch.load('B_16_imagenet1k.pth'))
img = torch.randn(1, 3, 384, 384)  # B_16_imagenet1k expects 384x384 inputs
preds = model(img)  # preds.shape = torch.Size([1, 1000])

num_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"trainable parameters: {num_params}")  # trainable parameters: 86859496

lora_model = LoRA_ViT(model, r=4, num_classes=10)  # freeze the backbone, add LoRA + 10-class head
num_params = sum(p.numel() for p in lora_model.parameters() if p.requires_grad)
print(f"trainable parameters: {num_params}")  # trainable parameters: 147456

Here is an example for segmentation tasks, using DeepLabv3:

model = ViT('B_16_imagenet1k')
model.load_state_dict(torch.load('B_16_imagenet1k.pth'))
lora_model = LoRA_ViT(model, r=4)
seg_lora_model = SegWrapForViT(vit_model=lora_model, image_size=384,
                               patches=16, dim=768, n_classes=10)

num_params = sum(p.numel() for p in seg_lora_model.parameters() if p.requires_grad)
print(f"trainable parameters: {num_params / 2**20:.3f} Mi")  # 6.459 Mi, i.e. about 6.8M parameters

Save and load the LoRA weights:

lora_model.save_lora_parameters('mytask.lora.safetensors') # save
lora_model.load_lora_parameters('mytask.lora.safetensors') # load
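
Since only the LoRA tensors (not the frozen backbone) go into the file, checkpoints stay small. To peek inside one, a sketch using the safetensors API:

from safetensors import safe_open

with safe_open('mytask.lora.safetensors', framework='pt') as f:
    for key in f.keys():  # one entry per saved tensor
        print(key, f.get_tensor(key).shape)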

Performance

On an M1 Pro, LoRA fine-tuning is about 1.8x~1.9x faster than full fine-tuning. Run python performance_profile.py to reproduce the time profile. More tests will come soon.
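
For a rough number on your own machine without the repo's profiler, here is a minimal timing sketch (hypothetical setup; pretrained weights do not affect speed, so they are skipped):

import time
import timm
import torch
from lora import LoRA_ViT_timm

def mean_step_time(model, img, steps=10):
    # Average seconds per forward + backward + optimizer step.
    opt = torch.optim.AdamW([p for p in model.parameters() if p.requires_grad], lr=1e-4)
    start = time.time()
    for _ in range(steps):
        opt.zero_grad()
        model(img).sum().backward()
        opt.step()
    return (time.time() - start) / steps

img = torch.randn(2, 3, 224, 224)
full = timm.create_model('vit_base_patch16_224', pretrained=False)  # full fine-tuning baseline
lora = LoRA_ViT_timm(vit_model=timm.create_model('vit_base_patch16_224', pretrained=False),
                     r=4, num_classes=10)  # only LoRA + head are trainable
print(f"full: {mean_step_time(full, img):.3f}s/step, lora: {mean_step_time(lora, img):.3f}s/step")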

Credit

The ViT code and ImageNet-pretrained weights come from lukemelas/PyTorch-Pretrained-ViT.
