Unofficial implementation of Stand-Alone Self-Attention in Vision Models (obsolete)

MerHS/SASA-pytorch

This is NOT an official implementation. Please let me know if this implementation misreads the original paper in any way.

Prerequisites

  • Python 3.6+
  • PyTorch 1.1.0+
  • scipy
  • Pillow
  • torchvision

Benchmark (WIP)

Trained on ImageNet. (WIP: CIFAR-10, CIFAR-100)

The backbone network and parameters are based on the official torchvision ResNet and trainer example.

Trained for up to 90 epochs with a batch size of 64 on a single NVIDIA 1080Ti GPU, using the SGD optimizer with a learning rate of 0.1 that is linearly warmed up for 10 epochs and then follows cosine decay (as in the SASA paper).
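The schedule described above can be sketched as a plain function of the epoch index. This is only an illustration, not the code used in this repository: the name `lr_at_epoch` is hypothetical, and it assumes the learning rate is updated once per epoch (the actual trainer may update it per iteration) and that warmup reaches the base rate of 0.1 at the end of the 10th epoch.

```python
import math

BASE_LR = 0.1        # peak learning rate stated above
WARMUP_EPOCHS = 10   # linear warmup length stated above
TOTAL_EPOCHS = 90    # total training length stated above


def lr_at_epoch(epoch, base_lr=BASE_LR, warmup_epochs=WARMUP_EPOCHS,
                total_epochs=TOTAL_EPOCHS):
    """Return the learning rate for a 0-indexed epoch (hypothetical helper).

    Assumption: per-epoch updates. Linear warmup ramps from
    base_lr / warmup_epochs up to base_lr, then cosine decay
    moves from base_lr toward 0 over the remaining epochs.
    """
    if epoch < warmup_epochs:
        # Linear warmup: epoch 0 gets 1/10 of base_lr, epoch 9 gets base_lr.
        return base_lr * (epoch + 1) / warmup_epochs
    # Cosine decay: progress runs from 0 at the first post-warmup epoch
    # toward 1 at total_epochs.
    progress = (epoch - warmup_epochs) / (total_epochs - warmup_epochs)
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    for e in (0, 5, 9, 10, 50, 89):
        print(f"epoch {e:2d}: lr = {lr_at_epoch(e):.5f}")
```

A function like this could be wrapped for `torch.optim.lr_scheduler.LambdaLR` by returning `lr_at_epoch(epoch) / BASE_LR` as the multiplicative factor, since `LambdaLR` scales the optimizer's initial learning rate.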
