
STGAN in PyTorch

This is a PyTorch implementation of the STGAN model from the paper "Cloud Removal in Satellite Images Using Spatiotemporal Generative Models" by Sarukkai*, Jain*, Uzkent, and Ermon (https://arxiv.org/abs/1912.06838). This implementation was written by Vishnu Sarukkai (https://github.com/VSAnimator/) and Anirudh Jain.

This implementation is built on a clone of the Pix2Pix implementation from https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix, and this README is likewise based on the README from the CycleGAN/pix2pix repo. That repo contains the code corresponding to the following two papers:

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks.
Jun-Yan Zhu*, Taesung Park*, Phillip Isola, Alexei A. Efros. In ICCV 2017. (* equal contributions)

Image-to-Image Translation with Conditional Adversarial Networks.
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros. In CVPR 2017.

Prerequisites

  • Linux or macOS
  • Python 3
  • CPU or NVIDIA GPU + CUDA CuDNN

Getting Started

Installation

  • Clone this repo:
git clone https://github.com/VSAnimator/stgan
cd stgan
  • Install PyTorch 0.4+ and other dependencies (e.g., torchvision, visdom, and dominate).
    • For pip users, run pip install -r requirements.txt.

STGAN dataset

Download the STGAN dataset from https://doi.org/10.7910/DVN/BSETKZ. This download includes two datasets (singleImage and multipleImage), described both in the paper and on the linked page. Before training a model, the images must be divided into train/val/test splits. For instance, for both singleImage/clear and singleImage/cloudy, create three subfolders named "train", "val", and "test", then assign the images from the main directory to the corresponding partitions, either at random or in a way that keeps images from the same "tile" together (since crops from the same tile may be correlated). In our case, we used an 80-10-10 train-val-test split while keeping image crops from the same tile together. Make sure that corresponding clear and cloudy images are assigned to the same split. If only RGB data is desired, remove the "IR" image files from the cloudy image folders.
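For reference, a tile-grouped split could be scripted along the following lines. This is a minimal sketch, not part of the repo: it assumes each filename begins with a tile identifier (e.g. "tile12_crop3.png"), and the tile_id function is a placeholder to adapt to the actual naming scheme of the download.

import os
import random
import shutil

def split_by_tile(src_dir, fractions=(("train", 0.8), ("val", 0.1), ("test", 0.1)), seed=0):
    # Assign whole tiles to train/val/test so that correlated crops stay together.
    def tile_id(fname):
        return fname.split("_")[0]  # placeholder: adapt to the real filenames

    files = [f for f in os.listdir(src_dir) if os.path.isfile(os.path.join(src_dir, f))]
    tiles = sorted({tile_id(f) for f in files})
    random.Random(seed).shuffle(tiles)  # a fixed seed keeps cloudy/clear splits in sync

    assignment, start = {}, 0
    for split, frac in fractions:
        count = round(frac * len(tiles))
        for t in tiles[start:start + count]:
            assignment[t] = split
        start += count
    for t in tiles[start:]:  # rounding leftovers go to train
        assignment[t] = "train"

    for f in files:
        dest = os.path.join(src_dir, assignment[tile_id(f)])
        os.makedirs(dest, exist_ok=True)
        shutil.move(os.path.join(src_dir, f), os.path.join(dest, f))

Running the script with the same seed on both the cloudy and clear folders keeps corresponding images in the same split, provided the tile identifiers match across folders.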

Once the data is formatted this way, call one of the dataset creation scripts corresponding to the dataset:

SingleImage, RGB data:

python datasets/combine_normal.py --fold_A /path/to/data/cloudy --fold_B /path/to/data/clear --fold_AB /path/to/data/combined

SingleImage, RGB+IR data:

python datasets/combine_normal_ir.py --fold_A /path/to/data/cloudy --fold_B /path/to/data/clear --fold_AB /path/to/data/combined

MultipleImage, RGB data:

python datasets/combine_temporal.py --fold_A /path/to/data/cloudy --fold_B /path/to/data/clear --fold_AB /path/to/data/combined

MultipleImage, RGB+IR data:

python datasets/combine_temporal_ir.py --fold_A /path/to/data/cloudy --fold_B /path/to/data/clear --fold_AB /path/to/data/combined

This combines each (cloudy, clear) pair into a single image file ready for training. The scripts differ slightly in how they process their respective input formats.
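Conceptually, the combination follows the pix2pix convention of concatenating each aligned pair side by side into one file. The sketch below illustrates the idea for a single RGB pair; it is an assumption about the scripts' behavior, and the bundled combine_* scripts remain the authoritative versions.

import numpy as np
from PIL import Image

def combine_pair(cloudy_path, clear_path, out_path):
    # Load the aligned (cloudy, clear) pair and write them as one side-by-side image.
    a = np.array(Image.open(cloudy_path).convert("RGB"))
    b = np.array(Image.open(clear_path).convert("RGB"))
    assert a.shape == b.shape, "pairs must be aligned and equally sized"
    Image.fromarray(np.concatenate([a, b], axis=1)).save(out_path)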

STGAN training

  • Train a model:
python train.py --dataroot ./path/to/data/combined  --name stgan --model temporal_branched_ir --netG unet_256_independent --dataset_mode temporal_ir --input_nc 4
  • Options include temporal_branched or temporal_branched_ir for "model", 3 or 4 for "input_nc", and temporal or temporal_ir for "dataset_mode" (in each case depending on whether infrared data is used), as well as unet_256_independent or independent_resnet_9blocks for "netG". Further options include "batch_size", "preprocess", and "norm"; see train.py for detailed training options.

  • Our best models often used a larger batch_size and norm=batch; both options for netG worked well (the resnet generator takes longer to train than the unet but sometimes improves accuracy). An illustrative command combining these options appears after this list.

  • To view training results and loss plots, run python -m visdom.server and click the URL http://localhost:8097.

  • Test the model:

python test.py --dataroot ./path/to/data/combined  --name stgan --model temporal_branched_ir --netG unet_256_independent --dataset_mode temporal_ir --input_nc 4
  • The test results will be saved to an HTML file: ./results/stgan/test_latest/index.html.
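As an illustration of the training options discussed above, a run combining a larger batch size, batch normalization, and the resnet generator might look like the following (the batch size of 8 is an arbitrary example value; see train.py for all flags):

python train.py --dataroot ./path/to/data/combined --name stgan_resnet --model temporal_branched_ir --netG independent_resnet_9blocks --dataset_mode temporal_ir --input_nc 4 --batch_size 8 --norm batch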
