- pytorch 1.1
- torchvision 0.3
- pyclipper
- opencv3
prepare a text in the following format, use '\t' as a separator
/path/to/img.jpg path/to/label.txt
...
- config the
train_data_path
,val_data_path
in config.json - use following script to run
python3 train.py
eval.py is used to test model on test dataset
- config
model_path
,img_path
,gt_path
,save_path
in eval.py - use following script to test
python3 eval.py
predict.py is used to inference on single image
- config
model_path
,img_path
, in predict.py - use following script to predict
python3 predict.py
The project is still under development.
only train on ICDAR2015 dataset
Method | Precision (%) | Recall (%) | F-measure (%) | FPS |
---|---|---|---|---|
paper(resnet18+short_size:736) | x | x | 80.4 | 26.1 |
my implementation(resnet50+short_size:736+pse扩张) | 60.06 | 48.57 | 53.71 | 12.18 (P100) |
my implementation(resnet50+short_size:736+聚类) | 54.91 | 45.21 | 49.59 | 1.44 (P100) |
my implementation(psenet+resnet50+short_size:736+pse扩张) | 76.9 | 78.57 | 77.73 | 8.79 (P100) |
TBD