VidSGG-BIG/VidVRD-helper/baseline at 3dde8e6d1f39b85ad384bd81b7b74fb17d7ba3f6 · Dawn-LX/VidSGG-BIG


The baseline code for the VidVRD dataset introduced in the following paper.

@inproceedings{shang2017video,
    author={Shang, Xindi and Ren, Tongwei and Guo, Jingfan and Zhang, Hanwang and Chua, Tat-Seng},
    title={Video Visual Relation Detection},
    booktitle={ACM International Conference on Multimedia},
    address={Mountain View, CA USA},
    month={October},
    year={2017}
}

Baseline Quick Start

Install the prerequisites

conda create -n vidvrd python=2.7 anaconda cmake tensorflow=1.8.0 keras tqdm ffmpeg=3.4 py-opencv
export PYTHONNOUSERSITE=1 && source activate vidvrd
pip install dlib==19.3.1 --isolated
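
Once the environment is active, a quick import check can confirm that the packages installed correctly. The following is a minimal sketch and not part of the repository: check_modules is a hypothetical helper, and the module names are assumed from the packages listed in the commands above (py-opencv is imported as cv2).

```python
import importlib


def check_modules(names):
    """Try to import each module and return {name: True/False}."""
    status = {}
    for name in names:
        try:
            importlib.import_module(name)
            status[name] = True
        except Exception:
            # ImportError, or any error raised while the module itself loads.
            status[name] = False
    return status


if __name__ == "__main__":
    # Assumed module names, derived from the conda and pip commands above.
    required = ["tensorflow", "keras", "tqdm", "cv2", "dlib"]
    for name, ok in sorted(check_modules(required).items()):
        print("{:<12} {}".format(name, "ok" if ok else "MISSING"))
```

Any module reported as MISSING points to a package that did not install into the vidvrd environment.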

Download the precomputed features, model and detected relations from here, and decompress the zip file in the folder that contains this repository, so that the output sits alongside it at ../vidvrd-baseline-output.
Evaluate the precomputed detected relations:

python evaluate.py vidvrd test relation ../vidvrd-baseline-output/models/baseline_relation_prediction.json

Because a few incorrect labels in the dataset were corrected after the paper was submitted, the results differ slightly from those reported in the paper. Some qualitative results can be found here.

Detect video visual relations using the precomputed model:

python baseline.py --detect

Train a new model on the precomputed features, after adjusting the hyperparameters in the script:

python baseline.py --train
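
Before running the evaluation step, it can help to confirm that the decompressed output is where the command expects it. The sketch below is an illustration under stated assumptions: summarize_json is a hypothetical helper, the path is the one used in the evaluation command, and nothing is assumed about the file's schema beyond it being valid JSON. It only reports the top-level type and size.

```python
import json
import os


def summarize_json(path):
    """Return a short description of a JSON file's top-level structure."""
    if not os.path.isfile(path):
        return "missing: {}".format(path)
    with open(path) as f:
        data = json.load(f)
    if isinstance(data, dict):
        return "dict with {} keys".format(len(data))
    if isinstance(data, list):
        return "list with {} items".format(len(data))
    return "scalar of type {}".format(type(data).__name__)


if __name__ == "__main__":
    # Path taken from the evaluation command in the quick start.
    print(summarize_json(
        "../vidvrd-baseline-output/models/baseline_relation_prediction.json"))
```

A "missing" result usually means the zip file was decompressed in a different folder than the relative path assumes.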
