Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
calibration		calibration
images		images
lib/g2opy_changes		lib/g2opy_changes
settings		settings
test		test
videos/kitti06		videos/kitti06
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.ini		config.ini
config.py		config.py
constants.py		constants.py
convert_groundtruth.py		convert_groundtruth.py
dataset.py		dataset.py
display2D.py		display2D.py
feature_detector.py		feature_detector.py
feature_matcher.py		feature_matcher.py
feature_tracker.py		feature_tracker.py
frame.py		frame.py
geom_helpers.py		geom_helpers.py
ground_truth.py		ground_truth.py
helpers.py		helpers.py
initializer.py		initializer.py
install_pip3_packages.sh		install_pip3_packages.sh
install_thirdparty.sh		install_thirdparty.sh
main_slam.py		main_slam.py
main_vo.py		main_vo.py
map.py		map.py
map_point.py		map_point.py
moving_average.py		moving_average.py
mplot2d.py		mplot2d.py
mplot3d.py		mplot3d.py
optimizer_g2o.py		optimizer_g2o.py
pinhole_camera.py		pinhole_camera.py
slam.py		slam.py
timer.py		timer.py
video_streamer.py		video_streamer.py
viewer3D.py		viewer3D.py
visual_odometry.py		visual_odometry.py

Repository files navigation

pySLAM

Author: Luigi Freda

pySLAM is a 'toy' implementation of a Visual Odometry (VO) pipeline in Python. It has been developed for educational purposes for a computer vision class I taught. I started developing it for fun, during my free-time, taking inspiration from some repos available on the web.

Main Scripts:

main_vo.py combines the simplest VO ingredients without performing any image point triangulation or windowed bundle adjustment. At each step $k$, main_vo.py estimates the current camera pose $C_k$ with respect to the previous one $C_{k-1}$. The inter frame pose estimation returns $[R_{k-1,k},t_{k-1,k}]$ with $||t_{k-1,k}||=1$. With this very basic computation, you need to use a ground truth in order to recover a correct inter-frame scale $s$ and estimate a meaningful trajectory by composing $C_k = C_{k-1} * [R_{k-1,k}, s t_{k-1,k}]$. This script is a first start to understand the basics of inter frame feature tracking and camera pose estimation.
main_slam.py adds feature tracking along multiple frames, point triangulation and bundle adjustment in order to estimate the camera trajectory up-to-scale and a build a local map. It's still a basic VO pipeline but it shows some basic blocks which are necessary to develop a real visual SLAM pipeline.

You can use this framework as a baseline to create your own (proof of concept) VO/SLAM pipelines in python. When you test it, please, consider that's intended as a simple 'toy' framework. Check the terminal warnings if something weird happens.

Enjoy it!

Requirements

Python 3 (tested under Python 3.5)
Numpy
OpenCV (see below for a suggested python installation)

You may need to install some python3 packages. These packages can be automatically installed by running:

$ ./install_pip3_packages.sh

If you want to run main_slam.py you have to install the libs:

pangolin
g2o

This can be easily done by running the script:

$ ./install_thirdparty.sh

Usage

You can test the code right away by running:

$ python3 -O main_vo.py

This will process a KITTI video (available in the folder videos) by using its corresponding camera calibration file (available in the folder settings), and its groundtruth (available in the video folder).

N.B.: remind, the simple script main_vo.py strictly requires a ground truth, since the relative motion between two adjacent camera frames can be only estimated up to scale with a monocular camera (i.e. the inter frame pose estimation returns $[R_{k-1,k},t_{k-1,k}]$ with $||t_{k-1,k}||=1$).

In order to process a different dataset, you need to set the file config.ini:

select your dataset type in the section [DATASET] (see the section Datasets below for further details)
the camera settings file accordingly (see the section Camera Settings below)
the groudtruth file accordingly (see the section Camera Settings below)

If you want to test the script main_slam.py, you can run:

$ python3 -O main_slam.py

Datasets

You can use 4 different types of datasets:

Dataset	type in `config.ini`
KITTI odometry data set (grayscale, 22 GB)	`type=KITTI_DATASET`
TUM dataset	`type=TUM_DATASET`
video file	`type=VIDEO_DATASET`
folder of images	`type=FOLDER_DATASET`

KITTI Datasets

The code expects the following structure in the specified path folder (section [KITTI_DATASET] of config.ini). :

├── sequences
    ├── 00
    ...
    ├── 21
├── poses
    ├── 00.txt
        ...
    ├── 10.txt

Download the dataset (grayscale images) from http://www.cvlibs.net/datasets/kitti/eval_odometry.php and prepare the folder as specified above
Select the corresponding calibration settings file (parameter [KITTI_DATASET][cam_settings] in config.ini)

TUM Datasets

The code expects a file associations.txt correctly generated in each TUM dataset folder (specified in the section [TUM_DATASET] of config.ini).

Download a sequence from http://vision.in.tum.de/data/datasets/rgbd-dataset/download and uncompress it.
Associate RGB images and depth images using the python script associate.py. You can generate your own associations file executing:

$ python associate.py PATH_TO_SEQUENCE/rgb.txt PATH_TO_SEQUENCE/depth.txt > associations.txt

Select the corresponding calibration settings file (parameter [TUM_DATASET][cam_settings] in config.ini)

Camera Settings

The folder settings contains the camera settings files which can be used for testing the code. These are the same used in the framework ORBSLAM2. You can easily modify one of those files for creating your own new calibration file (for your new datasets).

In order to calibrate your camera, you can use the scripts in the folder calibration and you may want to have a look here. In particular:

use the script grab_chessboard_images.py to collect a sequence of images where the chessboard can be detected (set the chessboard size there)
use the script calibrate.py to process the collected images and compute the calibration parameters (set the chessboard size there)

References

Suggested books:

Multiple View Geometry in Computer Vision by Richard Hartley and Andrew Zisserman
An Invitation to 3-D Vision by Yi-Ma, Stefano Soatto, Jana Kosecka, S. Shankar Sastry
Computer Vision: Algorithms and Applications, by Richard Szeliski

Suggested material:

Vision Algorithms for Mobile Robotics by Davide Scaramuzza
CS 682 Computer Vision by Jana Kosecka

Moreover, you may want to have a look at the OpenCV guide or tutorials.

TODO

keyframe generation and management
proper local map generation and management
loop closure

How to install OpenCV under Ubuntu

In order to use non-free module (link) under Ubuntu 16.04, you can run
$ pip install opencv-contrib-python==3.4.0.12

For a more advanced installation procedure, take a look here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pySLAM

Requirements

Usage

Datasets

KITTI Datasets

TUM Datasets

Camera Settings

References

TODO

How to install OpenCV under Ubuntu

About

Releases

Packages

Languages

License

kareotoko/pyslam

Folders and files

Latest commit

History

Repository files navigation

pySLAM

Requirements

Usage

Datasets

KITTI Datasets

TUM Datasets

Camera Settings

References

TODO

How to install OpenCV under Ubuntu

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages