Installation

Michael Chan edited this page Aug 30, 2019 · 10 revisions

1. Install Docker

If you have not installed Docker yet, follow the official documentation (https://docs.docker.com/install/linux/docker-ce/ubuntu/#set-up-the-repository). The commands below install Docker on Ubuntu.

sudo apt-get update
sudo apt-get install \
    apt-transport-https \
    ca-certificates \
    curl \
    software-properties-common
curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -
sudo apt-key fingerprint 0EBFCD88
sudo add-apt-repository \
   "deb [arch=amd64] https://download.docker.com/linux/ubuntu \
   $(lsb_release -cs) \
   stable"
sudo apt-get update
sudo apt-get -y install docker-ce
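
Before moving on, it can help to confirm that the `docker` command is actually available. The following is a minimal sketch; the helper name `have_cmd` is hypothetical and not part of the datalab repository.

```shell
# Return success if the named command is found on PATH.
# (have_cmd is an illustrative helper, not part of datalab.)
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

if have_cmd docker; then
    # Prints the installed client version; does not need the daemon.
    docker --version
else
    echo "docker not found on PATH; re-check the installation steps" >&2
fi
```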

2. Clone the datalab GitHub repository

git clone --recurse-submodules https://github.com/michaelchanwahyan/datalab.git
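
If the clone was made without its submodules, the build may be missing files. As a rough sketch, the output of `git submodule status` can be filtered for entries that are not yet initialized (git prefixes those lines with `-`). The helper name `uninitialized_submodules` is hypothetical, and the sketch assumes submodule paths contain no spaces.

```shell
# Read `git submodule status` output on stdin and print the paths of
# submodules that git marks as uninitialized (lines starting with "-").
# (uninitialized_submodules is an illustrative helper, not part of datalab.)
uninitialized_submodules() {
    awk '/^-/ { print $2 }'
}

# Example usage from the directory that contains the clone:
#   git -C datalab submodule status | uninitialized_submodules
# If any paths are printed, they can be fetched with:
#   git -C datalab submodule update --init --recursive
```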

3. Build the image

Building the Docker image may take more than 20 minutes. Once the build finishes, all the essential components are installed.

cd datalab
sudo bash buildimg.sh

The Docker image should now be available. You can verify this with

sudo docker images

4. Run the Docker container

The commands for starting the container are saved in rundocker.sh, which you can run directly.

# The container uses a Docker volume to communicate.
# Before starting the container, make sure /app exists and is writable by you, e.g.:
# sudo mkdir -p /app ; sudo chown -R $(whoami) /app ; sudo chmod u+rwx /app
sudo bash rundocker.sh
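
The precondition on /app mentioned in the comments above can be checked before starting the container. This is only a sketch: the helper name `dir_ready` is hypothetical, and it tests just that the directory exists and is writable by the current user.

```shell
# Return success if the given path is an existing directory that the
# current user can write to.
# (dir_ready is an illustrative helper, not part of datalab.)
dir_ready() {
    [ -d "$1" ] && [ -w "$1" ]
}

# /app is the host directory named in the comments above.
if dir_ready /app; then
    echo "/app is ready"
else
    echo "/app is missing or not writable; create it before running rundocker.sh" >&2
fi
```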

You can check the status via

sudo docker ps

JupyterLab and Airflow take around one minute to start. You can then reach JupyterLab on port 9999 and Airflow on port 9090.
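
Because the services need some time to come up, a small retry loop can save repeated manual checks. The sketch below defines a hypothetical helper, `wait_until`, that retries any command once per second until it succeeds or a timeout expires. The commented usage assumes that both services answer plain HTTP on localhost and that `curl` is installed.

```shell
# Retry a command once per second until it succeeds or the timeout
# (in seconds, first argument) is reached. Returns 0 on success, 1 on timeout.
# (wait_until is an illustrative helper, not part of datalab.)
wait_until() {
    timeout_s=$1
    shift
    elapsed=0
    until "$@" >/dev/null 2>&1; do
        if [ "$elapsed" -ge "$timeout_s" ]; then
            return 1
        fi
        sleep 1
        elapsed=$((elapsed + 1))
    done
    return 0
}

# Example usage (assumes HTTP on localhost and an installed curl):
#   wait_until 120 curl -fsS http://localhost:9999 && echo "JupyterLab is up"
#   wait_until 120 curl -fsS http://localhost:9090 && echo "Airflow is up"
```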
