This project is an example of serving a deep learning model with batched prediction using Rust. In particular, it runs a GPT-2 model to generate text based on input context.
Features:

- Batched prediction using `batched-fn` when a GPU is detected.
- A back-pressure mechanism that returns a 503 status code if the server gets back-logged with too many requests.
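The batching idea can be sketched with nothing but the standard library. Note this is not the actual `batched-fn` API (which is macro-based); it is just an illustration of the pattern, with a hypothetical `predict_batch` standing in for the model: requests from many connections funnel into one queue, and a single worker drains the queue into batches.

```rust
use std::sync::mpsc;
use std::thread;

// Toy stand-in for the model: processes a whole batch in one call,
// which is where the GPU throughput win comes from.
fn predict_batch(batch: &[String]) -> Vec<String> {
    batch.iter().map(|s| format!("{} ...generated", s)).collect()
}

fn main() {
    // Requests are (input, reply channel) pairs flowing through one queue.
    let (tx, rx) = mpsc::channel::<(String, mpsc::Sender<String>)>();

    // A single worker thread drains whatever is queued into one batch,
    // runs the model once, and fans the results back out.
    let worker = thread::spawn(move || {
        while let Ok(first) = rx.recv() {
            let mut pending = vec![first];
            while let Ok(next) = rx.try_recv() {
                pending.push(next);
            }
            let inputs: Vec<String> =
                pending.iter().map(|(text, _)| text.clone()).collect();
            let outputs = predict_batch(&inputs);
            for ((_, reply), output) in pending.into_iter().zip(outputs) {
                let _ = reply.send(output);
            }
        }
    });

    // Two "concurrent" requests share the same queue.
    let (reply_a, result_a) = mpsc::channel();
    let (reply_b, result_b) = mpsc::channel();
    tx.send(("Hello, World!".to_string(), reply_a)).unwrap();
    tx.send(("Stay at home".to_string(), reply_b)).unwrap();
    println!("{}", result_a.recv().unwrap());
    println!("{}", result_b.recv().unwrap());

    drop(tx); // closing the queue lets the worker exit
    worker.join().unwrap();
}
```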
You'll need to download the model files for GPT-2 through the [rust-bert](https://github.com/guillaume-be/rust-bert) repository. This requires Python 3.

```bash
git clone https://github.com/guillaume-be/rust-bert && cd rust-bert
pip install -r requirements.txt
python utils/download-dependencies_gpt2.py
```
Also, in order for the server to make use of your GPU (if you have one available), you'll need to compile it against the right version of the LibTorch C++ library, which you can download from https://pytorch.org/get-started/locally/. After downloading, unzip the file.
Once you've downloaded the model files and LibTorch, clone this repo and run the server with:

```bash
make run LIBTORCH=/path/to/libtorch
```
Now, in a separate terminal, you can send several requests to the server at once:
```bash
curl -d '{"text":"Hello, World!"}' \
    -H "Content-Type: application/json" \
    http://localhost:3030/generate &
curl -d '{"text":"Stay at home"}' \
    -H "Content-Type: application/json" \
    http://localhost:3030/generate &
curl -d '{"text":"Wash your hands"}' \
    -H "Content-Type: application/json" \
    http://localhost:3030/generate &
curl -d '{"text":"Do not touch your face"}' \
    -H "Content-Type: application/json" \
    http://localhost:3030/generate &
```
The logs from the server should look something like this: