Name		Name	Last commit message	Last commit date
parent directory ..
scripts		scripts
src		src
CMakeLists.txt		CMakeLists.txt
README.md		README.md
cpp_random.bmp		cpp_random.bmp
lora_cpp_random.bmp		lora_cpp_random.bmp
python_random.bmp		python_random.bmp

README.md

OpenVINO Latent Consistency Model C++ image generation pipeline

The pure C++ text-to-image pipeline, driven by the OpenVINO native API for SD v1.5 Latent Consistency Model with LCM Scheduler. It includes advanced features like LoRA integration with safetensors and OpenVINO Tokenizers. Loading openvino_tokenizers to ov::Core enables tokenization. The common folder contains schedulers for image generation and imwrite() for saving bmp images. This demo has been tested for Linux platform only. There is also a Jupyter notebook which provides an example of image generaztion in Python.

Note

This tutorial assumes that the current working directory is <openvino.genai repo>/image_generation/lcm_dreamshaper_v7/cpp/ and all paths are relative to this folder.

Step 1: Prepare build environment

C++ Packages:

CMake: Cross-platform build tool
OpenVINO: Model inference

Prepare a python environment and install dependencies:

conda create -n openvino_lcm_cpp python==3.10
conda activate openvino_lcm_cpp
conda install -c conda-forge openvino c-compiler cxx-compiler make

Step 2: Latent Consistency Model and Tokenizer models

Latent Consistency Model model

Install dependencies to import models from HuggingFace:

git submodule update --init
conda activate openvino_lcm_cpp
python -m pip install -r scripts/requirements.txt
python -m pip install ../../../thirdparty/openvino_tokenizers/[transformers]

Run model conversion script to download and convert PyTorch model to OpenVINO IR via optimum-intel. Please, use the script scripts/convert_model.py to convert the model:
```
cd scripts
python convert_model.py -lcm "SimianLuo/LCM_Dreamshaper_v7" -t FP16
```

Note

Only static model is currently supported for this sample.

LoRA enabling with safetensors

Refer to python pipeline blog. The safetensor model is loaded via safetensors.h. The layer name and weight are modified with Eigen Lib and inserted into the LCM model with ov::pass::MatcherPass in the file common/diffusers/src/lora.cpp.

LCM model lcm_dreamshaper_v7 and Lora soulcard are tested in this pipeline.

Download and put safetensors and model IR into the models folder.

Step 3: Build the LCM application

conda activate openvino_lcm_cpp
cmake -DCMAKE_BUILD_TYPE=Release -S . -B build
cmake --build build --config Release --parallel

Step 4: Run Pipeline

./build/lcm_dreamshaper [-p <posPrompt>] [-s <seed>] [--height <output image>] [--width <output image>] [-d <device>] [-r <readNPLatent>] [-a <alpha>] [-h <help>] [-m <modelPath>] [-t <modelType>]

Usage:
  lcm_dreamshaper [OPTION...]

-p, --posPrompt arg Initial positive prompt for SD (default: cyberpunk cityscape like Tokyo New York with tall buildings at dusk golden hour cinematic lighting)
-d, --device arg AUTO, CPU, or GPU. Doesn't apply to Tokenizer model, OpenVINO Tokenizers can be inferred on a CPU device only (default: CPU)
--step arg Number of diffusion step ( default: 20)
-s, --seed arg Number of random seed to generate latent (default: 42)
--num arg Number of image output(default: 1)
--height arg Height of output image (default: 512)
--width arg Width of output image (default: 512)
-c, --useCache Use model caching
-r, --readNPLatent Read numpy generated latents from file
-m, --modelPath arg Specify path of SD model IR (default: ../scripts/SimianLuo/LCM_Dreamshaper_v7)
-t, --type arg Specify the type of SD model IR (FP16_static or FP16_dyn) (default: FP16_static)
-l, --loraPath arg Specify path of lora file. (*.safetensors). (default: )
-a, --alpha arg alpha for lora (default: 0.75)
-h, --help Print usage

Note

The tokenizer model will always be loaded to CPU: OpenVINO Tokenizers can be inferred on a CPU device only.

Example:

Positive prompt: a beautiful pink unicorn

Read the numpy latent input and noise for scheduler instead of C++ std lib for the alignment with Python pipeline.

Generate image with random data generated by Python ./build/lcm_dreamshaper -r

Generate image with C++ lib generated latent and noise : ./build/lcm_dreamshaper

Generate image with soulcard lora and C++ generated latent and noise ./stable_diffusion -r -l path/to/soulcard.safetensors

Benchmark:

For the generation quality, C++ random generation with MT19937 results is differ from numpy.random.randn() and diffusers.utils.randn_tensor. Hence, please use -r, --readNPLatent for the alignment with Python (this latent file is for output image 512X512 only)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cpp

cpp

README.md

OpenVINO Latent Consistency Model C++ image generation pipeline

Step 1: Prepare build environment

Step 2: Latent Consistency Model and Tokenizer models

Latent Consistency Model model

LoRA enabling with safetensors

Step 3: Build the LCM application

Step 4: Run Pipeline

Benchmark:

Files

cpp

Directory actions

More options

Directory actions

More options

Latest commit

History

cpp

Folders and files

parent directory

README.md

OpenVINO Latent Consistency Model C++ image generation pipeline

Step 1: Prepare build environment

Step 2: Latent Consistency Model and Tokenizer models

Latent Consistency Model model

LoRA enabling with safetensors

Step 3: Build the LCM application

Step 4: Run Pipeline

Benchmark: