Documentation - API Reference - Changelog - Bug reports - Discord
⚠️ Cortex.cpp is currently under development. This documentation outlines the intended behavior of Cortex, which may not yet be fully implemented in the codebase.
Cortex.cpp is a local AI engine for running and customizing LLMs. It can be deployed as a standalone server or integrated into apps like Jan.ai.
Cortex.cpp is multi-engine: it uses llama.cpp (`:gguf`) as the default engine and also supports ONNXRuntime (`:onnx`) and TensorRT (`:tensorrt`), as listed in the supported-models table below.
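The engine for a given model is selected with the `model_id:engine_name` suffix used throughout this README. For example, a model available on multiple engines (see the supported-models table below) can be run on either:

```sh
cortex run llama3:gguf   # default llama.cpp engine
cortex run llama3:onnx   # same model on the ONNXRuntime engine
```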
To install Cortex.cpp, download the installer for your operating system from the following options:
| Version Type | Windows | MacOS (Intel) | MacOS (M1/M2/M3/M4) | Linux (Debian) | Linux (Fedora) |
|---|---|---|---|---|---|
| Stable (Recommended) | Download | Download | Download | Download | Download |
Note: You can also build Cortex.cpp from source by following the steps here.
```sh
# 1. Start the Cortex.cpp server (the server runs at localhost:3928)
cortex

# 2. Start a model
cortex run <model_id>:[engine_name]

# 3. Stop a model
cortex stop <model_id>:[engine_name]

# 4. Stop the Cortex.cpp server
cortex stop
```
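For example, a minimal session with the `mistral` model from the supported-models table below might look like this:

```sh
cortex                # 1. start the Cortex.cpp server
cortex run mistral    # 2. start the mistral model
cortex stop mistral   # 3. stop the model
cortex stop           # 4. stop the server
```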
- Start the API server with the `cortex` command.
- Pull a Model:

```sh
curl --request POST \
  --url http://localhost:3928/v1/models/{model_id}/pull
```
- Start a Model:

```sh
curl --request POST \
  --url http://localhost:3928/v1/models/{model_id}/start \
  --header 'Content-Type: application/json' \
  --data '{
    "prompt_template": "system\n{system_message}\nuser\n{prompt}\nassistant",
    "stop": [],
    "ngl": 4096,
    "ctx_len": 4096,
    "cpu_threads": 10,
    "n_batch": 2048,
    "caching_enabled": true,
    "grp_attn_n": 1,
    "grp_attn_w": 512,
    "mlock": false,
    "flash_attn": true,
    "cache_type": "f16",
    "use_mmap": true,
    "engine": "llamacpp"
  }'
```
- Chat with a Model:

```sh
curl http://localhost:3928/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mistral",
    "messages": [
      {
        "role": "user",
        "content": "Hello"
      }
    ],
    "stream": true,
    "max_tokens": 1,
    "stop": null,
    "frequency_penalty": 1,
    "presence_penalty": 1,
    "temperature": 1,
    "top_p": 1
  }'
```
- Stop a Model:

```sh
curl --request POST \
  --url http://localhost:3928/v1/models/mistral/stop
```
- Stop the Cortex.cpp server with the `cortex stop` command.
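The start request above lists the full set of engine tunables for illustration, but most fields are optional. A minimal start call can therefore be much shorter; the sketch below assumes omitted fields fall back to the model's defaults and uses `mistral` as a placeholder model id:

```sh
curl --request POST \
  --url http://localhost:3928/v1/models/mistral/start \
  --header 'Content-Type: application/json' \
  --data '{"engine": "llamacpp"}'
```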
Note: Our API server is fully compatible with the OpenAI API, making it easy to integrate with any systems or tools that support OpenAI-compatible APIs.
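In practice, that means existing OpenAI clients usually only need their base URL changed. As a sketch (assuming your client honors the `OPENAI_BASE_URL` environment variable, as recent OpenAI SDKs do, and that the local server does not validate the API key):

```sh
export OPENAI_BASE_URL="http://localhost:3928/v1"
export OPENAI_API_KEY="sk-local-placeholder"  # assumed: any value works locally
```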
Cortex.cpp supports various models available on the Cortex Hub. Once downloaded, all model source files are stored at `C:\Users\<username>\AppData\Local\cortexcpp\models` (on Windows).
Here are examples of models you can use with each supported engine:
| Model | llama.cpp `:gguf` | TensorRT `:tensorrt` | ONNXRuntime `:onnx` | Command |
|---|---|---|---|---|
| llama3.1 | ✅ | | ✅ | cortex run llama3.1:gguf |
| llama3 | ✅ | ✅ | ✅ | cortex run llama3 |
| mistral | ✅ | ✅ | ✅ | cortex run mistral |
| qwen2 | ✅ | | | cortex run qwen2:7b-gguf |
| codestral | ✅ | | | cortex run codestral:22b-gguf |
| command-r | ✅ | | | cortex run command-r:35b-gguf |
| gemma | ✅ | | ✅ | cortex run gemma |
| mixtral | ✅ | | | cortex run mixtral:7x8b-gguf |
| openhermes-2.5 | ✅ | ✅ | ✅ | cortex run openhermes-2.5 |
| phi3 (medium) | ✅ | | ✅ | cortex run phi3:medium |
| phi3 (mini) | ✅ | | ✅ | cortex run phi3:mini |
| tinyllama | ✅ | | | cortex run tinyllama:1b-gguf |
Note: You should have at least 8 GB of RAM available to run the 7B models, 16 GB to run the 14B models, and 32 GB to run the 32B models.
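On a machine near the 8 GB lower bound, a small model from the table above is a sensible first test:

```sh
# ~1B parameters; comfortably within the 8 GB guideline above
cortex run tinyllama:1b-gguf
```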
For complete details on CLI commands, please refer to our CLI documentation.
Cortex.cpp includes a REST API accessible at `localhost:3928`. For a complete list of endpoints and their usage, visit our API documentation.
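As a quick smoke test, you can ask the server which models it knows about. This sketch assumes the OpenAI-style `/v1/models` listing endpoint implied by the compatibility note above:

```sh
curl http://localhost:3928/v1/models
```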
To uninstall Cortex.cpp on Windows:
- Navigate to Add or Remove Programs.
- Search for Cortex.cpp.
- Click Uninstall.
- Delete the Cortex.cpp data folder located in your home folder.
To uninstall Cortex.cpp on macOS, run the uninstaller script:

```sh
sudo sh cortex-uninstall.sh
```

Note: The script requires sudo permission.
To uninstall Cortex.cpp on Linux (Debian-based distributions):

```sh
sudo apt remove cortexcpp
```
We also provide Beta and Nightly versions.
| Version Type | Windows | MacOS (Intel) | MacOS (M1/M2/M3/M4) | Linux (.deb) | Linux (AppImage) |
|---|---|---|---|---|---|
| Beta Build | cortexcpp.exe | Download | Download | cortexcpp.deb | cortexcpp.AppImage |
| Nightly Build | cortexcpp.exe | Download | Download | cortexcpp.deb | cortexcpp.AppImage |
To build from source on Windows:

- Clone the Cortex.cpp repository here.
- Navigate to the `engine/vcpkg` folder.
- Configure vcpkg:

```sh
cd vcpkg
./bootstrap-vcpkg.bat
vcpkg install
```

- Build Cortex.cpp inside a `build` folder:

```sh
mkdir build
cd build
cmake .. -DBUILD_SHARED_LIBS=OFF -DCMAKE_TOOLCHAIN_FILE=path_to_vcpkg_folder/vcpkg/scripts/buildsystems/vcpkg.cmake -DVCPKG_TARGET_TRIPLET=x64-windows-static
```

- Use Visual Studio with the C++ development workload to build the project using the files generated in the `build` folder.
- Verify that Cortex.cpp is installed correctly by getting the help information:

```sh
# Get the help information
cortex -h
```
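As an alternative to opening the Visual Studio GUI in the step above, the generated project can usually be driven from the command line through CMake itself (a sketch, assuming a standard MSBuild generator):

```sh
# Build the Release configuration from the files generated in the build folder
cmake --build . --config Release
```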
To build from source on macOS:

- Clone the Cortex.cpp repository here.
- Navigate to the `engine/vcpkg` folder.
- Configure vcpkg:

```sh
cd vcpkg
./bootstrap-vcpkg.sh
vcpkg install
```

- Build Cortex.cpp inside a `build` folder:

```sh
mkdir build
cd build
cmake .. -DCMAKE_TOOLCHAIN_FILE=path_to_vcpkg_folder/vcpkg/scripts/buildsystems/vcpkg.cmake
make -j4
```

- Verify that Cortex.cpp is installed correctly by getting the help information:

```sh
# Get the help information
cortex -h
```
To build from source on Linux:

- Clone the Cortex.cpp repository here.
- Navigate to the `engine/vcpkg` folder.
- Configure vcpkg:

```sh
cd vcpkg
./bootstrap-vcpkg.sh
vcpkg install
```

- Build Cortex.cpp inside a `build` folder:

```sh
mkdir build
cd build
cmake .. -DCMAKE_TOOLCHAIN_FILE=path_to_vcpkg_folder/vcpkg/scripts/buildsystems/vcpkg.cmake
make -j4
```

- Verify that Cortex.cpp is installed correctly by getting the help information:

```sh
# Get the help information
cortex -h
```
- For support, please file a GitHub ticket.
- For questions, join our Discord here.
- For long-form inquiries, please email hello@jan.ai.