# LocalAI model gallery
The model gallery is a curated collection of models created by the community and tested with LocalAI.
We encourage contributions to the gallery! However, please note that if you are submitting a pull request (PR), we cannot accept PRs that include URLs to models based on LLaMA or models with licenses that do not allow redistribution. Nevertheless, you can submit a PR with the configuration file without including the downloadable URL.
To load a model from main onto localhost:

```bash
bash ./load.sh wizard
```
For how to use the files in this repository, see the Documentation.

The model configuration files support the following fields:
- `name`: Name of the model
- `parameters`: Prediction parameters
  - `top_p`: Top P value
  - `top_k`: Top K value
  - `maxtokens`: Maximum tokens
  - `temperature`: Temperature
- `model`: Model file
- `f16`: Use F16 format (true/false)
- `threads`: Number of threads
- `debug`: Debug mode (true/false)
- `roles`: Map of roles
- `embeddings`: Use embeddings (true/false)
- `backend`: Backend name
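Put together, the top-level fields above might look like the following in a model configuration file. This is only a sketch: the model name, file names, and parameter values are illustrative placeholders, not a tested configuration.

```yaml
# Hypothetical model config sketch; all values are illustrative.
name: wizard
backend: llama
model: wizard.bin        # model file
f16: true                # use F16 format
threads: 4
debug: false
embeddings: false
roles:                   # map of roles
  user: "USER:"
  assistant: "ASSISTANT:"
parameters:              # prediction parameters
  temperature: 0.2
  top_p: 0.7
  top_k: 80
  maxtokens: 512
```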
- `template`
  - `chat`: Chat template
  - `chat_message`: Chat message template
  - `completion`: Completion template
  - `edit`: Edit template
  - `function`: Function template
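A `template` block referencing the templates above might be sketched like this; the template names are hypothetical placeholders for template files shipped alongside the config.

```yaml
# Illustrative template block; the names are placeholders
# for template files provided with the model.
template:
  chat: wizard-chat
  chat_message: wizard-chat-message
  completion: wizard-completion
  edit: wizard-edit
  function: wizard-function
```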
- `function`
  - `disable_no_action`: Disable no action (true/false)
  - `no_action_function_name`: No action function name
  - `no_action_description_name`: No action description name
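A `function` block using these fields might be sketched as follows; the function and description names are invented for illustration.

```yaml
# Illustrative function block; names are hypothetical.
function:
  disable_no_action: false
  no_action_function_name: answer
  no_action_description_name: answer_description
```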
- `feature_flags`: Map of feature flags
- `llm`
  - `system_prompt`: System prompt
  - `tensor_split`: Tensor split
  - `main_gpu`: Main GPU
  - `rms_norm_eps`: RMS Norm Epsilon
  - `ngqa`: NGQA
  - `prompt_cache_path`: Prompt cache path
  - `prompt_cache_all`: Prompt cache all (true/false)
  - `prompt_cache_ro`: Prompt cache read-only (true/false)
  - `mirostat_eta`: Mirostat ETA
  - `mirostat_tau`: Mirostat TAU
  - `mirostat`: Mirostat
  - `gpu_layers`: GPU layers
  - `mmap`: Use MMAP (true/false)
  - `mmlock`: Use MMLock (true/false)
  - `low_vram`: Low VRAM mode (true/false)
  - `grammar`: Grammar
  - `stopwords`: List of stopwords
  - `cutstrings`: List of cutstrings
  - `trimspace`: List of trimspace
  - `context_size`: Context size
  - `numa`: Use NUMA (true/false)
  - `lora_adapter`: Lora adapter
  - `lora_base`: Lora base
  - `no_mulmatq`: No MulMatQ (true/false)
  - `draft_model`: Draft model
  - `n_draft`: N Draft
  - `quantization`: Quantization
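A few of these LLM options might be sketched like this. The values are placeholders, and the sketch assumes the options sit at the top level of the model config alongside `name` and `backend`; check the Documentation for the exact placement.

```yaml
# Illustrative LLM option sketch; values are placeholders.
context_size: 2048
gpu_layers: 35       # number of layers to offload to the GPU
mmap: true
low_vram: false
mirostat: 2
mirostat_eta: 0.1
mirostat_tau: 5.0
stopwords:
  - "HUMAN:"
  - "### Response:"
```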
- `autogptq`
  - `model_base_name`: Model base name
  - `device`: Device
  - `triton`: Use Triton (true/false)
  - `use_fast_tokenizer`: Use fast tokenizer (true/false)
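An `autogptq` block might be sketched as follows; the base name and device string are illustrative placeholders.

```yaml
# Illustrative autogptq block; values are hypothetical.
autogptq:
  model_base_name: model
  device: "cuda:0"
  triton: false
  use_fast_tokenizer: true
```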
- `diffusers`
  - `pipeline_type`: Pipeline type
  - `scheduler_type`: Scheduler type
  - `cuda`: Use CUDA (true/false)
  - `enable_parameters`: Enable parameters
  - `cfg_scale`: CFG Scale
  - `img2img`: Image to Image Diffuser (true/false)
  - `clip_skip`: Clip skip
  - `clip_model`: Clip model
  - `clip_subfolder`: Clip subfolder
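A `diffusers` block might be sketched like this; the pipeline and scheduler names are examples only, and the values are placeholders.

```yaml
# Illustrative diffusers block; pipeline/scheduler names and
# values are examples, not a tested configuration.
diffusers:
  pipeline_type: StableDiffusionPipeline
  scheduler_type: euler_a
  cuda: true
  cfg_scale: 7
  clip_skip: 1
```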
- `grpc`
  - `attempts`: Attempts
  - `attempts_sleep_time`: Attempts sleep time
- `vall-e`
  - `audio_path`: Audio path
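Finally, the `grpc` and `vall-e` blocks might be sketched together as follows; the retry counts and the audio path are hypothetical placeholders.

```yaml
# Illustrative sketch; values and the path are placeholders.
grpc:
  attempts: 3            # retry attempts
  attempts_sleep_time: 2 # seconds between attempts
vall-e:
  audio_path: /models/audio   # hypothetical path to reference audio
```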