LanguidoMensdelson

LanguidoMensdelson

Stars

NSTiwari / Llama3-on-Mobile

This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on-device inference.

Makefile 52 4 Updated May 25, 2024

bernardo-bruning / ollama-copilot

Proxy that allows you to use ollama as a copilot like Github copilot

Go 305 22 Updated Sep 5, 2024

lmg-anon / mikupad

LLM Frontend in a single html file

HTML 234 26 Updated Oct 6, 2024

NVIDIA / ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

TypeScript 2,692 319 Updated Aug 21, 2024

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 18,823 1,533 Updated Oct 5, 2024

LostRuins / koboldcpp

Forked from ggerganov/llama.cpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

C++ 4,998 350 Updated Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly