Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,419 103 Updated Sep 20, 2024

ludwig-ai / ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Python 11,095 1,188 Updated Sep 11, 2024

WangRongsheng / Aurora

🐳 Aurora is a [Chinese Version] MoE model. Aurora is a further work based on Mixtral-8x7B, which activates the chat capability of the model's Chinese open domain.

Python 257 21 Updated May 9, 2024

predibase / llm_distillation_playbook

Best practices for distilling large language models.

Jupyter Notebook 373 29 Updated Feb 1, 2024

infiniflow / infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

C++ 2,438 266 Updated Sep 21, 2024

karpathy / ng-video-lecture

Python 3,434 891 Updated Jan 31, 2024

nlpxucan / WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,202 713 Updated Aug 5, 2024

activeloopai / deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…

Python 8,053 615 Updated Sep 21, 2024

modelscope / data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！

Python 2,565 162 Updated Sep 20, 2024

Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

JavaScript 22,912 2,327 Updated Sep 21, 2024

yizhongw / self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Python 4,071 482 Updated Mar 27, 2023

argilla-io / argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 3,808 357 Updated Sep 20, 2024

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 11,032 661 Updated Sep 19, 2024

itsnamgyu / reasoning-teacher

Official code for "Large Language Models Are Reasoning Teachers", ACL 2023

Jupyter Notebook 304 20 Updated Oct 6, 2023

Shark-NLP / self-adaptive-ICL

self-adaptive in-context learning

Python 42 5 Updated May 5, 2023

weaviate / Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

TypeScript 6,044 642 Updated Sep 17, 2024

RCGAI / SimplyRetrieve

Lightweight chat AI platform featuring custom knowledge, open-source LLMs, prompt-engineering, retrieval analysis. Highly customizable. For Retrieval-Centric & Retrieval-Augmented Generation.

Python 197 13 Updated Feb 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gingersna

Block or report Gingersna

Stars

qhjqhj00 / MemoRAG

unslothai / unsloth

enoch3712 / ExtractThinker

jina-ai / reader

HMUNACHI / cuda-repo

hamishivi / EasyLM

RLHFlow / RLHF-Reward-Modeling

lm-sys / arena-hard-auto

hmarkc / parallel-prompt-decoding

scutan90 / DeepLearning-500-questions

TIGER-AI-Lab / MAmmoTH2

Spico197 / Humback

seanzhang-zhichen / llama3-chinese

argilla-io / distilabel