Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
TensorFlow code and pre-trained models for BERT
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Code and documentation to train Stanford's Alpaca models, and generate the data.
A high-throughput and memory-efficient inference and serving engine for LLMs
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
DSPy: The framework for programming—not prompting—foundation models
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Large Language Model Text Generation Inference
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Google AI 2018 BERT pytorch implementation
Example models using DeepSpeed
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Tools for merging pretrained large language models.
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
Aligning pretrained language models with instruction data generated by themselves.
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
General technology for enabling AI capabilities w/ LLMs and MLLMs
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
LLaMA: Open and Efficient Foundation Language Models
Code for the paper "Evaluating Large Language Models Trained on Code"
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …
Alpaca dataset from Stanford, cleaned and curated