Skip to content
Change the repository type filter

All

    Repositories list

    • LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
      Python
      4000Updated Sep 11, 2024Sep 11, 2024
    • mamba-2

      Public
      Mamba SSM architecture
      Python
      Apache License 2.0
      1.1k000Updated Jul 29, 2024Jul 29, 2024
    • MambaByte

      Public
      Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
      Python
      6000Updated Jul 18, 2024Jul 18, 2024
    • Capibara SSM is a new open-source fundation model
      GNU General Public License v3.0
      0070Updated Jun 13, 2024Jun 13, 2024
    • cookbook

      Public
      A collection of guides and examples for the Gemini API.
      Jupyter Notebook
      Apache License 2.0
      702000Updated Jun 13, 2024Jun 13, 2024
    • xlstm

      Public
      Official repository of the xLSTM.
      Python
      GNU Affero General Public License v3.0
      92000Updated Jun 10, 2024Jun 10, 2024
    • Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
      Jsonnet
      Apache License 2.0
      59000Updated Jun 6, 2024Jun 6, 2024
    • Adala

      Public
      Adala: Autonomous DAta (Labeling) Agent framework
      Python
      Apache License 2.0
      74000Updated May 31, 2024May 31, 2024
    • IRSRMamba

      Public
      Official PyTorch implementation of the paper IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model.
      Python
      Apache License 2.0
      5000Updated May 17, 2024May 17, 2024
    • BitNet

      Public
      Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
      Python
      MIT License
      143000Updated May 16, 2024May 16, 2024
    • High-quality datasets, tools, and concepts for LLM fine-tuning.
      166000Updated Apr 28, 2024Apr 28, 2024
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.8k000Updated Apr 28, 2024Apr 28, 2024
    • maxtext

      Public
      A simple, performant and scalable Jax LLM!
      Python
      Apache License 2.0
      278000Updated Apr 28, 2024Apr 28, 2024
    • paxml

      Public
      Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
      Python
      Apache License 2.0
      68000Updated Apr 26, 2024Apr 26, 2024
    • synthea

      Public
      Synthetic Patient Population Simulator
      Java
      Apache License 2.0
      644000Updated Apr 24, 2024Apr 24, 2024
    • tpu

      Public
      Reference models and tools for Cloud TPUs.
      Jupyter Notebook
      Apache License 2.0
      1.8k000Updated Apr 19, 2024Apr 19, 2024
    • megalodon

      Public
      Reference implementation of Megalodon 7B model
      Cuda
      MIT License
      52000Updated Apr 18, 2024Apr 18, 2024
    • Profile README
      MIT License
      2000Updated Apr 12, 2024Apr 12, 2024
    • The official Python library for the OpenAI API
      Python
      Apache License 2.0
      3.1k000Updated Apr 12, 2024Apr 12, 2024
    • FastChat

      Public
      The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
      Python
      Apache License 2.0
      4.5k000Updated Apr 12, 2024Apr 12, 2024
    • Qwen1.5

      Public
      Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
      Shell
      539000Updated Apr 12, 2024Apr 12, 2024
    • Source to source compiler for PyTorch. It makes PyTorch programs faster on single accelerators and distributed.
      Python
      Apache License 2.0
      77000Updated Apr 12, 2024Apr 12, 2024
    • litgpt

      Public
      Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
      Python
      Apache License 2.0
      1k000Updated Apr 12, 2024Apr 12, 2024
    • Label Studio is a multi-type data labeling and annotation tool with standardized output format
      JavaScript
      Apache License 2.0
      2.3k000Updated Apr 12, 2024Apr 12, 2024
    • Various projects using Large Language Model (GPT & LLAMA) other open source model from HuggingFace and OpenAI. OpenAI API required for running various model
      Jupyter Notebook
      The Unlicense
      25000Updated Apr 12, 2024Apr 12, 2024
    • Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
      Python
      185000Updated Apr 12, 2024Apr 12, 2024
    • mamba2

      Public
      Python
      Apache License 2.0
      1.1k000Updated Apr 12, 2024Apr 12, 2024
    • A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
      Python
      MIT License
      23000Updated Apr 12, 2024Apr 12, 2024
    • mamba

      Public
      Python
      Apache License 2.0
      1.1k000Updated Apr 12, 2024Apr 12, 2024
    • GitHub Action to set up micromamba
      TypeScript
      MIT License
      15000Updated Apr 12, 2024Apr 12, 2024