Apache Software Foundation
CA
http://www.linkedin.com/in/henrysaputra
Stars
Chronon is a data platform for serving AI/ML applications.
🚀 Build and manage real-life ML, AI, and data science projects with ease!
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
QLoRA: Efficient Finetuning of Quantized LLMs
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Universal LLM Deployment Engine with ML Compilation
Awesome-LLM: a curated list of Large Language Models
Large Language Model Text Generation Inference
Distributed PyTorch implementation of multi-headed graph convolutional neural networks
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
A Generic Low-Code Framework Built on a Config-Driven Tree Walker
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Development repository for the Triton language and compiler
Code for the paper "Language Models are Unsupervised Multitask Learners"
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Rust port of dendibakh/perf-ninja - an online course where you can learn and master the skill of low-level performance analysis and tuning.
The serverless framework purpose-built for event streaming applications.
flink-jpmml is a new library for dynamic real-time machine learning predictions, built on top of PMML standard models and the Apache Flink streaming engine
Apache Pulsar - distributed pub-sub messaging system
A cloud-native vector database, storage for next generation AI applications