Organizations

@krylov-ml @JanusGraph

Chronon is a data platform for serving AI/ML applications.

Scala 709 43 Updated Sep 20, 2024

Mamba SSM architecture

Python 12,602 1,059 Updated Aug 15, 2024

🚀 Build and manage real-life ML, AI, and data science projects with ease!

Python 8,060 753 Updated Sep 20, 2024

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,040 143 Updated Aug 1, 2024

Probabilistic Machine Learning: Advanced Topics

1,389 119 Updated Jun 27, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 9,915 817 Updated Jun 10, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,341 463 Updated Aug 19, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,624 1,651 Updated Sep 11, 2024

A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)

9,299 706 Updated May 31, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,675 1,514 Updated Sep 19, 2024

Awesome-LLM: a curated list of Large Language Models

17,475 1,416 Updated Sep 19, 2024

Large Language Model Text Generation Inference

Python 8,788 1,024 Updated Sep 20, 2024

Distributed PyTorch implementation of multi-headed graph convolutional neural networks

Python 60 24 Updated Sep 20, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,604 430 Updated Jun 22, 2024

A Generic Low-Code Framework Built on a Config-Driven Tree Walker

C# 280 47 Updated Jan 16, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,344 374 Updated Jul 16, 2023

Development repository for the Triton language and compiler

C++ 12,772 1,540 Updated Sep 20, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,265 5,487 Updated Aug 14, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,801 4,052 Updated Sep 18, 2024

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 13,117 1,158 Updated Jul 29, 2024

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

C++ 3,378 448 Updated Sep 17, 2024

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,166 246 Updated Jan 20, 2024

AI + Data, online. https://vespa.ai

Java 5,631 589 Updated Sep 20, 2024

Rust port of dendibakh/perf-ninja - an online course where you can learn and master the skill of low-level performance analysis and tuning.

Rust 188 11 Updated Aug 24, 2024

The serverless framework purpose-built for event streaming applications.

Go 210 27 Updated Sep 18, 2024

Parallel computing with task scheduling

Python 12,420 1,698 Updated Sep 20, 2024

flink-jpmml is a new library for dynamic real-time machine learning predictions, built on top of PMML standard models and the Apache Flink streaming engine

Scala 96 30 Updated May 9, 2019

Apache Pulsar - distributed pub-sub messaging system

Java 14,132 3,566 Updated Sep 20, 2024

A cloud-native vector database, storage for next generation AI applications

Go 29,520 2,827 Updated Sep 20, 2024

Apache DataFusion SQL Query Engine

Rust 5,930 1,123 Updated Sep 19, 2024