skykongkong8

Organizations

@kucc @nnstreamer @Guerilla-Coders

61 results for source starred repositories

Main gperftools repository

C++ 8,404 1,500 Updated Oct 7, 2024

Tensor library for machine learning

C++ 10,964 1,008 Updated Oct 6, 2024

LLM inference in C/C++

C++ 65,936 9,472 Updated Oct 7, 2024

An open-source RAG-based tool for chatting with your documents.

Python 13,537 1,016 Updated Oct 6, 2024

Jupyter Notebook 21 21 Updated Oct 7, 2024

The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for float…

Nim 274 15 Updated Jan 4, 2024

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ 298 115 Updated Oct 8, 2024

Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms.

C++ 568 127 Updated Oct 18, 2023

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 6,323 1,486 Updated Oct 7, 2024

TinyChatEngine: On-Device LLM Inference Library

C++ 720 68 Updated Jul 4, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,348 939 Updated Oct 1, 2024

LLM training in simple, raw C/CUDA

Cuda 23,793 2,661 Updated Oct 2, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 62,797 32,138 Updated Oct 7, 2024

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 637 125 Updated Sep 15, 2024

row-major matmul optimization

C++ 588 79 Updated Sep 9, 2023

Accessible large language models via k-bit quantization for PyTorch.

Python 6,140 616 Updated Oct 7, 2024

Low-precision matrix multiplication

C++ 1,774 451 Updated Jan 29, 2024

Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload.

C 113 41 Updated Jan 13, 2024

High-efficiency floating-point neural network inference operators for mobile, server, and Web

C 1,834 348 Updated Oct 8, 2024

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

C++ 2,816 774 Updated Sep 27, 2024

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,177 486 Updated Oct 8, 2024

tutorial to optimize GEMM performance on android

C++ 51 8 Updated Feb 17, 2016

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 2,980 552 Updated May 15, 2024

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 3,813 660 Updated Jun 22, 2024

an extended version of Julien Pommier's sse_mathfun

C 38 7 Updated Aug 16, 2019

A header only library implementing common mathematical functions using SIMD intrinsics

C 90 21 Updated Sep 24, 2024

Automatically exported from code.google.com/p/math-neon

C 38 17 Updated Apr 20, 2015

An open optimized software library project for the ARM® Architecture

C 1,460 407 Updated Dec 9, 2022

Official PyTorch implementation for Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior [AAAI 2024]

Python 15 1 Updated Apr 30, 2024