⚡ Dynamically generated stats for your GitHub READMEs

JavaScript 66,688 21,665 Updated Jul 15, 2024

Easy and Efficient Quantization for Transformers

C++ 159 13 Updated Jul 15, 2024

Easy and Efficient Transformer: Scalable Inference Solution for Large NLP Models

Python 257 46 Updated Jul 8, 2024

LLM inference in C/C++

C++ 61,669 8,821 Updated Jul 15, 2024

The Triton TensorRT-LLM Backend

Python 595 84 Updated Jul 9, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,472 811 Updated Jul 12, 2024

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 758 121 Updated Jul 29, 2023

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 3,343 299 Updated Jul 15, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 129,296 25,631 Updated Jul 15, 2024

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,264 182 Updated Apr 29, 2021

Ongoing research training transformer models at scale

Python 9,391 2,114 Updated Jul 12, 2024

A hands-on, step-by-step Huggingface Transformers course; the course videos are updated in sync on Bilibili and YouTube.

Jupyter Notebook 1,514 224 Updated Jul 14, 2024

https://hrl.boyuai.com/

Jupyter Notebook 2,146 493 Updated Nov 22, 2022

Methods for speeding up work

Dockerfile 3 Updated Oct 11, 2022

Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.

7,933 872 Updated Jul 11, 2024

Original book code, exercise answers, and personal notes for C++ Primer Plus, 6th Edition (Chinese edition); for study and discussion only.

C++ 1,995 449 Updated Oct 7, 2023

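A summary of interview experiences and standard interview questions for C++ backend server development, with both depth and breadth, unlike summary articles that only list concepts.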

1,276 201 Updated Jul 11, 2024

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 6,160 1,468 Updated Jul 15, 2024

Python3 implementation of oddball

Python 35 3 Updated Jun 17, 2024

Code for Deep Anomaly Detection on Attributed Networks (SDM2019)

Python 128 23 Updated Sep 13, 2021

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 124,319 23,132 Updated Jul 8, 2024

A PyTorch implementation of Faster R-CNN that can be trained on data in the VOC dataset format.

Python 1,523 351 Updated Oct 3, 2023

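Knowledge, code, and approaches for competitions in data mining, computer vision, natural language processing, and recommender systems.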

Jupyter Notebook 4,120 1,050 Updated Jul 4, 2024
Python 9 5 Updated Oct 14, 2016