Skip to content
View javey-q's full-sized avatar
Block or Report

Block or report javey-q

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

The first open source triton inference engine for Stable Diffusion, specifically for sdxl

Python 11 1 Updated Nov 27, 2023

Deploy stable diffusion model with onnx/tenorrt + tritonserver

Jupyter Notebook 118 20 Updated Aug 15, 2023

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 690 32 Updated Jun 27, 2024

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,047 59 Updated May 9, 2024

Material for cuda-mode lectures

Jupyter Notebook 1,728 165 Updated Jun 13, 2024

OneDiff: An out-of-the-box acceleration library for diffusion models.

Python 1,441 85 Updated Jul 10, 2024

LLM101n: Let's build a Storyteller

15,301 731 Updated Jun 28, 2024

Sourcetrail - free and open-source interactive source explorer

C++ 14,309 1,334 Updated Dec 13, 2021

MindSpore online courses: Step into LLM

Jupyter Notebook 381 82 Updated Jun 14, 2024

Cloud mask with Landsat 8 and Sentinel 2.

Python 7 1 Updated May 29, 2022

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,647 407 Updated Jul 1, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,074 178 Updated Jul 9, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,760 164 Updated Jul 10, 2024
Python 539 48 Updated Jun 19, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,432 802 Updated Jul 10, 2024

Accepted by New Trends in Image Restoration and Enhancement workshop (NTIRE), in conjunction with CVPR 2024.

Jupyter Notebook 100 10 Updated Jul 8, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,368 876 Updated Jul 10, 2024

Mamba SSM architecture

Python 11,603 950 Updated Jul 3, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,445 519 Updated Mar 12, 2024

This repo holds the competitions (information, solutions, summaries, memories) that our team has participated in

22 Updated Feb 4, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 2,728 206 Updated Jul 10, 2024

[TGRS 2024] DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images

Python 13 5 Updated Mar 6, 2024
Python 3 1 Updated May 7, 2024

Generative Models by Stability AI

Python 23,284 2,570 Updated Jul 9, 2024

LLM training in simple, raw C/CUDA

Cuda 21,521 2,335 Updated Jul 10, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,015 565 Updated Jul 9, 2024

展示boss直聘岗位的发布时间

JavaScript 785 27 Updated Jun 6, 2024
Next