Skip to content
View javey-q's full-sized avatar
Block or Report

Block or report javey-q

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

This is a framework to evaluate your stable diffusion model

Python 3 Updated Jul 18, 2024

Kolors Team

Python 2,757 158 Updated Jul 26, 2024

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…

Python 345 20 Updated Jul 26, 2024

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

C++ 442 33 Updated Mar 15, 2024

Faster generation with text-to-image diffusion models.

Python 173 9 Updated May 16, 2024

stable diffusion, controlnet, tensorrt, accelerate

Python 50 8 Updated Apr 28, 2023

TensorRT Extension for Stable Diffusion Web UI

Python 1,858 141 Updated Jun 14, 2024
Python 5 7 Updated Nov 25, 2023

The first open source triton inference engine for Stable Diffusion, specifically for sdxl

Python 11 1 Updated Nov 27, 2023

Deploy stable diffusion model with onnx/tenorrt + tritonserver

Jupyter Notebook 118 21 Updated Aug 15, 2023

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 712 32 Updated Jun 27, 2024

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,069 60 Updated Jul 16, 2024

Material for cuda-mode lectures

Jupyter Notebook 1,964 194 Updated Jun 13, 2024

OneDiff: An out-of-the-box acceleration library for diffusion models.

Python 1,488 89 Updated Jul 27, 2024

LLM101n: Let's build a Storyteller

25,812 1,372 Updated Jul 21, 2024

Sourcetrail - free and open-source interactive source explorer

C++ 14,431 1,340 Updated Dec 13, 2021

MindSpore online courses: Step into LLM

Jupyter Notebook 387 82 Updated Jun 14, 2024

Cloud mask with Landsat 8 and Sentinel 2.

Python 7 1 Updated May 29, 2022

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,697 395 Updated Jul 15, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,125 182 Updated Jul 25, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,785 168 Updated Jul 25, 2024
Python 549 50 Updated Jun 19, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,696 836 Updated Jul 28, 2024

Accepted by New Trends in Image Restoration and Enhancement workshop (NTIRE), in conjunction with CVPR 2024.

Jupyter Notebook 116 10 Updated Jul 15, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,436 887 Updated Jul 25, 2024

Mamba SSM architecture

Python 11,935 995 Updated Jul 24, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,461 522 Updated Jul 25, 2024
Next