YoctoHan

Follow

❤️

YoctoHan YoctoHan

❤️

Follow

3 followers · 3 following

Beijing
18:50 (UTC +08:00)

Block or Report

Block or report YoctoHan

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

FasterTransformer FasterTransformer Public

Forked from NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

C++
lmdeploy lmdeploy Public

Forked from InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

C++
aix_infer_trt aix_infer_trt Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++