Kaiyu Xie kaiyux

🎯

Focusing

Engineer @ NVIDIA

145 followers · 46 following

Beijing, China
UTC +08:00

Pinned

NVIDIA/TensorRT-LLM Public

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ · 7.3k stars · 794 forks

triton-inference-server/tensorrtllm_backend Public

The Triton TensorRT-LLM Backend

Python · 581 stars · 81 forks

CTC-decoder Public

A C++ reimplementation of a CTC decoder

C++ · 5

ICDAR2COCO Public

A tool for converting datasets from ICDAR format to COCO format.

Python · 8 stars · 1 fork