Reference models for Intel(R) Gaudi(R) AI Accelerator

Jupyter Notebook 153 79 Updated Sep 16, 2024

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Python 148 186 Updated Sep 30, 2024

A latent text-to-image diffusion model

Jupyter Notebook 67,721 10,108 Updated Jun 18, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,558 4,513 Updated Sep 25, 2024

Pretrain, finetune and serve LLMs on Intel platforms with Ray

Python 99 28 Updated Sep 20, 2024

Efficient Retrieval Augmentation and Generation Framework

Python 1,282 116 Updated Sep 12, 2024

LLM inference in C/C++

C++ 65,669 9,422 Updated Sep 30, 2024

Large Language Model Text Generation Inference

Python 8,840 1,036 Updated Sep 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,639 4,075 Updated Sep 30, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,936 4,057 Updated Sep 30, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,990 1,564 Updated Sep 30, 2024

LlamaIndex is a data framework for your LLM applications

Python 35,800 5,065 Updated Sep 30, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,225 153 Updated Jun 25, 2024

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,222 320 Updated Sep 29, 2024

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Python 1,702 300 Updated Apr 6, 2023

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,741 3,898 Updated Sep 30, 2024

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 31,440 5,480 Updated Sep 30, 2024

A Chinese-language introductory tutorial for LangChain

7,356 595 Updated Aug 11, 2024

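Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow: ChatGPT-based full-paper summarization, professional translation, polishing, peer review, and review responses.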

Python 18,299 1,918 Updated Apr 4, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM

Python 15,701 1,852 Updated Jun 27, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,384 4,031 Updated Jul 17, 2024

Model parallel transformers in JAX and Haiku

Python 6,279 892 Updated Jan 21, 2023

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 132,789 26,458 Updated Sep 30, 2024

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 8,948 658 Updated Sep 30, 2024

RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.

Python 308 68 Updated Jul 31, 2024

Visual Studio Code

TypeScript 162,979 28,777 Updated Sep 30, 2024

Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUs

Python 675 219 Updated Sep 30, 2024

oneAPI Collective Communications Library (oneCCL)

C++ 191 67 Updated Aug 22, 2024

oneCCL Bindings for PyTorch*

C++ 85 23 Updated Sep 10, 2024

A chrome extension that presents your tabs vertically. Problem solved.

JavaScript 465 103 Updated Jan 7, 2023