Skip to content
View starsy's full-sized avatar
  • Cisco Systems
  • Shanghai, China

Block or report starsy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Convert PDF to markdown quickly with high accuracy

Python 16,796 953 Updated Sep 7, 2024

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 4,984 396 Updated Sep 30, 2024

LLM inference in C/C++

C++ 65,788 9,447 Updated Oct 4, 2024

🍦 ChatTTS-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 690 85 Updated Oct 4, 2024

官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,081 72 Updated Jul 3, 2024

Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip

Python 1,310 97 Updated Oct 4, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,864 3,909 Updated Oct 1, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,811 4,106 Updated Oct 4, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,232 1,061 Updated May 23, 2024

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,961 649 Updated Oct 3, 2024

A Python library to extract tabular data from PDFs

Python 2,957 466 Updated Aug 19, 2024

Go ahead and axolotl questions

Python 7,663 844 Updated Oct 3, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 47,093 6,685 Updated Oct 3, 2024

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vi…

Python 3,683 314 Updated Oct 4, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,387 184 Updated Jul 16, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,375 468 Updated Sep 28, 2024

Security and compliance proxy for LLM APIs

JavaScript 44 9 Updated Jul 21, 2023

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 12,650 1,469 Updated Oct 4, 2024

用 Express 和 Vue3 搭建的 ChatGPT 演示网页

Vue 31,333 11,224 Updated Aug 16, 2024

OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…

JavaScript 18,243 4,124 Updated Sep 22, 2024

Container plugin for Slurm Workload Manager

C 281 31 Updated Jul 31, 2024

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

C++ 23,708 1,811 Updated Oct 4, 2024

🚀 基于大语言模型和 RAG 的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。

Python 10,511 1,385 Updated Oct 4, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,140 850 Updated Sep 13, 2024

This repo includes ChatGPT prompt curation to use ChatGPT better.

HTML 111,289 15,177 Updated Sep 26, 2024

Minimal keyword extraction with BERT

Python 3,469 344 Updated Jul 16, 2024

structured outputs for llms

Python 7,686 613 Updated Oct 4, 2024

Joplin - the privacy-focused note taking app with sync capabilities for Windows, macOS, Linux, Android and iOS.

TypeScript 45,360 4,932 Updated Oct 1, 2024

Fork of turndown-plugin-gfm for Jopin

JavaScript 12 5 Updated Jun 27, 2021

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,493 3,841 Updated Oct 2, 2024
Next