Stars
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
Official implementation for the paper *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
Create LLM agents with long-term memory and custom tools 📚🦙
Scalable and Efficient Serverless Deployment for Large AI Models.
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A series of math-specific large language models of our Qwen2 series.
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, which reduces pre-filling latency by up to 10x on an A100 while maintaining accuracy.
Fast inference engine for Transformer models
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
800,000 step-level correctness labels on LLM solutions to MATH problems
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
DSPy: The framework for programming—not prompting—foundation models
(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)