Skip to content
View Viol2000's full-sized avatar

Highlights

  • Pro

Block or report Viol2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,610 431 Updated Jun 22, 2024

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Python 563 44 Updated Jan 13, 2024

Official implementation for the paper *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 58 2 Updated Aug 24, 2024
Python 31 5 Updated Jun 18, 2024
Python 486 58 Updated Sep 16, 2024

Create LLM agents with long-term memory and custom tools 📚🦙

Python 11,433 1,245 Updated Sep 21, 2024

Scalable and Efficient Serverless Deployment for Large AI Models.

Python 181 16 Updated Sep 22, 2024
Jupyter Notebook 30 2 Updated Jun 13, 2024

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,532 697 Updated Sep 22, 2024

A series of math-specific large language models of our Qwen2 series.

Python 486 39 Updated Sep 18, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Python 1,784 309 Updated Sep 3, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,525 126 Updated Aug 4, 2024
Python 46 2 Updated Apr 2, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Python 498 42 Updated Sep 20, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 46,146 6,508 Updated Sep 22, 2024

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.

Python 701 26 Updated Sep 12, 2024

Fast inference engine for Transformer models

C++ 3,233 286 Updated Sep 20, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,279 1,007 Updated Sep 20, 2024

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python 938 69 Updated Feb 22, 2024
Python 36 6 Updated Jul 10, 2024

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 106 13 Updated Aug 6, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,460 94 Updated Jun 1, 2023

Code for Quiet-STaR

Python 518 74 Updated Aug 21, 2024

σ-GPT: A New Approach to Autoregressive Models

Python 53 8 Updated Aug 14, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 980 53 Updated Jul 14, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1,050 22 Updated Jul 31, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 16,950 1,308 Updated Sep 22, 2024

AICI: Prompts as (Wasm) Programs

Rust 1,903 78 Updated Aug 13, 2024

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 181 18 Updated May 26, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,169 3,841 Updated Sep 19, 2024
Next