Skip to content
View ruizheng20's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Fudan University

Block or report ruizheng20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,948 214 Updated Oct 1, 2024

Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)

Jupyter Notebook 58 9 Updated Mar 15, 2024

🙌 OpenHands: Code Less, Make More

Python 32,271 3,697 Updated Oct 1, 2024

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Python 195 13 Updated Apr 29, 2024
Python 2,490 303 Updated May 19, 2024
Python 310 16 Updated Jul 16, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,504 97 Updated Jun 1, 2023

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

6,334 383 Updated Jul 28, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

15,207 1,410 Updated Sep 19, 2024

website

CSS 359 43 Updated Dec 11, 2023

https://hrl.boyuai.com/

Jupyter Notebook 2,369 523 Updated Nov 22, 2022

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,211 715 Updated Aug 5, 2024

Repo for external large-scale work

Python 6,458 724 Updated Apr 27, 2024

Inference code for Llama models

Python 55,766 9,504 Updated Aug 18, 2024

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

907 47 Updated Sep 4, 2024

MOSS-RLHF

Python 1,273 98 Updated Mar 3, 2024

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript 4,074 390 Updated Sep 9, 2024

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1,331 129 Updated May 27, 2024

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Python 339 18 Updated Jun 18, 2023

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,978 3,229 Updated Aug 17, 2024

🎓 无需编写任何代码即可轻松创建漂亮的学术网站 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.

TeX 3,884 6,338 Updated Aug 20, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 766 60 Updated Jul 1, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 10,708 11,022 Updated Sep 28, 2024

This website template was created for a 2 part website workshop that I held at USC.

HTML 6 Updated Apr 12, 2023

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,222 1,860 Updated Apr 30, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,461 471 Updated Jan 8, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 167,135 44,189 Updated Oct 1, 2024

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 31,453 9,222 Updated Sep 30, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,474 159 Updated Nov 28, 2023

Instruction Tuning with GPT-4

HTML 4,174 300 Updated Jun 11, 2023
Next