LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification

Python 14 1 Updated Jul 21, 2024

[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models

Python 15 1 Updated Aug 3, 2024

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 430 60 Updated Sep 4, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 4,072 377 Updated Oct 4, 2024

[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization

Python 49 3 Updated Aug 20, 2024

[ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!

Python 26 Updated Aug 2, 2024

[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

Python 18 Updated May 29, 2024

[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

41 8 Updated Jul 24, 2024

[CVPR'24] Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

Python 160 5 Updated Mar 13, 2024

A framework for few-shot evaluation of autoregressive language models.

Python 23 10 Updated Dec 21, 2023

Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries

Python 53 4 Updated Feb 29, 2024

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

6,349 383 Updated Jul 28, 2024

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Jupyter Notebook 324 47 Updated Aug 25, 2024

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,767 164 Updated May 25, 2024

Aligning LMMs with Factually Augmented RLHF

Python 309 20 Updated Nov 1, 2023

Assignment Solutions for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)

Jupyter Notebook 9 Updated Jan 4, 2022

Reference implementation for DPO (Direct Preference Optimization)

Python 2,059 167 Updated Aug 11, 2024

Accepted paper lists for some conferences (including AI, ML, Robotics)

947 73 Updated Aug 14, 2024

An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"

Python 43 7 Updated Jul 1, 2023

[CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Python 153 10 Updated Jun 29, 2023

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,511 97 Updated Jun 1, 2023
Python 131 7 Updated Sep 10, 2023

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Python 339 18 Updated Jun 18, 2023

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,277 951 Updated Oct 4, 2024

RSS 2023: This repository contains code for the paper Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.

Python 65 8 Updated May 10, 2023

A fine-tuning scheme based on ChatGLM-6B + LoRA

Python 3,731 440 Updated Nov 25, 2023

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,585 4,518 Updated Sep 25, 2024