yifan123

Jie Liu yifan123

Ph.D. student @ MMLab, CUHK

35 followers · 33 following

The Chinese University of Hong Kong
jieliu.site

Achievements

Highlights

Stars

OpenImagingLab / LenslessFace

LenslessFace : An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification

Python 14 1 Updated Jul 21, 2024

ZHZisZZ / weak-to-strong-search

[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models

Python 15 1 Updated Aug 3, 2024

multimodal-art-projection / MAP-NEO

Python 846 81 Updated Jun 21, 2024

lmarena / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 430 60 Updated Sep 4, 2024

pytorch / torchtune

A Native-PyTorch Library for LLM Fine-tuning

Python 4,072 377 Updated Oct 4, 2024

ZHZisZZ / modpo

[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization

Python 49 3 Updated Aug 20, 2024

ZHZisZZ / emulated-disalignment

[ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!

Python 26 Updated Aug 2, 2024

conceptmath / conceptmath

[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

Python 18 Updated May 29, 2024

mtbench101 / mt-bench-101

[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

41 8 Updated Jul 24, 2024

BiDiff / bidiff

[CVPR'24] Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

Python 160 5 Updated Mar 13, 2024

wellecks / lm-evaluation-harness

Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Python 23 10 Updated Dec 21, 2023

wiio12 / LEGO-Prover

Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries

Python 53 4 Updated Feb 29, 2024

WooooDyy / LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

6,349 383 Updated Jul 28, 2024

TIGER-AI-Lab / MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Jupyter Notebook 324 47 Updated Aug 25, 2024

AkariAsai / self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,767 164 Updated May 25, 2024

llava-rlhf / LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Python 309 20 Updated Nov 1, 2023

ZHZisZZ / cs285-homework-fall2021

Assignment Solutions for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)

Jupyter Notebook 9 Updated Jan 4, 2022

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,059 167 Updated Aug 11, 2024

Lionelsy / Conference-Accepted-Paper-List

Some Conferences' accepted paper lists (including AI, ML, Robotic)

947 73 Updated Aug 14, 2024

Agora-Lab-AI / Orca

An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"

Python 43 7 Updated Jul 1, 2023

opendilab / DOS

[CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Python 153 10 Updated Jun 29, 2023

InternLM / InternLM-techreport

902 24 Updated Jun 7, 2023

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,511 97 Updated Jun 1, 2023

i-Eval / FairEval

Python 131 7 Updated Sep 10, 2023

agi-templar / Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Python 339 18 Updated Jun 18, 2023

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,277 951 Updated Oct 4, 2024

Letian-Wang / asaprl

RSS 2023: This repository contains code for the paper Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.

Python 65 8 Updated May 10, 2023

osanseviero / ml_timeline

592 35 Updated Jun 19, 2023

mymusise / ChatGLM-Tuning

基于ChatGLM-6B + LoRA的Fintune方案

Python 3,731 440 Updated Nov 25, 2023

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,585 4,518 Updated Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jie Liu yifan123

Achievements

Achievements

Highlights

Block or report yifan123

Stars

OpenImagingLab / LenslessFace

ZHZisZZ / weak-to-strong-search

multimodal-art-projection / MAP-NEO

lmarena / arena-hard-auto

pytorch / torchtune

ZHZisZZ / modpo

ZHZisZZ / emulated-disalignment

conceptmath / conceptmath

mtbench101 / mt-bench-101

BiDiff / bidiff

wellecks / lm-evaluation-harness

wiio12 / LEGO-Prover

WooooDyy / LLM-Agent-Paper-List

TIGER-AI-Lab / MAmmoTH

AkariAsai / self-rag

llava-rlhf / LLaVA-RLHF

ZHZisZZ / cs285-homework-fall2021

eric-mitchell / direct-preference-optimization

Lionelsy / Conference-Accepted-Paper-List

Agora-Lab-AI / Orca

opendilab / DOS

InternLM / InternLM-techreport

openai / prm800k

i-Eval / FairEval

agi-templar / Stable-Alignment

ShishirPatil / gorilla

Letian-Wang / asaprl

osanseviero / ml_timeline

mymusise / ChatGLM-Tuning

lm-sys / FastChat