Restore safety in fine-tuned language models through task arithmetic

Python 24 1 Updated Mar 28, 2024

Repository for the Bias Benchmark for QA dataset.

Python 73 16 Updated Jan 8, 2024

Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering

Python 61 4 Updated Jul 6, 2024

Source code of "Reasons to Reject? Aligning Language Models with Judgments"

Python 52 5 Updated Feb 29, 2024

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Shell 58 4 Updated May 28, 2024

The Prism Alignment Project

Jupyter Notebook 30 1 Updated Apr 25, 2024
Python 21 15 Updated Aug 6, 2023

Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Constrained Sampling from LMs"

Python 58 4 Updated Mar 21, 2024

Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Python 39 2 Updated Jan 14, 2024
Python 25 3 Updated Feb 8, 2024

[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"

Python 17 2 Updated Mar 28, 2024

WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining …

Jupyter Notebook 53 13 Updated Apr 27, 2024

The most user-friendly V2Ray one-click install script & management script

Shell 23,614 15,869 Updated Jun 10, 2024

Controlled Text Generation via Language Model Arithmetic

Python 189 12 Updated Jul 3, 2024

Code for the ICLR 2023 paper: Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization

Python 7 1 Updated Nov 3, 2023

U.S. Congressional Tweets Dataset used in the ACL 2018 paper for predicting Moral Foundations of the tweets of U.S. politicians.

7 Updated Feb 16, 2019

Released code for "Stance Detection on Social Media with Background Knowledge" (EMNLP 2023).

Python 10 Updated Apr 23, 2024
Python 49 6 Updated Mar 2, 2024

Repository for the LREC 2022 submission on Emotion Word Dynamics in Geolocated Tweet data.

Python 74 20 Updated Aug 16, 2023

A tool from the MLNLP community for shortening reference lists. It simplifies BibTeX entries using official info.

Python 421 34 Updated Jun 28, 2024

Tips for paper writing and research: notes and summaries of experience in scientific paper writing

115 14 Updated Nov 4, 2021

A curated repository from the MLNLP community to help authors avoid small mistakes in paper submissions. Paper Writing Tips

3,354 435 Updated May 29, 2022

Everything about note management. All in Zotero.

TypeScript 4,909 176 Updated Jul 11, 2024

MLNLP: Paper Picture Writing Code

TeX 987 111 Updated Nov 5, 2022

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

278 12 Updated Oct 4, 2023

Learning from Mistakes via Interactive Study Assistant for Large Language Models

Python 7 Updated Nov 27, 2023

This repository contains the dataset and code for the task of Morality Frames prediction in political tweets using Relational Learning. This work is published as a paper - "Identifying Morality Fr…

5 Updated Nov 22, 2021
Python 158 18 Updated Jan 31, 2024

Generative Agents: Interactive Simulacra of Human Behavior

15,821 1,994 Updated Jun 3, 2024