oldstree

rabbit oldstree

1 follower · 0 following

vit-pytorch Public
Forked from lucidrains/vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python MIT License Updated Jul 9, 2024
halluattack Public

Updated Jul 6, 2024
core-pytorch-utils Public
Forked from serend1p1ty/core-pytorch-utils

Yet another PyTorch Trainer and some core components for deep learning.

Python MIT License Updated May 2, 2024
Firefly Public
Forked from yangjianxin1/Firefly

Firefly: 大模型训练工具，支持训练Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python Updated Mar 6, 2024
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Feb 27, 2024
weak-to-strong Public
Forked from openai/weak-to-strong

Python MIT License Updated Feb 13, 2024
geometry-of-truth Public
Forked from saprmarks/geometry-of-truth

Jupyter Notebook Updated Feb 1, 2024
mistral-src Public
Forked from mistralai/mistral-inference

Reference implementation of Mistral AI 7B v0.1 model.

Jupyter Notebook Apache License 2.0 Updated Jan 10, 2024
bytepiece Public
Forked from bojone/bytepiece

更纯粹、更高压缩率的Tokenizer

Python Apache License 2.0 Updated Oct 18, 2023
llama2.c Public
Forked from karpathy/llama2.c

Inference Llama 2 in one file of pure C

Python MIT License Updated Aug 29, 2023
numpy-ml Public
Forked from ddbourgin/numpy-ml

Machine learning, in numpy

Python GNU General Public License v3.0 Updated Aug 25, 2023
llama Public
Forked from meta-llama/llama

Inference code for LLaMA models

Python Other Updated Aug 20, 2023
lit-llama Public
Forked from Lightning-AI/lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python Apache License 2.0 Updated Aug 8, 2023
MOSS-RLHF Public
Forked from OpenLMLab/MOSS-RLHF

MOSS-RLHF

Python Apache License 2.0 Updated Jul 17, 2023
Prompt-Engineering-Guide Public
Forked from dair-ai/Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

Jupyter Notebook MIT License Updated May 21, 2023
ColossalAI Public
Forked from hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

Python Apache License 2.0 Updated May 17, 2023
transformers_tasks Public
Forked from HarderThenHarder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook Updated Apr 27, 2023
MetaICL Public
Forked from facebookresearch/MetaICL

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Python Other Updated Apr 15, 2023
LibMTL Public
Forked from median-research-group/LibMTL

A PyTorch Library for Multi-Task Learning

Python MIT License Updated Mar 14, 2023
nanoGPT Public
Forked from yanjingang/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python MIT License Updated Feb 24, 2023
100-gdb-tips Public
Forked from hellogcc/100-gdb-tips

A collection of gdb tips. 100 maybe just mean many here.

Go Other Updated Nov 18, 2022
Channel-LM-Prompting Public
Forked from shmsw25/Channel-LM-Prompting

An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"

Python Updated Apr 23, 2022
NLPer-Interview Public
Forked from songyingxin/NLPer-Interview

该仓库主要记录 NLP 算法工程师相关的面试题

Updated Apr 12, 2022
ConSERT Public
Forked from yym6472/ConSERT

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Python Updated Aug 17, 2021
seq2seq-couplet Public
Forked from wb14123/seq2seq-couplet

Play couplet with seq2seq model. 用深度学习对对联。

Python GNU Affero General Public License v3.0 Updated Jul 11, 2021
grip Public
Forked from joeyespo/grip

Preview GitHub README.md files locally before committing them.

Python MIT License Updated Feb 21, 2021
PyTorchGradientCheckpointing Public
Forked from shandilya1998/PyTorchGradientCheckpointing

This repository contains code for gradient checkpoining for Google's BERT and a CNN

Python Updated Dec 22, 2020
sparse_attention Public
Forked from openai/sparse_attention

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python Updated Aug 12, 2020
transformers Public
Forked from chizhu/transformers

🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

Python Apache License 2.0 Updated Mar 9, 2020
NLP_model Public

some models of NLP

Python Updated Sep 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rabbit oldstree

Block or report oldstree

vit-pytorch Public

halluattack Public

core-pytorch-utils Public

Firefly Public

trl Public

weak-to-strong Public

geometry-of-truth Public

mistral-src Public

bytepiece Public

llama2.c Public

numpy-ml Public

llama Public

lit-llama Public

MOSS-RLHF Public

Prompt-Engineering-Guide Public

ColossalAI Public

transformers_tasks Public

MetaICL Public

LibMTL Public

nanoGPT Public

100-gdb-tips Public

Channel-LM-Prompting Public

NLPer-Interview Public

ConSERT Public

seq2seq-couplet Public

grip Public

PyTorchGradientCheckpointing Public

sparse_attention Public

transformers Public

NLP_model Public