vit-pytorch Public
Forked from lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Python · MIT License · Updated Jul 9, 2024
core-pytorch-utils Public
Forked from serend1p1ty/core-pytorch-utils
Yet another PyTorch Trainer and some core components for deep learning.
Python · MIT License · Updated May 2, 2024
Firefly Public
Forked from yangjianxin1/Firefly
Firefly: a training tool for large language models, supporting Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Python · Updated Mar 6, 2024
trl Public
Forked from huggingface/trl
Train transformer language models with reinforcement learning.
Python · Apache License 2.0 · Updated Feb 27, 2024
geometry-of-truth Public
Forked from saprmarks/geometry-of-truth
Jupyter Notebook · Updated Feb 1, 2024
mistral-src Public
Forked from mistralai/mistral-inference
Reference implementation of Mistral AI 7B v0.1 model.
Jupyter Notebook · Apache License 2.0 · Updated Jan 10, 2024
bytepiece Public
Forked from bojone/bytepiece
A purer tokenizer with a higher compression ratio
Python · Apache License 2.0 · Updated Oct 18, 2023
llama2.c Public
Forked from karpathy/llama2.c
Inference Llama 2 in one file of pure C
Python · MIT License · Updated Aug 29, 2023
numpy-ml Public
Forked from ddbourgin/numpy-ml
Machine learning, in numpy
Python · GNU General Public License v3.0 · Updated Aug 25, 2023
llama Public
Forked from meta-llama/llama
Inference code for LLaMA models
Python · Other · Updated Aug 20, 2023
lit-llama Public
Forked from Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Python · Apache License 2.0 · Updated Aug 8, 2023
MOSS-RLHF Public
Forked from OpenLMLab/MOSS-RLHF
MOSS-RLHF
Python · Apache License 2.0 · Updated Jul 17, 2023
Prompt-Engineering-Guide Public
Forked from dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
Jupyter Notebook · MIT License · Updated May 21, 2023
ColossalAI Public
Forked from hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Python · Apache License 2.0 · Updated May 17, 2023
transformers_tasks Public
Forked from HarderThenHarder/transformers_tasks
⭐️ NLP algorithms with the transformers library. Supports Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT, etc.
Jupyter Notebook · Updated Apr 27, 2023
MetaICL Public
Forked from facebookresearch/MetaICL
An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi
Python · Other · Updated Apr 15, 2023
LibMTL Public
Forked from median-research-group/LibMTL
A PyTorch Library for Multi-Task Learning
Python · MIT License · Updated Mar 14, 2023
nanoGPT Public
Forked from yanjingang/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Python · MIT License · Updated Feb 24, 2023
100-gdb-tips Public
Forked from hellogcc/100-gdb-tips
A collection of gdb tips. "100" may just mean "many" here.
Go · Other · Updated Nov 18, 2022
Channel-LM-Prompting Public
Forked from shmsw25/Channel-LM-Prompting
An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"
Python · Updated Apr 23, 2022
NLPer-Interview Public
Forked from songyingxin/NLPer-Interview
This repository mainly collects interview questions for NLP algorithm engineers
Updated Apr 12, 2022
ConSERT Public
Forked from yym6472/ConSERT
Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
Python · Updated Aug 17, 2021
seq2seq-couplet Public
Forked from wb14123/seq2seq-couplet
Play couplet with a seq2seq model. Uses deep learning to compose matching couplet lines.
Python · GNU Affero General Public License v3.0 · Updated Jul 11, 2021
grip Public
Forked from joeyespo/grip
Preview GitHub README.md files locally before committing them.
Python · MIT License · Updated Feb 21, 2021
PyTorchGradientCheckpointing Public
Forked from shandilya1998/PyTorchGradientCheckpointing
This repository contains code for gradient checkpointing for Google's BERT and a CNN
Python · Updated Dec 22, 2020
sparse_attention Public
Forked from openai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
Python · Updated Aug 12, 2020
transformers Public
Forked from chizhu/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Python · Apache License 2.0 · Updated Mar 9, 2020