- shanghai
LLM
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
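LoRA's core idea can be sketched in a few lines: the frozen pretrained weight W is augmented with a trainable low-rank update B·A, so only the small factors are trained. A minimal numpy illustration (hypothetical shapes, not loralib's actual API):

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, r = 16, 16, 4  # illustrative dimensions; r is the LoRA rank

W = rng.standard_normal((d_out, d_in))      # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01   # trainable low-rank factor
B = np.zeros((d_out, r))                    # trainable, initialized to zero

def lora_forward(x, scale=1.0):
    # y = W x + scale * B A x; with B = 0 at init, this equals the base model
    return W @ x + scale * (B @ (A @ x))

x = rng.standard_normal(d_in)
assert np.allclose(lora_forward(x), W @ x)  # identity at initialization
```

Because B starts at zero, training begins from the pretrained model's behavior, and only r·(d_in + d_out) parameters are updated instead of d_in·d_out.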
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed at low training cost, covering base models, domain-specific fine-tunes and applications, datasets, and tutorials.
A Unified Library for Parameter-Efficient and Modular Transfer Learning
Transformer: PyTorch Implementation of "Attention Is All You Need"
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
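The deep-prompt-tuning idea behind this line can be sketched simply: trainable prompt vectors are prepended to the attended-over sequence at every layer while the backbone stays frozen. A hedged numpy sketch with illustrative names and shapes (not the repository's actual implementation):

```python
import numpy as np

rng = np.random.default_rng(0)
d, n_tokens, n_prompt, n_layers = 8, 5, 3, 2  # illustrative sizes

# One trainable prompt matrix per layer; everything else is frozen.
prompts = [rng.standard_normal((n_prompt, d)) * 0.02 for _ in range(n_layers)]

def attend(q, kv):
    # plain scaled dot-product attention over the (prompt + token) sequence
    scores = q @ kv.T / np.sqrt(d)
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ kv

h = rng.standard_normal((n_tokens, d))
for p in prompts:
    kv = np.concatenate([p, h], axis=0)  # prepend this layer's prompts
    h = attend(h, kv)

print(h.shape)  # (5, 8): token representations, now conditioned on the prompts
```

Injecting prompts at every layer (rather than only at the input, as in shallow prompt tuning) is what lets the method stay competitive with full fine-tuning across model scales.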
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
This repository contains demos I made with the Transformers library by HuggingFace.
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
CoreNet: A library for training deep neural networks
Unsupervised text tokenizer for Neural Network-based text generation.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Google AI 2018 BERT pytorch implementation
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"