Skip to content
View YuzaChongyi's full-sized avatar

Block or report YuzaChongyi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tools for merging pretrained large language models.

Python 4,524 397 Updated Sep 16, 2024

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 54 4 Updated Aug 7, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 248 5 Updated Aug 29, 2024

Official repository for CoMM Dataset

Python 17 Updated Sep 15, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

731 20 Updated Jul 31, 2024

Ongoing research training transformer models at scale

Python 10,036 2,260 Updated Sep 21, 2024

Stable Diffusion web UI

Python 139,892 26,501 Updated Sep 9, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks

Python 1,042 144 Updated Sep 20, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 6,892 436 Updated Sep 18, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,976 842 Updated Sep 13, 2024

ChatGPT资料汇总学习,持续更新......

4,050 383 Updated Dec 12, 2023

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,015 325 Updated Jun 28, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,887 402 Updated May 29, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,406 1,091 Updated Sep 2, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,074 293 Updated Jun 22, 2024

DataComp: In search of the next generation of multimodal datasets

Python 637 54 Updated Jan 2, 2024

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 50,666 11,337 Updated Sep 18, 2024

Stable diffusion webui based on diffusers.

Python 984 68 Updated Sep 29, 2023

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,980 237 Updated Sep 6, 2023

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python 1,074 93 Updated Jun 13, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,802 760 Updated Sep 19, 2024

百亿参数的中英文双语基座大模型

Python 2,685 211 Updated Jul 28, 2023

Research Trends in LLM-guided Multimodal Learning.

347 16 Updated Oct 17, 2023

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,368 2,128 Updated Aug 12, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,305 2,906 Updated Sep 2, 2024

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,499 168 Updated Oct 31, 2023

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Python 853 81 Updated Feb 29, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,674 948 Updated Aug 23, 2024

Foundation Architecture for (M)LLMs

Python 3,003 202 Updated Apr 11, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,700 2,575 Updated Aug 20, 2024
Next