LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Python 1,448 87 Updated Nov 7, 2023

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 4,750 389 Updated Sep 29, 2024

KwaiVGI / LivePortrait

Bring portraits to life!

Python 12,055 1,267 Updated Sep 6, 2024

huangwl18 / ReKep

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 408 39 Updated Aug 30, 2024

yipoh / AesBench

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Python 205 7 Updated Aug 15, 2024

TMElyralab / MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 2,507 306 Updated Sep 23, 2024

lisadunlap / LADS

Official Implementation of LADS (Latent Augmentation using Domain descriptionS)

Python 49 7 Updated Apr 18, 2023

lisadunlap / ALIA

Augmenting with Language-guided Image Augmentation (ALIA)

Python 62 9 Updated Oct 30, 2023

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,413 133 Updated Sep 24, 2024

gbstox / agronomy_llm_benchmarking

Python 16 1 Updated Aug 24, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,880 373 Updated Aug 7, 2024

ShengranHu / ADAS

Automated Design of Agentic Systems

Python 893 128 Updated Aug 20, 2024

ygtxr1997 / CelebBasis

Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'

Jupyter Notebook 253 7 Updated Oct 11, 2023

yuhangzang / ContextDET

Contextual Object Detection with Multimodal Large Language Models

183 5 Updated May 30, 2023

Yutong-Zhou-cv / AgriBench

[ECCV 2024 Workshop🎈] The first agriculture benchmark to evaluate MM-LLMs.

5 Updated Aug 27, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

11,982 769 Updated Sep 25, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,396 115 Updated Oct 1, 2024

ShareGPT4Omni / ShareGPT4V

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 124 4 Updated Jul 1, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,669 1,040 Updated Sep 10, 2024

soraw-ai / Awesome-Text-to-Video-Generation

A list for Text-to-Video, Image-to-Video works

173 8 Updated Aug 19, 2024

AtsuMiyai / Awesome-OOD-VLM

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024]

55 2 Updated Aug 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sanctuary Yutong-Zhou-cv

Achievements

Achievements

Highlights

Block or report Yutong-Zhou-cv

Stars

Yutong-Zhou-cv / Awesome-Text-to-Image

zhenyuw16 / GenArtist

zeyuwang-zju / DiffX

KishoreP1 / DetailCLIP

X-PLUG / mPLUG-Owl

DeepPros / DeepDR-LLM

ualsg / global-streetscapes

gohtanii / DiverSeg-dataset

FreedomIntelligence / LongLLaVA

CStanKonrad / long_llama