-
-
Event-Bench Public
Official code of *Towards Event-oriented Long Video Understanding*
-
LongVA Public
Forked from EvolvingLMMs-Lab/LongVALong Context Transfer from Language to Vision
Python Apache License 2.0 UpdatedJul 3, 2024 -
VideoLLaMA2 Public
Forked from DAMO-NLP-SG/VideoLLaMA2VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Python Apache License 2.0 UpdatedJul 2, 2024 -
Richar-Du.github.io Public
Forked from academicpages/academicpages.github.ioGithub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript MIT License UpdatedFeb 19, 2024 -
LAMOC Public
The official repository our ACL 2023 paper: Zero-shot Visual Question Answering with Language Model Feedback
-
ComVint Public
The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning''
-
Qwen-VL Public
Forked from QwenLM/Qwen-VLThe official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Python Other UpdatedSep 12, 2023 -
MyLAVIS Public
Forked from salesforce/LAVISLAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedAug 5, 2023 -
My-FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Python Apache License 2.0 UpdatedJul 13, 2023 -
My-MiniGPT4 Public
Forked from Vision-CAIR/MiniGPT-4MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
-
magma Public
Forked from Aleph-Alpha/magmaMAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…
Python MIT License UpdatedJan 25, 2023 -
OFA Public
Forked from OFA-Sys/OFAOfficial repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Python Apache License 2.0 UpdatedDec 25, 2022 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedJul 18, 2022 -
Chinese-CLIP Public
Forked from billjie1/Chinese-CLIPChinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Python MIT License UpdatedJul 8, 2022 -
-
-
-
-
-
-
-
This project used the multilayer perceptron to recognize human's activity and is deployed on Wechat mini-program.
-
-
TextBox Public
Forked from RUCAIBox/TextBoxTextBox is an open-source library for building text generation system.
Python MIT License UpdatedMar 22, 2021 -
-
-
-
-
LeetCodeAnimation Public
Forked from MisterBooo/LeetCodeAnimationDemonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)