Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 935 33 Updated Jun 29, 2024

Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding

Python 42 2 Updated Mar 26, 2024

Consistent Prompting for Rehearsal-Free Continual Learning [CVPR2024]

Python 19 Updated Jun 20, 2024

OMG-LLaVA and OMG-Seg codebase

Python 903 44 Updated Jun 28, 2024

[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

93 4 Updated Feb 28, 2024

Code for RoboFlamingo

Python 257 17 Updated May 8, 2024

Text-to-3D Generation within 5 Minutes

Python 584 39 Updated Mar 10, 2024

[CVPR 2024] A world model for autonomous driving.

Python 237 2 Updated Dec 7, 2023

[RA-L 2023] CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation

Python 18 Updated Nov 22, 2023

Generative Models by Stability AI

Python 23,203 2,562 Updated Jun 8, 2024

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Python 170 20 Updated Jan 14, 2024

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,545 258 Updated Jun 2, 2024

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Python 2,012 159 Updated Apr 5, 2024

Searching prompt modules for parameter-efficient transfer learning.

Python 210 11 Updated Dec 8, 2023

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,584 183 Updated May 20, 2024

(TPAMI 2024) A Survey on Open Vocabulary Learning

724 41 Updated Jun 27, 2024

FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, and beyond.

Python 94 1 Updated Jan 3, 2024

Fast Segment Anything

Python 7,098 666 Updated Jun 25, 2024

[Arxiv-04-2023] Transformer-Based Visual Segmentation: A Survey

603 44 Updated Apr 6, 2024

Adapting Segment Anything Model for Medical Image Segmentation

Python 887 72 Updated Jun 21, 2024

Segment Anything in Medical Images

Jupyter Notebook 2,419 318 Updated Jun 27, 2024

✨✨Latest Advances on Multimodal Large Language Models

10,402 697 Updated Jul 2, 2024

[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

Python 522 26 Updated Dec 16, 2023

This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).

673 45 Updated Jul 3, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

785 32 Updated Jun 5, 2024

4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)

Python 75 1 Updated May 17, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 80,068 21,523 Updated Jul 3, 2024

ImageBind: One Embedding Space to Bind Them All

Python 8,054 733 Updated Jun 10, 2024