Stars
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding
Consistent Prompting for Rehearsal-Free Continual Learning [CVPR2024]
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
[CVPR 2024] A world model for autonomous driving.
[RA-L 2023] CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation
Generative Models by Stability AI
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Searching prompt modules for parameter-efficient transfer learning.
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
(TPAMI 2024) A Survey on Open Vocabulary Learning
FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, and beyond.
[Arxiv-04-2023] Transformer-Based Visual Segmentation: A Survey
Adapting Segment Anything Model for Medical Image Segmentation
Segment Anything in Medical Images
✨✨Latest Advances on Multimodal Large Language Models
[NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
Recent LLM-based CV and related works. Welcome to comment/contribute!
4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ImageBind: One Embedding Space to Bind Them All