Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 522 24 Updated Jul 15, 2024

Image anomaly detection benchmark in industrial manufacturing

Python 88 10 Updated May 14, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,566 95 Updated Jul 6, 2024

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 901 67 Updated Jun 15, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,043 39 Updated Jul 14, 2024

A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.

Python 396 26 Updated Jul 18, 2024

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,279 205 Updated Apr 15, 2024

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 676 49 Updated Mar 20, 2024

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024)

97 1 Updated Jul 9, 2024

Image Prompter for Gradio

JavaScript 55 9 Updated Dec 14, 2023

Datasets for industrial surface-inspection

64 11 Updated Mar 29, 2022

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Jupyter Notebook 1,484 88 Updated Jul 6, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,102 277 Updated May 4, 2024

Multimodal Models in Real World

Jupyter Notebook 339 17 Updated Jul 12, 2024

An open source, layer-based web interface for Collage Diffusion - use a familiar Photoshop-like interface and let the AI harmonize the details.

Python 57 4 Updated Sep 11, 2023

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 3,839 349 Updated Apr 8, 2024

Paper list and datasets for industrial image anomaly/defect detection (continuously updated).

1,228 121 Updated Jul 18, 2024

[CVPR24] CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

Python 73 1 Updated Mar 2, 2024

[ECCV 2024] The official repo of the Griffon series

Python 86 5 Updated Jul 4, 2024
Python 8,218 480 Updated Jan 27, 2024

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

Python 1,896 197 Updated Jul 15, 2024

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 979 78 Updated Jul 2, 2024

A checklist for incorporation so you can get back to building your product, fundraising, etc.

2,518 205 Updated Jan 25, 2024

Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks. (Under Review)

Python 298 25 Updated Mar 28, 2024

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

Jupyter Notebook 40 2 Updated Mar 27, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 326 11 Updated Apr 8, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,154 421 Updated Jul 18, 2024

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Python 100 Updated Jun 28, 2024

CoreNet: A library for training deep neural networks

Python 6,764 522 Updated May 28, 2024

Build a chatbot powered by LlamaIndex that augments GPT 3.5 with the contents of the Streamlit docs (or your own data).

Python 171 230 Updated Jul 2, 2024