Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…

Jupyter Notebook 5,288 820 Updated Oct 4, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 6,449 663 Updated Aug 12, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,460 436 Updated Jul 30, 2024
Python 105 7 Updated Oct 3, 2024

CSGO: Content-Style Composition in Text-to-Image Generation 🔥

Jupyter Notebook 229 5 Updated Sep 5, 2024

SAM with text prompt

Jupyter Notebook 1,572 169 Updated Sep 23, 2024

Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"

Jupyter Notebook 326 27 Updated Sep 7, 2024

More relighting!

Python 4,962 335 Updated Jun 27, 2024
Python 1,448 102 Updated Sep 23, 2024

Evaluating text-to-image/video/3D models with VQAScore

Python 187 17 Updated Sep 9, 2024

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

5,825 335 Updated Oct 6, 2024

Bring portraits to life!

Python 12,155 1,278 Updated Oct 7, 2024

Understand Human Behavior to Align True Needs

Python 3,321 291 Updated Jul 20, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,864 1,716 Updated Oct 8, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 370 17 Updated Apr 8, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,172 130 Updated Aug 29, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 832 67 Updated Sep 24, 2024

OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023

Jupyter Notebook 1,455 129 Updated Oct 3, 2024

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Python 642 39 Updated Jan 22, 2024

Official inference repo for FLUX.1 models

Python 14,563 1,049 Updated Oct 8, 2024

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Python 411 13 Updated Jul 2, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,627 77 Updated Aug 5, 2024

Google Research

Jupyter Notebook 34,001 7,855 Updated Oct 7, 2024

A curated publication list on open-vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).

405 16 Updated Sep 14, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,278 109 Updated Jul 19, 2024

[ECCV 2024] Official code for the paper "Open-Vocabulary SAM".

Python 915 27 Updated Jul 31, 2024

Combining Segment Anything (SAM) with Grounding DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

Jupyter Notebook 367 24 Updated May 3, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 62,794 32,136 Updated Oct 7, 2024

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Jupyter Notebook 857 78 Updated Jun 22, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,012 5,565 Updated Sep 18, 2024