Skip to content
View tileb1's full-sized avatar
😀
😀
  • KU Leuven
  • Seattle, USA
  • 19:26 (UTC -07:00)

Highlights

  • Pro

Block or report tileb1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,777 107 Updated Jul 29, 2024

LPIPS metric. pip install lpips

Python 3,616 499 Updated Jul 2, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,715 953 Updated Aug 23, 2024

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,880 207 Updated Sep 25, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,328 84 Updated Sep 23, 2024

FFCV: Fast Forward Computer Vision (and other ML workloads!)

Python 2,839 178 Updated Jun 16, 2024

The official Meta Llama 3 GitHub site

Python 26,417 2,987 Updated Aug 12, 2024

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Python 2,385 273 Updated Aug 6, 2024

Benchmark for Multi-domain Evaluation of Semantic Segmentation

Python 39 4 Updated Aug 25, 2024

Create your own long term database of Immoweb photos and metadata based on criterias

JavaScript 27 5 Updated May 18, 2023

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper

Python 725 71 Updated Jan 11, 2023

This is the official code release for our work, Denoising Vision Transformers.

Python 286 8 Updated Jul 22, 2024

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Python 105 7 Updated Apr 26, 2024

Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.

Jupyter Notebook 171 7 Updated Jul 18, 2024

An open-source framework for training large multimodal models.

Python 3,682 277 Updated Aug 31, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,945 5,556 Updated Sep 18, 2024

Most popular metrics used to evaluate object detection algorithms.

Python 4,935 1,027 Updated Aug 30, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 747 37 Updated Jun 2, 2024

[NeurIPS 2021] You Only Look at One Sequence

Jupyter Notebook 834 118 Updated May 4, 2022

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 10,712 11,029 Updated Oct 1, 2024
Python 26 1 Updated Mar 1, 2023

(TPAMI 2024) A Survey on Open Vocabulary Learning

801 44 Updated Aug 24, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 17 5 Updated Oct 2, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 30,084 2,753 Updated Oct 2, 2024
Python 768 77 Updated Jan 27, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,569 2,155 Updated Aug 12, 2024
Python 97 2 Updated Jun 11, 2024

🍦 Never use print() to debug again.

Python 8,944 184 Updated Jul 12, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,244 147 Updated Aug 23, 2024

Codebase for "Decoding language spatial relations to 2D spatial arrangements" (Findings of EMNLP 2020).

Python 10 2 Updated Feb 10, 2023
Next