76 stars written in Python

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 53,816 7,230 Updated Sep 26, 2024

The world's simplest facial recognition api for Python and the command line

Python 53,070 13,456 Updated Aug 21, 2024

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 50,062 16,177 Updated Oct 5, 2024

A collection of design patterns/idioms in Python

Python 40,298 6,930 Updated Sep 5, 2024

Making large AI models cheaper, faster and more accessible

Python 38,703 4,338 Updated Oct 8, 2024

Comprehensive Python Cheatsheet

Python 36,191 6,462 Updated Oct 7, 2024

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 35,659 5,903 Updated Jul 26, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,725 4,710 Updated Oct 8, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 25,451 5,269 Updated Oct 8, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 19,962 2,994 Updated Oct 4, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,637 2,161 Updated Aug 12, 2024

DALL·E Mini - Generate images from a text prompt

Python 14,749 1,206 Updated Nov 9, 2023

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 14,105 1,638 Updated Oct 8, 2024

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 13,695 2,042 Updated Jul 24, 2024

End-to-End Object Detection with Transformers

Python 13,433 2,426 Updated Mar 12, 2024

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,404 2,067 Updated Jan 23, 2024

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

Python 12,162 2,515 Updated Aug 15, 2024

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,336 1,548 Updated Oct 7, 2024

Geometric Computer Vision Library for Spatial AI

Python 9,861 960 Updated Oct 8, 2024

A Python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities

Python 8,585 2,189 Updated Aug 3, 2024

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,215 1,038 Updated Apr 24, 2024

Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,911 737 Updated Oct 5, 2024
Python 7,658 497 Updated Apr 14, 2024

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Python 6,625 1,700 Updated Oct 8, 2024

High accuracy RAG for answering questions from scientific documents with citations

Python 6,047 568 Updated Oct 7, 2024

LSTM built using Keras Python package to predict time series steps and sequences. Includes sin wave and stock market data

Python 4,795 1,953 Updated Mar 24, 2023

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

Python 4,622 762 Updated Mar 12, 2024

To speed up LLM inference and enhance the LLM's perception of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 4,495 251 Updated Aug 22, 2024

Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 4,437 618 Updated Aug 23, 2024

Google Drive Public File Downloader when Curl/Wget Fails

Python 4,229 348 Updated Aug 12, 2024