Block or Report
Block or report bladewaltz1
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (1)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
This is a repo with links to everything you'd ever want to learn about data engineering
High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm
Unify Efficient Fine-Tuning of 100+ LLMs
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
🦜🔗 Build context-aware reasoning applications
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Hackable and optimized Transformers building blocks, supporting a composable construction.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Official Pytorch Implementation of SegViT: Semantic Segmentation with Plain Vision Transformers
High-Resolution Image Synthesis with Latent Diffusion Models
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
A length-controllable and non-autoregressive image captioning model.
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.