Skip to content
View HamidSuleman1's full-sized avatar

Block or report HamidSuleman1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 698 45 Updated Jul 29, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 1,914 280 Updated Sep 10, 2024

Jekyll website template for personal academic or research group web pages.

SCSS 223 298 Updated Sep 15, 2024

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript 11,955 42,680 Updated Oct 3, 2024

Simple project webpage template. Originally used in Colorful Image Colorization. ECCV, 2016.

HTML 455 156 Updated Oct 20, 2020

Official inference repo for FLUX.1 models

Python 14,470 1,042 Updated Oct 3, 2024

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,883 208 Updated Sep 25, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,021 3,233 Updated Jul 23, 2024

Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel

Python 506 29 Updated Sep 26, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,046 221 Updated Oct 5, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,733 245 Updated Jun 4, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,330 2,911 Updated Sep 2, 2024

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Python 361 10 Updated Jul 9, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,598 2,159 Updated Aug 12, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 9,365 715 Updated Sep 1, 2024

LLM101n: Let's build a Storyteller

29,160 1,600 Updated Aug 1, 2024
Python 2,538 190 Updated Oct 4, 2024

Official repository for "Hardware Resilience Properties of Text-Guided Image Classifiers" [NeurIPS 2023]

Python 9 Updated Nov 28, 2023

Best Practices, code samples, and documentation for Computer Vision.

Jupyter Notebook 9,469 1,171 Updated Feb 16, 2024
Python 695 47 Updated Mar 6, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,235 1,062 Updated May 23, 2024

Scaling Data-Constrained Language Models

Jupyter Notebook 317 19 Updated Sep 22, 2024

CoreNet: A library for training deep neural networks

Python 6,938 540 Updated May 28, 2024

Focus on prompting and generating

Python 40,517 5,650 Updated Aug 21, 2024

Open weights language model from Google DeepMind, based on Griffin.

Python 597 25 Updated Jul 9, 2024

Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.

Python 462 16 Updated Aug 30, 2024

PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)

Python 1,287 117 Updated Jun 1, 2024

Examples and guides for using the OpenAI API

MDX 58,880 9,357 Updated Oct 4, 2024

An open-source academic paper management tool.

TypeScript 1,517 67 Updated Sep 15, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,869 186 Updated Sep 19, 2024
Next