Skip to content
View bryanSwk's full-sized avatar
🐕‍🦺
🐕‍🦺
Block or Report

Block or report bryanSwk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Python 3,076 465 Updated Jun 8, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,468 88 Updated Jul 10, 2024

This is the repo for our new project Highly Accurate Dichotomous Image Segmentation

Jupyter Notebook 2,106 244 Updated Jul 3, 2024

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 446 25 Updated Jul 2, 2024

MARS5 speech model (TTS) from CAMB.AI

Python 2,182 167 Updated Jul 5, 2024

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Python 370 23 Updated Jul 3, 2024

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Python 262 14 Updated Apr 14, 2024

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Python 986 51 Updated Jun 14, 2024

Low-code framework for building custom LLMs, neural networks, and other AI models

Python 10,987 1,180 Updated Jul 8, 2024

High-Resolution 3D Human Digitization from A Single Image.

Python 9,459 1,427 Updated Mar 11, 2024
Jupyter Notebook 6,964 506 Updated Jun 16, 2024

Schedule-Free Optimization in PyTorch

Python 1,667 53 Updated Jul 9, 2024

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Python 7,911 551 Updated Jul 3, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,709 1,211 Updated Jul 10, 2024

Sequence Parallel Attention for Long Context LLM Model Training and Inference

Python 212 7 Updated Jun 27, 2024

Reference implementation of Megalodon 7B model

Cuda 492 50 Updated Apr 18, 2024

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Python 145 18 Updated Apr 23, 2024

Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new AI research

Python 107 4 Updated Jun 7, 2024

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,369 370 Updated Jul 6, 2024

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,258 112 Updated Apr 17, 2024

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 279 14 Updated Jul 3, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 29,814 6,307 Updated Jul 4, 2024

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHX…

Python 2,564 197 Updated Mar 31, 2024
69 Updated Mar 29, 2024

A Comparative Framework for Multimodal Recommender Systems

Python 845 138 Updated Jul 5, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,455 148 Updated Jul 4, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 10,893 973 Updated Jul 8, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,061 1,960 Updated Jul 3, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,306 331 Updated May 28, 2024
Next