Sen Wang senwang86

ex-Meta, ML/AI enthusiast. "Stay hungry. Stay foolish."

3 followers · 1 following

Stars

Paper Source Code

Source code to reproduce the results in papers

148 repositories

cliangyu / Cola

[NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"

Jupyter Notebook · 98 stars · 7 forks · Updated Nov 9, 2023

HKUDS / RLMRec

[WWW'2024] "RLMRec: Representation Learning with Large Language Models for Recommendation"

Python · 239 stars · 23 forks · Updated Jun 26, 2024

kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

Python · 1,458 stars · 139 forks · Updated Jun 27, 2024

Srijith-rkr / Whispering-LLaMA

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook · 209 stars · 15 forks · Updated May 19, 2024

sczhou / ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python · 5,114 stars · 614 forks · Updated Apr 17, 2024

omerbt / TokenFlow

Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)

Python · 1,518 stars · 134 forks · Updated Jan 23, 2024

XPixelGroup / DiffBIR

Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python · 3,159 stars · 264 forks · Updated Jul 3, 2024

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…

Python · 51,970 stars · 5,371 forks · Updated Jul 18, 2024

apple / ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python · 1,777 stars · 98 forks · Updated Nov 30, 2023

dvlab-research / LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python · 1,652 stars · 113 forks · Updated Jul 2, 2024

rese1f / StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python · 1,357 stars · 85 forks · Updated Sep 7, 2023

OSVAI / KernelWarehouse

The official project website of "KernelWarehouse: Rethinking the Design of Dynamic Convolution" (KW for short, accepted to ICML 2024)

Python · 75 stars · 3 forks · Updated Jun 13, 2024

showlab / BoxDiff

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Python · 229 stars · 12 forks · Updated Jul 14, 2024

liming-ai / AlignDet

Official code for ICCV 2023 Paper: AlignDet: Aligning Pre-training and Fine-tuning in Object Detection.

Python · 136 stars · 13 forks · Updated Sep 26, 2023

OPPO-Mente-Lab / Subject-Diffusion

Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning

Python · 262 stars · 11 forks · Updated Jul 11, 2024

bo-miao / SgMg

[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.

Python · 76 stars · 9 forks · Updated Sep 28, 2023

Wangt-CN / DisCo

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Python · 1,013 stars · 112 forks · Updated Apr 10, 2024

LeapLabTHU / FLatten-Transformer

Official repository of FLatten Transformer (ICCV2023)

Python · 361 stars · 19 forks · Updated Jul 17, 2024

THU-MIG / RepViT

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

Jupyter Notebook · 678 stars · 55 forks · Updated Jun 14, 2024

gregor-ge / mBLIP

Python · 84 stars · 7 forks · Updated Jan 10, 2024

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python · 2,607 stars · 236 forks · Updated Jun 4, 2024

berkeley-hipie / HIPIE

[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"

Jupyter Notebook · 257 stars · 19 forks · Updated Mar 21, 2024

OpenLMLab / LOMO

LOMO: LOw-Memory Optimization

Python · 956 stars · 69 forks · Updated Jul 2, 2024

Nikunj-Gupta / Efficient_ResNets

A residual network design with fewer than 5 million trainable parameters, achieving an accuracy of 96.04% on CIFAR-10.

Python · 26 stars · 2 forks · Updated Jun 24, 2024

ni9elf / 3HAN

Official implementation of "3HAN: A Deep Neural Network for Fake News Detection" (ICONIP 2017)

Python · 90 stars · 17 forks · Updated Jun 21, 2018

X-PLUG / Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Python · 271 stars · 11 forks · Updated Jan 8, 2024

Lichang-Chen / InstructZero

Official implementation of InstructZero, the first framework to optimize bad prompts for ChatGPT (API LLMs) and obtain good prompts.

Python · 165 stars · 14 forks · Updated Nov 17, 2023

SysCV / sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Python · 3,549 stars · 210 forks · Updated Jul 7, 2024

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook · 9,725 stars · 799 forks · Updated Jun 10, 2024

haoosz / ViCo

Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation"

Jupyter Notebook · 235 stars · 14 forks · Updated Mar 20, 2024