HK007-0425

Follow

HK007-0425

Follow

Stars

LeapLabTHU / Slide-Transformer

Official repository of Slide-Transformer (CVPR2023)

Python 157 6 Updated Aug 27, 2024

232525 / ImageCaptioning_Verbose

Pytorch implementation for Image Captioning.

Python 1 Updated Jun 3, 2024

BorealisAI / scaleformer

Python 115 12 Updated Feb 7, 2023

thuml / Time-Series-Library

A Library for Advanced Deep Time Series Models.

Python 6,393 1,018 Updated Sep 19, 2024

decisionintelligence / pathformer

Python 129 16 Updated Aug 14, 2024

KimManjin / StructViT

The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.

39 Updated Apr 2, 2024

wangyuchi369 / LaDiC

[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?

Python 34 1 Updated Jun 9, 2024

MILVLG / bottom-up-attention.pytorch

A PyTorch reimplementation of bottom-up-attention models

Jupyter Notebook 291 75 Updated Apr 7, 2022

JDAI-CV / image-captioning

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]

Python 269 52 Updated Jul 27, 2021

facebookresearch / grid-feats-vqa

Grid features pre-training code for visual question answering

Python 268 48 Updated Sep 17, 2021

zhangxuying1004 / RSTNet

Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)

Python 119 27 Updated Dec 17, 2022

luo3300612 / image-captioning-DLCT

Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).

Jupyter Notebook 193 31 Updated Jun 8, 2022

232525 / PureT

Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]

Jupyter Notebook 64 12 Updated Jun 1, 2024

aimagelab / meshed-memory-transformer

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Python 515 136 Updated Dec 21, 2022

facebookarchive / fb.resnet.torch

Torch implementation of ResNet from http://arxiv.org/abs/1512.03385 and training scripts

Lua 2,290 664 Updated Aug 24, 2022

KaimingHe / resnet-1k-layers

Deep Residual Networks with 1K Layers

Lua 901 249 Updated May 24, 2017

facebookresearch / mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,481 934 Updated May 25, 2024

LuoweiZhou / VLP

Vision-Language Pre-training for Image Captioning and Question Answering

Python 411 62 Updated Jan 18, 2022

rmokady / CLIP_prefix_caption

Simple image captioning model

Jupyter Notebook 1,287 214 Updated Jun 9, 2024

ruotianluo / self-critical.pytorch

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Python 993 278 Updated Oct 5, 2023

zhjohnchan / awesome-image-captioning

A curated list of image captioning and related area resources. :-)

1,058 185 Updated Mar 28, 2023

huawei-noah / Efficient-Computing

Efficient computing methods developed by Huawei Noah's Ark Lab

Jupyter Notebook 1,182 207 Updated Jul 6, 2024

sgrvinod / a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Python 2,749 711 Updated Jul 28, 2022

husthuaan / AoANet

Code for paper "Attention on Attention for Image Captioning". ICCV 2019

Python 325 62 Updated May 2, 2021

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 54,055 5,585 Updated Aug 24, 2024

doocs / leetcode

🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer（第 2 版）》、《程序员面试金典（第 6 版）》题解

Java 30,959 6,756 Updated Sep 21, 2024