Skip to content
View HK007-0425's full-sized avatar

Block or report HK007-0425

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository of Slide-Transformer (CVPR2023)

Python 157 6 Updated Aug 27, 2024

Pytorch implementation for Image Captioning.

Python 1 Updated Jun 3, 2024
Python 115 12 Updated Feb 7, 2023

A Library for Advanced Deep Time Series Models.

Python 6,393 1,018 Updated Sep 19, 2024

The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.

39 Updated Apr 2, 2024

[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?

Python 34 1 Updated Jun 9, 2024

A PyTorch reimplementation of bottom-up-attention models

Jupyter Notebook 291 75 Updated Apr 7, 2022

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]

Python 269 52 Updated Jul 27, 2021

Grid features pre-training code for visual question answering

Python 268 48 Updated Sep 17, 2021

Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)

Python 119 27 Updated Dec 17, 2022

Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).

Jupyter Notebook 193 31 Updated Jun 8, 2022

Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]

Jupyter Notebook 64 12 Updated Jun 1, 2024

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Python 515 136 Updated Dec 21, 2022

Torch implementation of ResNet from http://arxiv.org/abs/1512.03385 and training scripts

Lua 2,290 664 Updated Aug 24, 2022

Deep Residual Networks with 1K Layers

Lua 901 249 Updated May 24, 2017

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,481 934 Updated May 25, 2024

Vision-Language Pre-training for Image Captioning and Question Answering

Python 411 62 Updated Jan 18, 2022

Simple image captioning model

Jupyter Notebook 1,287 214 Updated Jun 9, 2024

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Python 993 278 Updated Oct 5, 2023

A curated list of image captioning and related area resources. :-)

1,058 185 Updated Mar 28, 2023

Efficient computing methods developed by Huawei Noah's Ark Lab

Jupyter Notebook 1,182 207 Updated Jul 6, 2024

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Python 2,749 711 Updated Jul 28, 2022

Code for paper "Attention on Attention for Image Captioning". ICCV 2019

Python 325 62 Updated May 2, 2021

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 54,055 5,585 Updated Aug 24, 2024

🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解

Java 30,959 6,756 Updated Sep 21, 2024