Lists (11)
Sort Name ascending (A-Z)
Stars
Implementation of the rainflow-counting algorythm in Python
Rainflow counting in Python using the 4 point method
Hybrid Pointer Networks for Traveling Salesman Problems Optimization
多模态情感分析——基于BERT+ResNet的多种融合方法
PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.
Finding the genre of a song with Deep Learning
Code for YouTube series: Deep Learning for Audio Classification
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
Understanding emotions from audio files using neural networks and multiple datasets.
Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".
This project focuses on the classification of animal sounds using deep learning. The core idea is to utilize audio processing techniques and a fine-tuned version of the hubert-base-ls960 model to a…
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
MaxVIT implementation(MaxViT: Multi-Axis Vision Transformer) This is an unofficial implementation. https://arxiv.org/abs/2204.01697
PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
[AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"
leoxiaobin / CvT
Forked from microsoft/CvTThis is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
Reading list for research topics in multimodal machine learning
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
A PyTorch implementation of EfficientNet
Implementation of Convolutional enhanced image Transformer