Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned Sound (VAS) dataset.

Python 49 12 Updated Dec 15, 2020

akoepke / audio-retrieval-benchmark

Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".

Python 45 2 Updated Jul 22, 2022

ExplainableML / TCAF-GZSL

This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"

Python 24 Updated Nov 29, 2022

speedyseal / audiosetdl

Scripts for download AudioSet

Jupyter Notebook 66 45 Updated Nov 7, 2017

hhc1997 / vggsound_download

download the vggsound dataset

Shell 18 2 Updated Feb 22, 2022

anyc / avcut

Frame-accurate video cutting with only small quality loss

C 113 14 Updated Feb 2, 2024

ozmartian / vidcutter

A modern yet simple multi-platform video cutter and joiner.

Python 1,790 135 Updated Aug 31, 2024

mifi / lossless-cut

The swiss army knife of lossless video/audio editing

TypeScript 26,781 1,281 Updated Oct 4, 2024

kyuyeonpooh / objects-that-sound

The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.

Python 32 4 Updated Jan 29, 2024

YapengTian / AVE-ECCV18

Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018

Python 170 31 Updated Apr 3, 2021

PaddlePaddle / PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video ta…

Python 1,505 376 Updated Jun 24, 2024

v-iashin / SparseSync

Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)

Python 50 9 Updated Jan 29, 2024

leandromoreira / digital_video_introduction

A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸

Jupyter Notebook 15,438 1,325 Updated Sep 7, 2023

ZehaoYao ZehaoYao

Lists (13)

Audio-Visual corrspondence

Audio-Visual event detection

Audio-Visual generation

Audio-Visual-models

Audio-Visual ZSL

classfication models

Cross modal retrieval

Datasets

Datasets download

Hierarchical learning

learning resource

Python-library

Tools

Stars