Skip to content
View ZehaoYao's full-sized avatar

Block or report ZehaoYao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Code for Discriminative Sounding Objects Localization (NeurIPS 2020)

Python 57 9 Updated Jan 19, 2022

A dataset for Audio-Visual Sound Event Detection in Movies

Python 26 1 Updated Jan 23, 2023

Implementation for Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification

Python 51 3 Updated Mar 24, 2022

CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.

Python 262 24 Updated Apr 24, 2023

Localizing Visual Sounds the Hard Way

Python 76 15 Updated Jul 6, 2022
Python 24 4 Updated Oct 31, 2023

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)

Python 54 4 Updated Feb 12, 2024

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

HTML 478 68 Updated Jan 27, 2024

The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.

Python 29 3 Updated Mar 4, 2022

Codebase for ECCV18 "The Sound of Pixels"

Python 370 74 Updated Apr 25, 2022

The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"

Python 17 1 Updated May 18, 2023

非正常人类研究中心 存储中国大陆各类非正常女性所为的非正常案件,欢迎补充

1,522 105 Updated Sep 29, 2024

必应每日超清壁纸(4K) Bing Daily Wallpaper (4K)

Java 1,958 321 Updated Oct 5, 2024
Python 35 1 Updated Feb 21, 2023

Video datasets

1,149 91 Updated Mar 8, 2023

This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language"

Python 33 1 Updated Nov 29, 2022

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Jupyter Notebook 341 38 Updated Jul 12, 2024

Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned Sound (VAS) dataset.

Python 49 12 Updated Dec 15, 2020

Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".

Python 45 2 Updated Jul 22, 2022

This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"

Python 24 Updated Nov 29, 2022

Scripts for download AudioSet

Jupyter Notebook 66 45 Updated Nov 7, 2017

download the vggsound dataset

Shell 18 2 Updated Feb 22, 2022

Frame-accurate video cutting with only small quality loss

C 113 14 Updated Feb 2, 2024

A modern yet simple multi-platform video cutter and joiner.

Python 1,790 135 Updated Aug 31, 2024

The swiss army knife of lossless video/audio editing

TypeScript 26,781 1,281 Updated Oct 4, 2024

The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.

Python 32 4 Updated Jan 29, 2024

Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018

Python 170 31 Updated Apr 3, 2021

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video ta…

Python 1,505 376 Updated Jun 24, 2024

Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)

Python 50 9 Updated Jan 29, 2024

A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸

Jupyter Notebook 15,438 1,325 Updated Sep 7, 2023
Next