Skip to content
View yzyouzhang's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Organizations

@AirLabUR
Block or Report

Block or report yzyouzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 58,876 10,627 Updated Jul 6, 2024

RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024)

Python 123 11 Updated Jul 4, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

Python 152 7 Updated Jul 5, 2024

A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Python 35 3 Updated Jul 1, 2024
Python 21 Updated Jul 8, 2024

A list of tools, papers and code related to Deepfake Detection.

775 82 Updated Jun 15, 2024

An 1D optimal transport inspired loss function in the spectral domain. Can be used for improving frequency localization/estimation in differentiable digital signal processing. Experiments from pape…

Python 15 Updated Apr 23, 2024

A large synthetic dataset of spatial audio with multiple labels

79 5 Updated Oct 25, 2023

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Jupyter Notebook 400 33 Updated Jan 19, 2024

An Audio Language model for Audio Tasks

Python 267 14 Updated Apr 19, 2024
Jupyter Notebook 1 Updated Jun 10, 2023

Solve forward and inverse problems related to partial differential equations using finite basis physics-informed neural networks (FBPINNs)

Python 256 57 Updated Jun 20, 2024

"Brian Hears" auditory modelling toolbox for the brian2 simulator

Python 25 3 Updated Jan 26, 2021
Python 2 Updated Jun 15, 2024

Scaling Out-of-Distribution Detection for Multiple Modalities

Python 18 Updated Jul 2, 2024

Deep learning for audio processing

Jupyter Notebook 545 95 Updated Jan 9, 2024

DeepFake Detection using Siamese Neural Networks

Python 3 2 Updated Jun 9, 2020

Official release of StyleTalk dataset.

51 2 Updated Jul 1, 2024

openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.

Python 48,674 8,826 Updated Jul 10, 2024

The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.

MATLAB 11 2 Updated Jul 4, 2024

Audio Diarization Annotation tool

JavaScript 21 6 Updated Nov 8, 2019

Inspect: A framework for large language model evaluations

Python 409 43 Updated Jul 10, 2024

Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)

Python 134 6 Updated Jul 5, 2024

Community list of startups working with AI in audio and music technology

1,498 131 Updated Jul 3, 2024
6 1 Updated May 28, 2024

A list of papers for child ASR

23 3 Updated Apr 2, 2024
Python 13 Updated Jan 10, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 34,039 3,552 Updated Jun 11, 2024

Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.

Python 10 1 Updated Nov 7, 2023

Spherical CNNs

Python 940 176 Updated Oct 20, 2021
Next