View cwu307's full-sized avatar
  • Netflix, Inc.
  • Los Gatos


Reference Software for IAMF

C 36 12 Updated Sep 3, 2024

HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.

Python 9 2 Updated Jan 25, 2024

Perceived Music Quality Dataset

Python 8 3 Updated Jul 1, 2024

Code accompanying the paper "Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning" (CVPR 2024)

Python 2 Updated Sep 17, 2024

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system in 275+ supported cars.

Python 49,627 9,019 Updated Oct 5, 2024

Learning audio concepts from natural language supervision

Python 465 35 Updated Sep 18, 2024

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python 1,109 158 Updated Aug 19, 2024

ITU-T Rec. P.1203 Implementation

Python 98 27 Updated Sep 20, 2024

GStreamer open-source multimedia framework

C 2,327 578 Updated Oct 4, 2024

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

C 525 98 Updated Sep 5, 2024

Revisiting Singing Voice Detection: a Quantitative Review and the Future Outlook

Python 66 9 Updated Nov 21, 2022

Audiogen Codec

Python 118 11 Updated Jul 9, 2024

An efficient loudness meter with support for anchoring, median, and multithreading

C++ 7 Updated Aug 12, 2024

A pytorch package for non-negative matrix factorization.

Python 225 24 Updated Jul 25, 2024

Differentiable dynamic range controller in PyTorch.

Python 42 1 Updated Sep 11, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 68,814 8,106 Updated Sep 30, 2024

Machine Learning applied to sound

Jupyter Notebook 237 48 Updated May 11, 2024
Jupyter Notebook 135 5 Updated Sep 26, 2024

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.

Python 350 30 Updated Jan 25, 2024

Self-supervised learning for fast pitch estimation

Python 175 15 Updated Oct 2, 2024

Code for the ICASSP 2024 paper "BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer". An online beat tracking system based on a streaming Transformer.

Python 32 1 Updated Sep 11, 2024

End-to-End Speech Processing Toolkit

Python 8,353 2,167 Updated Oct 4, 2024

Code for the paper "Soft Dynamic Time Warping With Variable Step Weights", ICASSP 2024

Jupyter Notebook 3 Updated Jan 4, 2024

A simple library for Fréchet Audio Distance (FAD) calculation

Python 139 20 Updated Sep 6, 2024

PAM is a no-reference audio quality metric for audio generation tasks

Python 42 5 Updated Jul 19, 2024
Python 162 5 Updated Feb 14, 2024

VBAP & Define Loudspeaker from Ville Pulkki - updated and adapted by Christophe B.

HTML 5 Updated Jun 22, 2022

AudioLDM training, finetuning, evaluation and inference.

Python 196 38 Updated Jun 2, 2024

Temporal service

Go 11,569 824 Updated Oct 5, 2024

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 141 16 Updated Jul 25, 2024