Skip to content
View The-Sad-Zewalian's full-sized avatar
💭
(O ω O)
💭
(O ω O)
Block or Report

Block or report The-Sad-Zewalian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Bring portraits to life!

Python 5,162 416 Updated Jul 10, 2024

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 885 54 Updated Jun 27, 2024

Image augmentation for machine learning experiments.

Python 14,263 2,422 Updated Apr 6, 2024

Flexible Python configuration system. The last one you will ever need.

Python 1,869 98 Updated May 30, 2024

Matplotlib styles for scientific plotting

Python 6,739 687 Updated Jun 3, 2024

List of Computer Science courses with video lectures.

65,967 9,021 Updated Jul 7, 2024

Web Application Framework in JavaScript and Objective-J

Objective-J 2,206 334 Updated Jul 5, 2024

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,757 183 Updated Jul 9, 2024

A python library for voice activity detection (VAD) for speech/non-speech segmentation.

Python 78 12 Updated Sep 7, 2022

The Voxseg implementation in PyTorch. Voxseg is a python library for voice activity detection (VAD) for speech/non-speech segmentation.

Python 9 4 Updated Oct 18, 2023

PyTorch implementations of Generative Adversarial Networks.

Python 16,006 4,031 Updated Jun 18, 2024

official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

155 8 Updated Oct 6, 2023

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,375 419 Updated Jul 8, 2024

Generating room impulse responses

C++ 409 146 Updated Dec 20, 2023

Easy to use Beamformers for multi-channel speech separation/enhancement

Python 169 46 Updated Jan 26, 2021

This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset generation to the model training

Jupyter Notebook 10 Updated Sep 1, 2022

MultiSV: scripts for data preparation

Shell 23 3 Updated Jun 12, 2024

Conformer-based Metric GAN for speech enhancement

Python 282 55 Updated May 3, 2024

Python bindings for llama.cpp

Python 7,171 850 Updated Jul 9, 2024

Fast and accurate spell correction library

Python 74 20 Updated Mar 23, 2022

Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/

Python 687 101 Updated Mar 9, 2024

Real-Time De-noising and De-reverbing with Tiny Recurrent UNet

Python 24 12 Updated Jun 7, 2023

📷🔦💭 A 3D Scanner using Laser Structured Light, written in Python using OpenCV and NumPy.

Python 78 25 Updated Dec 6, 2020

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 1 Updated Mar 6, 2023

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 21,941 5,431 Updated Jun 11, 2024

A python package to simulate typographical errors.

Python 30 4 Updated Dec 12, 2023

DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping method.

Python 49 14 Updated Apr 2, 2022

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 29,815 6,307 Updated Jul 4, 2024

SpotX patcher used for patching the desktop version of Spotify

PowerShell 12,414 704 Updated Jul 10, 2024

⚡ Finetune Wa2vec 2.0 For Speech Recognition

Python 102 21 Updated Nov 7, 2023
Next