A general representation model across vision, audio, and language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 885 54 Updated Jun 27, 2024

aleju / imgaug

Image augmentation for machine learning experiments.

Python 14,263 2,422 Updated Apr 6, 2024

omry / omegaconf

Flexible Python configuration system. The last one you will ever need.

Python 1,869 98 Updated May 30, 2024

garrettj403 / SciencePlots

Matplotlib styles for scientific plotting

Python 6,739 687 Updated Jun 3, 2024

Developer-Y / cs-video-courses

List of Computer Science courses with video lectures.

65,967 9,021 Updated Jul 7, 2024

cappuccino / cappuccino

Web Application Framework in JavaScript and Objective-J

Objective-J 2,206 334 Updated Jul 5, 2024

iver56 / audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,757 183 Updated Jul 9, 2024

NickWilkinson37 / voxseg

A Python library for voice activity detection (VAD) for speech/non-speech segmentation.

Python 78 12 Updated Sep 7, 2022

rafaelgreca / voxseg-pytorch

The Voxseg implementation in PyTorch. Voxseg is a Python library for voice activity detection (VAD) for speech/non-speech segmentation.

Python 9 4 Updated Oct 18, 2023

eriklindernoren / PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Python 16,006 4,031 Updated Jun 18, 2024

HL-hanlin / VideoDirectorGPT

Official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

155 8 Updated Oct 6, 2023

LCAV / pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,375 419 Updated Jul 8, 2024

ehabets / RIR-Generator

Generating room impulse responses

C++ 409 146 Updated Dec 20, 2023

Enny1991 / beamformers

Easy to use Beamformers for multi-channel speech separation/enhancement

Python 169 46 Updated Jan 26, 2021

furkanarius / Multichannel-Speech-Enhancement-with-Deep-Neural-Networks

This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It covers the problem from dataset generation through model training.

Jupyter Notebook 10 Updated Sep 1, 2022

BUTSpeechFIT / MultiSV

MultiSV: scripts for data preparation

Shell 23 3 Updated Jun 12, 2024

ruizhecao96 / CMGAN

Conformer-based Metric GAN for speech enhancement

Python 282 55 Updated May 3, 2024

abetlen / llama-cpp-python

Python bindings for llama.cpp

Python 7,171 850 Updated Jul 9, 2024

hellohaptik / spello

Fast and accurate spell correction library

Python 74 20 Updated Mar 23, 2022

barrust / pyspellchecker

Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/

Python 687 101 Updated Mar 9, 2024

Okrio / tinyrecurrentunet

Real-Time De-noising and De-reverbing with Tiny Recurrent UNet

Python 24 12 Updated Jun 7, 2023

giulioz / laser-scanning

📷🔦💭 A 3D Scanner using Laser Structured Light, written in Python using OpenCV and NumPy.

Python 78 25 Updated Dec 6, 2020

Valdiolus / nanoGPT

Forked from karpathy/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 1 Updated Mar 6, 2023

KnowsNothing The-Sad-Zewalian

Stars

KwaiVGI / LivePortrait

OFA-Sys / ONE-PEACE