Skip to content
View kelechi-c's full-sized avatar
👾
conjuring models
👾
conjuring models

Highlights

  • Pro

Block or report kelechi-c

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Implementation of MagViT2 Tokenizer in Pytorch

Python 541 35 Updated Jul 23, 2024

SOTA Text-to-music (TTM) Generation (OpenMusic)

Python 372 34 Updated Sep 26, 2024

The official Meta Llama 3 GitHub site

Python 26,400 2,984 Updated Aug 12, 2024

Credit card fraud detection through logistic regression, k-means, and deep learning.

Jupyter Notebook 215 115 Updated Jan 31, 2018

WIP Pytorch code for stably training single-step, mode-dropping, deterministic autoencoders

Jupyter Notebook 20 1 Updated May 5, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 433 39 Updated Jun 9, 2024

Text to Image Latent Diffusion using a Transformer core

Python 128 14 Updated Aug 29, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 9,164 554 Updated Sep 1, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 4,032 302 Updated Jul 16, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,265 429 Updated Sep 30, 2024
Python 4 1 Updated Aug 31, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,565 2,215 Updated Jul 29, 2024

LLM inference in C/C++

C++ 65,662 9,421 Updated Sep 30, 2024

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 69,658 7,623 Updated Sep 30, 2024

Joint speech-language model - respond directly to audio!

Python 342 31 Updated Jul 1, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,401 221 Updated Jun 2, 2024

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,210 432 Updated Jan 5, 2023

A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.

Python 81 8 Updated Jun 12, 2023

A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)

Python 152 15 Updated Jun 6, 2024

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

Python 264 46 Updated Oct 8, 2021

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 402 16 Updated Sep 25, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,445 306 Updated Jan 4, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,119 66 Updated Aug 13, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,079 120 Updated Sep 24, 2024

Speech, Language, Audio, Music Processing with Large Language Model

Python 512 43 Updated Sep 29, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 81,994 7,585 Updated Sep 30, 2024

Open weights LLM from Google DeepMind.

Python 2,413 306 Updated Sep 20, 2024

A fast and memory-efficient libarary for sparse transformer with varying token numbers (e.g., 3D point cloud).

Python 153 11 Updated Sep 6, 2023

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,513 192 Updated Aug 12, 2020

Collection of AWESOME vision-language models for vision tasks

2,275 203 Updated Aug 29, 2024
Next