OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 6,745 1,039 Updated Mar 15, 2024

NVIDIA / flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Jupyter Notebook 888 176 Updated Jul 6, 2023

e3donline / ToolChanger

STPs / STLs / DXFs / PDFs

298 61 Updated Mar 6, 2023

NVIDIA / OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Python 1,539 372 Updated May 11, 2021

speechmatics / legacy-v1-python-example

Example script (supported) to help you integrate with our SaaS v1 API

Python 14 5 Updated Apr 15, 2020

chmodsss / noizeus_corpora

Speech corpora for the speech recognition evaluation system

17 14 Updated Mar 20, 2018

abhilasha23 / StoryTelling

A neural network based StoryTeller that outputs a short story from an input image

Python 13 Updated Dec 15, 2018

taki0112 / SPADE-Tensorflow

Simple Tensorflow implementation of "Semantic Image Synthesis with Spatially-Adaptive Normalization" a.k.a. GauGAN, SPADE (CVPR 2019 Oral)

Python 364 67 Updated Jun 6, 2022

facebookresearch / pytorch_GAN_zoo

A mix of GAN implementations including progressive growing

Python 1,607 268 Updated Oct 12, 2021

mozilla / TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 9,033 1,224 Updated Nov 9, 2023

vacancy / SceneGraphParser

A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).

Python 519 54 Updated Jan 23, 2024

NVlabs / planercnn

PlaneRCNN detects and reconstructs piece-wise planar surfaces from a single RGB image

Python 545 121 Updated Oct 9, 2022

scaelles / DEXTR-PyTorch

Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr

Python 846 153 Updated Sep 4, 2020

AdolfVonKleist / Phonetisaurus

Phonetisaurus G2P

Shell 440 122 Updated Jun 1, 2024

daniilidis-group / neural_renderer

A PyTorch port of the Neural 3D Mesh Renderer

Python 1,122 248 Updated Mar 17, 2022

TimoBolkart / voca

This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.

Python 1,123 275 Updated Jul 9, 2024

zszyellow / WER-in-python

This program calculates the word error rate of hypothesis in ASR and print the aligned result.

Python 150 77 Updated Jan 30, 2020

Jakobovski / free-spoken-digit-dataset

A free audio dataset of spoken digits. An audio version of MNIST.

Python 610 250 Updated May 2, 2024

Mythra / text-to-ssml

Converts your text to AWS Polly's SSML.

Rust 11 2 Updated Aug 28, 2021

Franck-Dernoncourt / ASR_benchmark

Program to benchmark various speech recognition APIs

Python 79 18 Updated Sep 6, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Andrew Sofie andrewsofie

Achievements

Achievements

Block or report andrewsofie

Stars

CompVis / taming-transformers

CompVis / stable-diffusion

langchain-ai / langchain

fudan-generative-vision / hallo

cracker0dks / whiteboard

dendry / dendry

manexagirrezabal / erato

AntonioND / ucity

cpacker / MemGPT

turtlesoupy / this-word-does-not-exist

open-mmlab / mmagic