Skip to content
View sophiayk20's full-sized avatar

Block or report sophiayk20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)

Python 26 3 Updated Jun 28, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 34,318 4,157 Updated Aug 16, 2024

A PyTorch-based Speech Toolkit

Python 8,671 1,372 Updated Oct 1, 2024

[CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Python 23 2 Updated Sep 6, 2024

A curated list of awesome voice conversion, projects and communities.

175 12 Updated Sep 4, 2024

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,163 113 Updated Apr 24, 2024

[TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation

Python 20 5 Updated Sep 6, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,810 1,054 Updated Aug 15, 2024

Prosodic Speech Segmentation with Transformers

Jupyter Notebook 22 5 Updated Feb 25, 2024

🌈 Github colors for all the languages

Python 707 163 Updated Sep 30, 2024

Minimal, responsive Jekyll theme for hackers

HTML 725 197 Updated Jul 31, 2024

User-friendly WebUI for AI (Formerly Ollama WebUI)

Svelte 41,337 4,881 Updated Oct 5, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,011 7,244 Updated Oct 3, 2024

C++ associative containers

C++ 1,551 258 Updated Nov 30, 2021

A neural word aligner based on multilingual BERT

Python 323 47 Updated Mar 10, 2022

MessagePack implementation for C and C++ / msgpack.org[C/C++]

3,008 875 Updated Aug 17, 2024

Universal cross-platform tokenizers binding to HF and sentencepiece

C++ 255 58 Updated Aug 12, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,136 1,167 Updated Oct 1, 2024

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,877 496 Updated Feb 14, 2023

This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.

Jupyter Notebook 52 13 Updated Jul 30, 2024

Simple, fast unsupervised word aligner

C++ 734 158 Updated Jul 19, 2022

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,056 1,379 Updated Jun 12, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,714 2,447 Updated Oct 5, 2024