sophiayk20

Follow

Sophia Yeeun Kang sophiayk20

Follow

cs undergrad @ yale. interests: spoken language & natural language processing

1 follower · 0 following

Yale University
Seongnam, South Korea
10:28 (UTC +09:00)
https://sophiayk20.github.io
https://huggingface.co/sophiayk20

Achievements

Achievements

Lists (1)

Sort

🔮 Future ideas

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

0nutation / DUB

Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)

Python 26 3 Updated Jun 28, 2023

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 34,318 4,157 Updated Aug 16, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 8,671 1,372 Updated Oct 1, 2024

choijeongsoo / av2av

[CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Python 23 2 Updated Sep 6, 2024

JeffC0628 / awesome-voice-conversion

A curated list of awesome voice conversion, projects and communities.

175 12 Updated Sep 4, 2024

microsoft / SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,163 113 Updated Apr 24, 2024

choijeongsoo / utut

[TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation

Python 20 5 Updated Sep 6, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,810 1,054 Updated Aug 15, 2024

Nathan-Roll1 / PSST

Prosodic Speech Segmentation with Transformers

Jupyter Notebook 22 5 Updated Feb 25, 2024

ozh / github-colors

🌈 Github colors for all the languages

Python 707 163 Updated Sep 30, 2024

yous / whiteglass

Minimal, responsive Jekyll theme for hackers

HTML 725 197 Updated Jul 31, 2024

open-webui / open-webui

User-friendly WebUI for AI (Formerly Ollama WebUI)

Svelte 41,337 4,881 Updated Oct 5, 2024

ollama / ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,011 7,244 Updated Oct 3, 2024

sparsehash / sparsehash

C++ associative containers

C++ 1,551 258 Updated Nov 30, 2021

neulab / awesome-align

A neural word aligner based on multilingual BERT

Python 323 47 Updated Mar 10, 2022

msgpack / msgpack-c

MessagePack implementation for C and C++ / msgpack.org[C/C++]

3,008 875 Updated Aug 17, 2024

mlc-ai / tokenizers-cpp

Universal cross-platform tokenizers binding to HF and sentencepiece

C++ 255 58 Updated Aug 12, 2024

google / sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,136 1,167 Updated Oct 1, 2024

facebookresearch / XLM

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,877 496 Updated Feb 14, 2023

microsoft / CodeMixed-Text-Generator

This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.

Jupyter Notebook 52 13 Updated Jul 30, 2024

clab / fast_align

Simple, fast unsupervised word aligner

C++ 734 158 Updated Jul 19, 2022

NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,056 1,379 Updated Jun 12, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,714 2,447 Updated Oct 5, 2024