Lists (1)
Sort Name ascending (A-Z)
Stars
📕 Clarity in the current fast-paced mess of Open Source innovation
🗺️ Data Cleaning and Textual Data Visualization 🗺️
Robust recipes to align language models with human and AI preferences
SciRepEval benchmark training and evaluation scripts
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
Tools to scrape publication metadata from pubmed, arxiv, medrxiv and chemrxiv.
CMU multilingual speech repository
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
Code for BERT-based bagging-stacking for multi-topic classification
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Large, modern dataset for speech recognition
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/TV shows transcripts, Youtube Video transcripts, Online sour…
Biological applications of knowledge graph embedding methods
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
Pytorch-Named-Entity-Recognition-with-transformers
🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…
Minimal example utilizing fastapi and celery with RabbitMQ for task queue, Redis for celery backend and flower for monitoring the celery tasks.
An Implementation of Encoder-Decoder model with global attention mechanism.