Skip to content
View Zacchaeus00's full-sized avatar
  • Pittsburgh
  • 07:36 (UTC -04:00)

Block or report Zacchaeus00

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

nlp

19 repositories
Python 2,648 301 Updated Oct 7, 2024

a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini

Jupyter Notebook 339 100 Updated Dec 21, 2023

Must-read papers on prompt-based tuning for pre-trained language models.

4,059 377 Updated Jul 17, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 68,945 8,113 Updated Sep 30, 2024

Pytorch implementation and extension of "DocUnet: Document Image Unwarping via A Stacked U-Net"

Python 104 18 Updated Jul 2, 2020

Unsupervised text tokenizer focused on computational efficiency

C++ 953 101 Updated Mar 29, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 54,538 5,627 Updated Aug 24, 2024

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,133 753 Updated Sep 20, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,143 1,167 Updated Oct 1, 2024

Minimalist BERT implementation assignment for CS11-711

Python 76 78 Updated Sep 26, 2022

An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)

Python 252 36 Updated Jun 12, 2023

Long Range Arena for Benchmarking Efficient Transformers

Python 716 78 Updated Dec 16, 2023

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning

Python 154 21 Updated Feb 12, 2024

Transformer based on a variant of attention that is linear complexity in respect to sequence length

Python 678 66 Updated May 5, 2024

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

Python 1,622 282 Updated Jun 12, 2023

Some notebooks for NLP

Jupyter Notebook 186 46 Updated Nov 2, 2023

The implementation of DeBERTa

Python 1,975 224 Updated Sep 29, 2023

MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf

Python 286 33 Updated Sep 11, 2021

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,375 512 Updated Jul 2, 2024