- General
- random insertion, deletion, word, sentence shuffling
- Replacing words with synonyms
- Replace the words from dicitionary of the same label
- Perturbations (letter, word, or sentence level)
- Language model
- Back translation
- Round-trip translation
- Leverage External Data
- Using external data derived from Wikipedia. linking wikipedia articles to arbitrary input text. The idea is that if the input text were on Wikipedia, it would have links to other Wikipedia articles (that are semantically related and provide additional info).
- break the input text into n-grams
- check whether each n-gram exists as a wikipedia article to create a set of ‘candidate links’
- prune the candidate links by computing the similarity of the input text and the abstract of each candidate
- Using external data derived from Wikipedia. linking wikipedia articles to arbitrary input text. The idea is that if the input text were on Wikipedia, it would have links to other Wikipedia articles (that are semantically related and provide additional info).
- Conversational Systems
- Reading Comprehension
forked from quincyliang/nlp-data-augmentation
-
Notifications
You must be signed in to change notification settings - Fork 0
mp432/nlp-data-augmentation
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Data Augmentation for NLP. NLP数据增强
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published