-
This repo is a tool package of frequently used NLP tools for personal use.
-
Build
- python setup.py bdist_wheel
-
Code tree
- test
- some test scripts
- nlptools/utils
some common utils
- nlptools/utils/utils.py
- common tools
- nlptools/utils/dataread.py
- table read from mysql, xls, csv, etc.
- nlptools/utils/config.py
- parse yaml config
- nlptools/utils/logger.py
- create logger
- nlptools/utils/odsread.py
- parse libreoffice ods file
- nlptools/utils/utils.py
- nlptools/text
some nlp tools
- nlptools/text/docsim.py
- calculate similarity between documents
- nlptools/text/embedding.py
- read word2vec from several data sources redis/dynamodb/file/api
- nlptools/text/tokenizer.py
- tokenizer wrapper, support jieba/mecab/ltp/corenlp/spacy/simple/bert_tokenizer
- nlptools/text/ner.py
- ner training and predict class, also included keyword/regex entity extraction
- nlptools/text/translate.py
- google api for translate
- nlptools/text/vocab.py
- dictionary class, word/character <-> id, vec, bow
- nlptools/text/ngrams.py
- NGrams dictionary
- nlptools/text/topicmodel.py
- lsi, lda model
- nlptools/text/acorasearch.py
- search using acora, a keyword search engine
- nlptools/text/annoysearch.py
- use annoy for fast vector based search
- nlptools/text/synonyms.py
- get synonyms via word embedding
- nlptools/text/vectfidf.py
- modified TF-IDF algorithm with wordvector
- nlptools/text/docsim.py
- nlptools/zoo
some universal models
- nlptools/zoo/classifier
- simple document classifier
- nlptools/zoo/demodata
- some simple demo data
- nlptools/zoo/encoders
- encoders and decoders
- nlptools/zoo/classifier
- test
-
Notifications
You must be signed in to change notification settings - Fork 1
This repo is a tool package of frequently used NLP tools for personal use.
License
zhupengjia/nlptools
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
This repo is a tool package of frequently used NLP tools for personal use.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published