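- Implements a naive Bayes classifier for news text classification;
- Uses the TF-IDF algorithm for feature extraction and builds a feature library from the results;
- Relies mainly on the nltk natural language processing toolkit;
- The dataset consists of news articles in various categories, collected by a crawler from foreign news websites;
- src-02 is the news classifier; src is the code provided with the book 《机器学习实战》 (Machine Learning in Action);
- The material directory holds the selected news articles for each category, used to build the feature library as well as the training and test sets;
- The test directory holds the classifier's training and test sets;
- The features directory holds the feature library extracted for each text category and the saved trained model.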
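
The pipeline described above (TF-IDF feature selection followed by naive Bayes classification) can be illustrated with a minimal sketch. This is not the code in src-02. It uses only the Python standard library rather than nltk, and the names `tokenize`, `top_tfidf_terms`, and `NaiveBayes`, the toy documents, and the choice to compute IDF across classes rather than across individual documents are all assumptions made for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and keep alphabetic runs (a simple stand-in for nltk tokenization)."""
    return re.findall(r"[a-z]+", text.lower())


def top_tfidf_terms(docs_by_class, k):
    """Return, for each class, the k terms with the highest TF-IDF score.

    Assumption: all documents of a class are merged into one pseudo-document,
    so IDF reflects how many classes a term appears in.
    """
    class_counts = {
        label: Counter(t for doc in docs for t in tokenize(doc))
        for label, docs in docs_by_class.items()
    }
    n_classes = len(class_counts)
    # Number of classes in which each term occurs at least once.
    class_freq = Counter(t for counts in class_counts.values() for t in counts)
    features = {}
    for label, counts in class_counts.items():
        total = sum(counts.values())
        scores = {
            t: (c / total) * math.log(n_classes / class_freq[t])
            for t, c in counts.items()
        }
        # Sort by descending score, breaking ties alphabetically for determinism.
        features[label] = sorted(scores, key=lambda t: (-scores[t], t))[:k]
    return features


class NaiveBayes:
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""

    def __init__(self, vocabulary):
        self.vocab = set(vocabulary)
        self.class_docs = Counter()
        self.term_counts = defaultdict(Counter)

    def fit(self, labeled_docs):
        for text, label in labeled_docs:
            self.class_docs[label] += 1
            self.term_counts[label].update(
                t for t in tokenize(text) if t in self.vocab
            )
        return self

    def predict(self, text):
        tokens = [t for t in tokenize(text) if t in self.vocab]
        n_docs = sum(self.class_docs.values())
        best_label, best_logp = None, -math.inf
        for label, n in self.class_docs.items():
            counts = self.term_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # log P(class) + sum of log P(term | class), smoothed.
            logp = math.log(n / n_docs) + sum(
                math.log((counts[t] + 1) / denom) for t in tokens
            )
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label


# Toy corpus standing in for the crawled news articles (hypothetical data).
train = {
    "sports": [
        "The team won the match after a late goal",
        "The coach praised the team and the striker",
    ],
    "tech": [
        "The company released a new phone and a laptop",
        "The startup shipped a software update for the phone",
    ],
}

features = top_tfidf_terms(train, k=5)
vocab = {t for terms in features.values() for t in terms}
model = NaiveBayes(vocab).fit(
    [(doc, label) for label, docs in train.items() for doc in docs]
)

print(model.predict("the striker scored a goal for the team"))
print(model.predict("a new software update for the laptop"))
```

Terms that occur in every class, such as "the", receive a TF-IDF score of zero and fall out of the feature library, which is the reason for selecting features before training. Working in log space avoids numeric underflow when a document contains many terms.
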
forked from Times125/ML--Native-Bayes
lichanging/ML--Native-Bayes
About
A news text classifier based on the naive Bayes algorithm