Text Classification on Twenty Newsgroups Data Set

Dataset Download:

Abstract:

This data set consists of 20,000 messages taken from 20 newsgroups.

  • Implemented Sklearn MultinomialNB()
  • Implemented Naive Bayes from scratch
  • Comparison between the two classifiers
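For orientation, here is a minimal sketch of what a from-scratch multinomial Naive Bayes classifier can look like. It is not the code from this repository: the class name `NaiveBayesSketch`, the whitespace tokenizer, the add-one (Laplace) smoothing, and the toy training sentences are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesSketch:
    """Hypothetical multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)      # number of documents per class
        self.word_counts = defaultdict(Counter)  # per-class word frequencies
        self.vocab = set()
        for doc, label in zip(docs, labels):
            tokens = doc.lower().split()         # assumed tokenizer: whitespace split
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def predict(self, doc):
        tokens = [t for t in doc.lower().split() if t in self.vocab]
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # log prior: fraction of training documents in this class
            score = math.log(n_docs / self.total_docs)
            total_words = sum(self.word_counts[label].values())
            for tok in tokens:
                # log likelihood with add-one smoothing
                count = self.word_counts[label][tok]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy data for illustration only; not taken from the actual data set.
train_docs = [
    "the pitcher threw a fast ball",
    "the team won the baseball game",
    "the rocket reached orbit",
    "nasa launched a new satellite",
]
train_labels = ["rec.sport.baseball", "rec.sport.baseball", "sci.space", "sci.space"]

model = NaiveBayesSketch().fit(train_docs, train_labels)
prediction_sport = model.predict("baseball game tonight")
prediction_space = model.predict("satellite in orbit")
```

Scores are summed in log space to avoid floating-point underflow on long messages, and words unseen during training are skipped at prediction time.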

Results:

  • Sklearn MultinomialNB(): 0.8394
  • Naive Bayes from scratch: 0.8422

Clone the Project:

git clone https://github.com/wikiabhi/Text-Classification.git

MIT License Copyright (c) 2018 Abhishek