OCR (OPTICAL CHARACTER RECOGNITION)

This is a simple OCR project which can recognize the character present in the image. For now, it is only capable of recognizing digits and capital letter.

Dataset

For digits, I have used MNIST dataset, and for alphabets I have used kaggle A-Z Handwritten Alphabets

Model

For the model architecture I have used Resnet architecture which you can read about here.

This is the image of the architecture I have used.

Usage

First install the required modules from requirements.txt

Testing the model

If you would like to test the model, you can run GUI_predict.py and try. Here is the sample output you could expect.

Training the model

If you would like to train the model, you need to download and extract the 'A-Z Handwritten Alphabets' to data/ and edit the ocr_train.py as per your needs. I have set it to 50 epochs, which would take around 300sec/epoch in google colab

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
GUI_predict.py		GUI_predict.py
OCR_Resnet1.h5		OCR_Resnet1.h5
README.md		README.md
dataset.py		dataset.py
model.PNG		model.PNG
model.py		model.py
ocr_train.py		ocr_train.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR (OPTICAL CHARACTER RECOGNITION)

Dataset

Model

Usage

Testing the model

Training the model

About

Releases

Packages

Languages

arthiondaena/SimpleOCR

Folders and files

Latest commit

History

Repository files navigation

OCR (OPTICAL CHARACTER RECOGNITION)

Dataset

Model

Usage

Testing the model

Training the model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages