CONVNET MODEL 1:
image (1 channel, 28x28) -> conv filter 1 (32 channels, 10x10) -> max-pool (2x2) -> conv filter 2 (16 channels, 5x5) -> max-pool (2x2) -> fully connected (1024 neurons) -> output (10 classes/neurons) -> softmax + cross-entropy loss with Adagrad optimizer
- Stride is 1 for all conv filters and max-pool layers.
- Padding is VALID.
- The activation function is ReLU.
- Weights are initialized from a truncated normal distribution.
- Biases are initialized to a small constant value.
- Numerically stable softmax and cross-entropy definitions are implemented so that deeper models can be trained for multiple epochs without overflow (see the first sketch after this list).
- The Adagrad optimizer is implemented to adaptively adjust the learning rate for each parameter (see the second sketch after this list).
- Forward and backward propagation for the conv filters and pool layers are implemented using NumPy only (see the third sketch after this list).
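A minimal sketch of the numerically stable softmax and cross-entropy described above, assuming logits are raw scores of shape (batch, classes) and labels are one-hot; the actual function names in the repo may differ:

    import numpy as np

    def stable_softmax(logits):
        # Subtracting the row-wise max before exponentiating keeps np.exp
        # from overflowing; the result is mathematically unchanged.
        shifted = logits - np.max(logits, axis=1, keepdims=True)
        exp = np.exp(shifted)
        return exp / np.sum(exp, axis=1, keepdims=True)

    def cross_entropy(probs, labels, eps=1e-12):
        # Clipping keeps np.log away from log(0) = -inf for near-zero probabilities.
        return -np.mean(np.sum(labels * np.log(np.clip(probs, eps, 1.0)), axis=1))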
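A sketch of the per-parameter Adagrad update; the variable names here are illustrative, not taken from the repo. Each parameter accumulates the sum of its squared gradients, and its effective step size shrinks as that sum grows:

    import numpy as np

    def adagrad_update(param, grad, cache, lr=0.01, eps=1e-8):
        # cache holds the running sum of squared gradients for this parameter,
        # so parameters with a large gradient history take smaller steps.
        cache += grad ** 2
        param -= lr * grad / (np.sqrt(cache) + eps)
        return param, cache

The cache is created once per parameter (e.g. cache = np.zeros_like(param)) and carried across updates.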
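A single-image sketch of the NumPy-only conv and max-pool forward passes, assuming channel-first (channels, height, width) arrays with stride 1 and VALID padding as listed above; the repo's actual array layout and function names may differ:

    import numpy as np

    def conv_forward(x, filters, bias, stride=1):
        # x: (c_in, H, W); filters: (c_out, c_in, k, k); bias: (c_out,)
        # VALID padding: each output side is (input - k) // stride + 1.
        c_out, c_in, k, _ = filters.shape
        _, H, W = x.shape
        out_h, out_w = (H - k) // stride + 1, (W - k) // stride + 1
        out = np.zeros((c_out, out_h, out_w))
        for f in range(c_out):
            for i in range(out_h):
                for j in range(out_w):
                    patch = x[:, i*stride:i*stride+k, j*stride:j*stride+k]
                    out[f, i, j] = np.sum(patch * filters[f]) + bias[f]
        return out

    def maxpool_forward(x, size=2, stride=1):
        # Stride-1 2x2 max-pool with VALID padding, matching the list above.
        c, H, W = x.shape
        out_h, out_w = (H - size) // stride + 1, (W - size) // stride + 1
        out = np.zeros((c, out_h, out_w))
        for i in range(out_h):
            for j in range(out_w):
                window = x[:, i*stride:i*stride+size, j*stride:j*stride+size]
                out[:, i, j] = np.max(window, axis=(1, 2))
        return out

The backward passes mirror these loops: the conv gradient accumulates dout[f, i, j] * patch into the filter gradient and dout[f, i, j] * filters[f] into the input gradient, while the pool gradient routes each incoming value to the argmax position within its window. With the strides above, the feature maps work out to 28x28 -> 19x19 (32 channels) -> 18x18 -> 14x14 (16 channels) -> 13x13 before the fully connected layer.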
To train the model on the MNIST data set, run:
python train_cnn_md1.py 1
Training generates an output_cnn_md1.pickle file that stores the trained model variables for prediction, and a log file, train_cnn_md1_log.txt, that shows loss and accuracy progress. The log files in the repo were generated from a prior run.
Training runs for 5 epochs with 100 images per batch and 20000 images per epoch (less than half of the 60000 MNIST training images).
To run prediction over the MNIST test data set using the trained variables in the pickle file, run:
python train_cnn_md1.py 0
Test accuracy and loss progress are logged to predict_cnn_md1_log.txt; this file is also available in the repo.
The trained model achieves ~94% accuracy on the MNIST test data.
The MNIST implementation from the repo below was used as a starting point for implementing this deeper CNN model and for preparing the MNIST data: https://github.com/zishansami102/CNN-from-Scratch/.
Another repo with useful ideas and insights: https://github.com/dorajam/Convolutional-Network.