Project Title

Digit Classification using Two Layer Neural Network and Back Propagation. Written in Python and depends only on Numpy

Getting Started

These instructions will showcase how to access the data, train and test the model.

Getting Data

Download the data in csv format from here :- https://pjreddie.com/projects/mnist-in-csv/.
Already downloaded and kept the MNIST test data in the ./data/ folder. Training data size was huge for Github.

Loading the data

Python script, dataloader.py helps in converting the data from csv to numpy format.
Verify your above steps are correct by running the script
The script requires two user inputs i) datapath = path to the data folder inside which the MNIST CSVs files are stored.
ii) mode = 'train' or 'test' to extract the training or test data
iii) For example :-
```
    python dataloader.py ./data test
 ```
```

Training the model

Python Script , nn.py contains all the APIs used for training the model, saving the model and running over the test data
The script requires three user inputs
i) mode = 'train' or 'test'
ii) datapath = path to the data folder inside which the MNIST CSVs files are stored.
iii) modelpath = path to store the trained weight or load the weights during the test time

iv) Example:-
```
 ```python
     python nn.py train ./data ./new_model
 ```
```
v) Caution:-
```
 If you see the following exception:- 
         Exception : Not able to parse MNIST Data
 Please download the mnist_train.csv from the above mentioned link.
```

Testing the model

I have already provided the trained model inside the model folder and the test data inside the data folder.
To get started, use the follwing command.
```
    python nn.py test ./data ./model_bn
```

If everything is set-up well, you should see the following results on your console.
Loading Dataset===>
Done!
Loading Trained Weights......
Testing Iteration===>0, Acc ====>0.9766
Testing Iteration===>1, Acc ====>0.9609
Testing Iteration===>2, Acc ====>0.9844
Testing Iteration===>3, Acc ====>0.9766
Testing Iteration===>4, Acc ====>0.9531
Testing Iteration===>5, Acc ====>0.9531
Testing Iteration===>6, Acc ====>0.9453
Testing Iteration===>7, Acc ====>0.9844
Testing Iteration===>8, Acc ====>0.9687
Testing Iteration===>9, Acc ====>0.9609
Testing Iteration===>10, Acc ====>0.9844
Testing Iteration===>11, Acc ====>0.9297
Testing Iteration===>12, Acc ====>0.9531
Testing Iteration===>13, Acc ====>0.9531
Testing Iteration===>14, Acc ====>0.9297
Testing Iteration===>15, Acc ====>0.9453
Testing Iteration===>16, Acc ====>0.9687
Testing Iteration===>17, Acc ====>0.9766
Testing Iteration===>18, Acc ====>0.9531
Testing Iteration===>19, Acc ====>0.9141
Testing Iteration===>20, Acc ====>0.9766
Testing Iteration===>21, Acc ====>0.9922
Testing Iteration===>22, Acc ====>0.9219
Testing Iteration===>23, Acc ====>0.9531
Testing Iteration===>24, Acc ====>0.9922
Testing Iteration===>25, Acc ====>0.9687
Testing Iteration===>26, Acc ====>0.9531
Testing Iteration===>27, Acc ====>0.8984
Testing Iteration===>28, Acc ====>0.9687
Testing Iteration===>29, Acc ====>0.9453
Testing Iteration===>30, Acc ====>0.9453
Testing Iteration===>31, Acc ====>0.9453
Testing Iteration===>32, Acc ====>0.9531
Testing Iteration===>33, Acc ====>0.9687
Testing Iteration===>34, Acc ====>0.9844
Testing Iteration===>35, Acc ====>0.9766
Testing Iteration===>36, Acc ====>0.9766
Testing Iteration===>37, Acc ====>0.9687
Testing Iteration===>38, Acc ====>0.9375
Testing Iteration===>39, Acc ====>0.9687
Testing Iteration===>40, Acc ====>0.9687
Testing Iteration===>41, Acc ====>0.9609
Testing Iteration===>42, Acc ====>0.9844
Testing Iteration===>43, Acc ====>0.9453
Testing Iteration===>44, Acc ====>0.9531
Testing Iteration===>45, Acc ====>0.9687
Testing Iteration===>46, Acc ====>0.9687
Testing Iteration===>47, Acc ====>0.9766
Testing Iteration===>48, Acc ====>0.9609
Testing Iteration===>49, Acc ====>0.9766
Testing Iteration===>50, Acc ====>0.9531
Testing Iteration===>51, Acc ====>0.9922
Testing Iteration===>52, Acc ====>0.9453
Testing Iteration===>53, Acc ====>0.9766
Testing Iteration===>54, Acc ====>0.9531
Testing Iteration===>55, Acc ====>0.9453
Testing Iteration===>56, Acc ====>0.9453
Testing Iteration===>57, Acc ====>0.9453
Testing Iteration===>58, Acc ====>0.9219
Testing Iteration===>59, Acc ====>0.9609
Testing Iteration===>60, Acc ====>0.9531
Testing Iteration===>61, Acc ====>0.9609
Testing Iteration===>62, Acc ====>0.9297
Testing Iteration===>63, Acc ====>0.9687
Testing Iteration===>64, Acc ====>0.9297
Testing Iteration===>65, Acc ====>0.9766
Testing Iteration===>66, Acc ====>0.9687
Testing Iteration===>67, Acc ====>0.9453
Testing Iteration===>68, Acc ====>0.9531
Testing Iteration===>69, Acc ====>0.9219
Testing Iteration===>70, Acc ====>0.9531
Testing Iteration===>71, Acc ====>0.9531
Testing Iteration===>72, Acc ====>0.9531
Testing Iteration===>73, Acc ====>0.9531
Testing Iteration===>74, Acc ====>0.9297
Testing Iteration===>75, Acc ====>0.9531
Testing Iteration===>76, Acc ====>0.9297
Testing Iteration===>77, Acc ====>0.9687

Run on sample images

I have kept some images from MNIST inside the images folder.
To use this code, install opencv to read the image.
Run using :-

    python run_on_image.py images/img_4.png ./model_bn/

Model Desgin

Number of Hidden Layers - 2
Hidden Layer Sizes - (1024,2048)
Learning Rate - 0.001
Batch Size - 128
Maximum Iterations for training - 1000000
Batch Norm Decay Rate - 0.9

Observations

Faster Convergence and better accuracy by using Batch Normalization before Relu operation. Please refere to the plots below.
Experiments with increasing the hidden layers and size might help us in finding a sweet spot where we are neither underfitting nor overfitting.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

figs

figs

images

images

model_bn

model_bn

README.md

README.md

dataloader.py

dataloader.py

nn.py

nn.py

requirements.txt

requirements.txt

run_on_image.py

run_on_image.py

Repository files navigation

Project Title

Getting Started

Getting Data

Loading the data

Training the model

Testing the model

Run on sample images

Model Desgin

Observations

Loss and Accuracy Curves with Batch Normalization

Loss and Accuracy Curves without Batch Normalization

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
data		data
figs		figs
images		images
model_bn		model_bn
README.md		README.md
dataloader.py		dataloader.py
nn.py		nn.py
requirements.txt		requirements.txt
run_on_image.py		run_on_image.py

345ishaan/Digit-Classification-using-Numpy

Folders and files

Latest commit

History

Repository files navigation

Project Title

Getting Started

Getting Data

Loading the data

Training the model

Testing the model

Run on sample images

Model Desgin

Observations

Loss and Accuracy Curves with Batch Normalization

Loss and Accuracy Curves without Batch Normalization

About

Topics

Resources

Stars

Watchers

Forks

Languages