Applying Convolutional Neural Network on mnist dataset

10 May

Applying Convolutional Neural Network on mnist dataset

CNN is basically a model known to be Convolutional Neural Network and in the recent time it has gained a lot of popularity because of it’s usefullness. CNN uses multilayer perceptrons to do computational works. CNNs use relatively little pre-processing compared to other image classification algorithms. This means the network learns through filters that in traditional algorithms were hand-engineered. So, for image processing task CNNs are the best-suited option.

MNIST dataset:
mnist dataset is a dataset of handwritten images as shown below in image.

We can get 99.06% accuracy by using CNN(Convolutionary neural Network) with functional model. The reason of using functional model is maintaining easiness while connecting the layers.

Firstly, include all necessary libraries
import numpy as np
import keras
from keras.datasets import mnist
from keras.models import Model
from keras.layers import Dense, Input
from keras.layers import Conv2D, MaxPooling2D, Dropout, Flatten
from keras import backend as k

Create the train data and test data
Test data: Used for testing the model that how are model has been trained.
Train data: Used to train our model.

(x-train, y-train), (X-test, y-test) = mnist.load_data()
While proceeding further, img_rows and img_cols are used as the image dimensions. In mnist dataset, it is 28 and 28. We also need to check the data format i.e. ‘channels_first’ or ‘channels_last’. In CNN, we can normalize data before hands such that large terms of the calculations can be reduced to smaller terms. Like, we can normalize the x-train and X-test data by dividing it with 255.

Checking data-format:

img_rows, img_cols=28, 28

if k.image_data_format() == 'channels_first':
x-train = x-train.reshape(x-train.shape[0], 1, img_rows, img_cols)
X-test = X-test.reshape(X-test.shape[0], 1, img_rows, img_cols)
inpx = (1, img_rows, img_cols)

else:
x-train = x-train.reshape(x-train.shape[0], img_rows, img_cols, 1)
X-test = X-test.reshape(X-test.shape[0], img_rows, img_cols, 1)
inpx = (img_rows, img_cols, 1)

x-train = x-train.astype('float32')
X-test = X-test.astype('float32')
x-train /= 255
X-test /= 255
Description of the output classes:
Since output of the model can comprise of any of the digits between 0 to 9.so, we need 10 classes in output. To make output for 10 classes, use keras.utils.to_categorical function, which will provide with the 10 columns. Out of these 10 columns only one value will be one and rest 9 will be zero and this one value of the output will denote the class of the digit.

y-train = keras.utils.to_categorical(y-train)
y-test = keras.utils.to_categorical(y-test)
Now, dataset is ready so let’s move towards the cnn model :

inpx = Input(shape=inpx)
layer1 = Conv2D(32, kernel_size=(3, 3), activation='relu')(inpx)
layer2 = Conv2D(64, (3, 3), activation='relu')(layer1)
layer3 = MaxPooling2D(pool_size=(3, 3))(layer2)
layer4 = Dropout(0.5)(layer3)
layer5 = Flatten()(layer4)
layer6 = Dense(250, activation='sigmoid')(layer5)
layer7 = Dense(10, activation='softmax')(layer6)
Explanation of the working of each layer in CNN model:
layer1 is Conv2d layer which convolves the image using 32 filters each of size (3*3).
layer2 is again a Conv2D layer which is also used to convolve the image and is using 64 filters each of size (3*3).
layer3 is MaxPooling2D layer which picks the max value out of a matrix of size (3*3).
layer4 is showing Dropout at a rate of 0.5.
layer5 is flattening the output obtained from layer4 and this flatten output is passed to layer6.
layer6 is a hidden layer of neural network containng 250 neurons.
layer7 is the output layer having 10 neurons for 10 classes of output that is using the softmax function.

Calling compile and fit function:
model = Model([inpx], layer7)
model.compile(optimizer=keras.optimizers.Adadelta(),
loss=keras.losses.categorical_crossentropy,
metrics=['accuracy'])

model.fit(x-train, y-train, epochs=12, batch_size=500)

Firstly, we made an object of the model as shown in the above given lines, where [inpx] is the input in the model and layer7 is the output of the model. We compiled the model using required optimizer, loss function and printed the accuracy and at the last model.fit was called along with parameters like x-train(means image vectors), y-train(means the label), number of epochs and the batch size. Using fit function x-train, y-train dataset is fed to model in a particular batch size.

Evaluate function:
model.evaluate provides the score for the test data i.e. provided the test data to the model. Now, model will predict class of the data and predicted class will be matched with y-test label to give us the accuracy.
score = model.evaluate(X-test, y-test, verbose=0)
print('loss=', score[0])
print('accuracy=', score[1])
Output:

Course Curriculum

Applying Convolutional Neural Network on mnist dataset

loss = 0.0295960184669

accuracy = 0.991