Image Classification with CIFAR-10 dataset

In this notebook, I am going to classify images from the CIFAR-10 dataset. The dataset consists of images of airplanes, dogs, cats, and other objects. I will preprocess the images, which includes normalizing them, and then train a convolutional neural network on all the samples. Some more interesting datasets can be found here
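As a rough illustration of the normalization step, here is a minimal sketch. It assumes the raw pixels are 8-bit values in the range 0 to 255, so min-max scaling reduces to a division by 255. The function name `normalize` is chosen for illustration and may not match the code in CIFAR-10.py.

```python
import numpy as np

def normalize(images):
    """Min-max scale pixel values from [0, 255] to [0.0, 1.0].

    Assumes 8-bit input, so the minimum is 0 and the maximum is 255.
    """
    return np.asarray(images, dtype=np.float32) / 255.0

# Tiny stand-in "image" data instead of a real CIFAR-10 batch.
pixels = np.array([0, 51, 255], dtype=np.uint8)
scaled = normalize(pixels)
```

Scaling to a small, fixed range keeps all input features on the same scale, which usually makes gradient-based training better behaved.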

2. Understanding the dataset

Each original batch of data is a (10000 x 3072) tensor stored as a numpy array, where the number of rows, (10000), is the number of samples. As stated on the CIFAR-10/CIFAR-100 dataset page, each row vector, (3072), represents a color image of 32x32 pixels.

Since this project uses a CNN for the classification task, the row vector, (3072), is not an appropriate form of image data to feed in. To feed an image into a CNN model, the tensor representing the image should have the dimensions (width x height x num_channel) or (num_channel x width x height).

Which one to use is your choice (check out the tensorflow conv2d). In this particular project, I am going to use the first one because it is the default in tensorflow's CNN operations.

The row vector (3072) has exactly the right number of elements, since 32*32*3 == 3072. Reshaping the row vector, (3072), takes two steps. The first step uses the reshape function in numpy, and the second step uses the transpose function in numpy.
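The two steps can be sketched as follows. This is a minimal example, not necessarily the exact code in CIFAR-10.py. It relies on the layout documented for the CIFAR-10 python batches: each row holds 1024 red values, then 1024 green, then 1024 blue, each channel stored row by row. The function name `rows_to_images` and the stand-in data are assumptions for illustration.

```python
import numpy as np

def rows_to_images(batch):
    """Convert (N, 3072) row vectors into (N, 32, 32, 3) channels-last images."""
    # Step 1: reshape. Every row splits into 3 channels of 32 x 32 values.
    channels_first = batch.reshape(len(batch), 3, 32, 32)
    # Step 2: transpose. Move the channel axis to the end.
    return channels_first.transpose(0, 2, 3, 1)

# Stand-in data: 2 fake "images" whose values are just running indices.
batch = np.arange(2 * 3072).reshape(2, 3072)
images = rows_to_images(batch)
print(images.shape)
```

Reshaping directly to (N, 32, 32, 3) would run without error but would interleave the color planes incorrectly, which is why the transpose is needed as a second step.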

5. Model Architecture

The entire model consists of 14 layers in total. The list below shows the layers and the techniques applied to build the model.

Convolution with 6 different filters in size of (3x3)
Max Pooling by 3

ReLU activation function

Convolution with 16 different filters in size of (3x3)
Max Pooling by 2

ReLU activation function

Convolution with 64 different filters in size of (3x3)

ReLU activation function

Flattening the 3-D output of the last convolutional operations.
Fully Connected Layer with 120 units
Fully Connected Layer with 84 units
Fully Connected Layer with 10 units
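To see how the tensor shape changes through the layers listed above, here is a small shape trace in plain Python. The padding and stride settings are not stated here, so the sketch assumes 'same' padding with stride 1 for every convolution and non-overlapping max pooling without padding (stride equal to the window size). With other settings the numbers will differ.

```python
def conv_same(size):
    # Assumption: 'same' padding with stride 1 keeps the spatial size.
    return size

def max_pool(size, window):
    # Assumption: no padding and stride equal to the pooling window.
    return (size - window) // window + 1

size, channels = 32, 3              # CIFAR-10 input: 32 x 32 pixels, 3 channels
shapes = [(size, size, channels)]

# (number of filters, pooling window or None) for each convolutional stage
for filters, pool in [(6, 3), (16, 2), (64, None)]:
    size = conv_same(size)
    channels = filters
    if pool is not None:
        size = max_pool(size, pool)
    shapes.append((size, size, channels))

flat_units = size * size * channels  # length of the flattened vector
dense_units = [120, 84, 10]          # fully connected layers; 10 output classes

for shape in shapes:
    print(shape)
print(flat_units, dense_units)
```

Under these assumptions the flattened vector has 1600 values before the first fully connected layer.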

The image below describes how the conceptual convolving operation differs from the tensorflow implementation when you use the [Channel x Width x Height] tensor format.

6. Training the model

The model achieves over 88.93% accuracy after 140 epochs of training across the 5 training batches.
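The training schedule, 140 epochs with each epoch visiting all 5 batches, can be sketched as a loop that cuts every batch into mini-batches. The mini-batch size of 128 and the helper name `iter_minibatches` are assumptions for illustration, and the actual optimizer step is left out.

```python
import math
import numpy as np

EPOCHS = 140           # from the text above
N_BATCHES = 5          # CIFAR-10 ships 5 training batches
BATCH_SIZE = 128       # assumption: the mini-batch size is not stated here
SAMPLES_PER_BATCH = 10000

def iter_minibatches(features, labels, batch_size):
    """Yield consecutive (features, labels) slices of at most batch_size rows."""
    for start in range(0, len(features), batch_size):
        end = start + batch_size
        yield features[start:end], labels[start:end]

# Small stand-in arrays in place of a real preprocessed batch.
features = np.zeros((300, 32, 32, 3), dtype=np.float32)
labels = np.zeros((300, 10), dtype=np.float32)
chunk_sizes = [len(x) for x, _ in iter_minibatches(features, labels, BATCH_SIZE)]

# Number of optimizer steps the full schedule implies under these assumptions.
steps_per_batch = math.ceil(SAMPLES_PER_BATCH / BATCH_SIZE)
total_steps = EPOCHS * N_BATCHES * steps_per_batch
```

In a real run, the body of the inner loop would run one optimizer step on each mini-batch and periodically report validation accuracy.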

Contents

1. Get the Data

2. Understanding the dataset

3. Hands-on experience implementing normalize

4. Pytorch Basics

5. Model Architecture and construction (Using different types of APIs (tf.nn, tf.layers, tf.contrib))

6. Training the model

7. Prediction

qazimbhat1/CIFAR-10-image-classification-
