- This project exists only as a personal challenge and for educational purposes; it has no other pretension.
- My goal was obviously not to reinvent the wheel: this project is highly inspired by several (good) readings (see some references below, and do not forget that search engines are your friends).
- In the end, the objective was also to discover, understand and improve my personal skills with scikit-learn and Keras when building neural networks.
- pyimagesearch blog: How to get started with Keras, Deep Learning, and Python. You can also follow the links inside the post; they all contain useful information, on VGGNet models for example. Almost everything in this project comes from this blog post, it is a very good starting point. Great tutorial!
- pyimagesearch blog: Implement and use already trained networks.
- Kaggle kernel: Cats vs. Dogs
- Keras blog: Building powerful image classification models using little data
- scikit-learn tutorial: face recognition with SVM
Also grab some information on Convolutional Neural Networks to understand their purpose and (mainly) how they work. Wikipedia might help you (FR or EN).
This project works with Python 3.6.x (not 3.7, as the TensorFlow backend does not yet support Python versions newer than 3.6). If not already installed, use pip to install the following packages:
- keras (deep learning and neural networks made easy)
- tensorflow (backend used by keras)
- scikit-learn (machine learning)
- numpy
- opencv (image manipulation)
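For example, with pip (note that the OpenCV bindings are published on PyPI under the name opencv-python):

```sh
pip install keras tensorflow scikit-learn numpy opencv-python
```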
The goal of this project is either:
- to build and train a neural network,
- to use an already trained one,
- or to use one of the pre-trained networks provided with Keras (which are able to classify images among 1,000 different object categories, similar to objects we encounter in our day-to-day lives, with high accuracy),

in order, in the end, to classify a picture the model has never seen and say whether it is a dog or a cat (or actually whatever else you would like to implement).
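As a quick illustration of the last option, here is a minimal sketch using one of the pre-trained networks shipped with Keras (VGG16 with ImageNet weights; the file name is a placeholder):

```python
import numpy as np
from keras.applications import VGG16
from keras.applications.vgg16 import preprocess_input, decode_predictions
from keras.preprocessing import image

# Load VGG16 with weights trained on ImageNet (1,000 object categories)
model = VGG16(weights="imagenet")

# "my_picture.jpg" is a placeholder: any image the model has never seen
img = image.load_img("my_picture.jpg", target_size=(224, 224))
x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))

# Print the 3 most likely categories with their probabilities
print(decode_predictions(model.predict(x), top=3)[0])
```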
To do all that, main.py takes 3 arguments (the first 2 are mandatory):
- -m (or "--model"), mandatory: shoud equals 'simple' or 'cvn', depending on which NN you would like to use (NB: so far, only simple one is implemented)
- -o (or "--objective"), mandatory: should equals 'train' or 'predict', depending on what you would like to do
- -w (or "--working-dir"): use it to set another working directory than the one where this module is executed.
Some folders are configured and required:
- training: the folder for the training image dataset (put images in subfolders named after the categories). For example, create 2 folders named 'dog' and 'cat' to classify between those two animals. I have been able to download the Kaggle 'Dogs vs. Cats' dataset here. It is of course not added to this repository and contains more than 10K entries for each class.
- output: folder where trained models are stored or loaded, depending on what is required
- validation_test: the folder from which images are loaded to perform predictions
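For example, a typical working directory could look like this (using the default folder names; see the configuration note below):

```
<working_directory>/
├── config/
│   └── config.json
├── training/
│   ├── cat/
│   └── dog/
├── output/
└── validation_test/
```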
All those directories, plus some model hyperparameters, are set in the external <working_directory>/config/config.json configuration file; change them to fit your needs. The directories are not added to this git repository, so you can create them and name them however you want.
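As an illustration only, such a config.json could look like the sketch below; the key names and values here are hypothetical, check the actual file for the real ones:

```json
{
    "training_dir": "training",
    "output_dir": "output",
    "validation_dir": "validation_test",
    "epochs": 75,
    "learning_rate": 0.01,
    "batch_size": 32
}
```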
The goal here is to discover the libraries. It is written almost everywhere (for example here) that "the right tool for an image classification job is a convnet (convolutional network)". Our simple NN will not give wonderful results, but it might be able to predict most of the cases correctly.
This simple Neural Network is built with 4 layers (3072-1024-512-x):
- 3072 as input layer (3072 = 32x32x3 pixels = flattened images)
- 1024 and 512 as hidden layers
- x as output layer, where x is the number of possible class labels (2 for binary classification, n for multi-class classification)

The activation functions (sigmoid) on each layer are common ones found across several tutorials; a sketch follows.
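A minimal Keras sketch of this 3072-1024-512-x network (the softmax activation on the output layer and the SGD optimizer are assumptions of mine, both being common defaults):

```python
from keras.models import Sequential
from keras.layers import Dense

num_classes = 2  # x: 2 for the binary dog/cat case

model = Sequential()
# 3072 inputs = 32x32x3 pixels, i.e. the flattened images
model.add(Dense(1024, input_shape=(3072,), activation="sigmoid"))
model.add(Dense(512, activation="sigmoid"))
# output layer: one score per possible class label
model.add(Dense(num_classes, activation="softmax"))
model.compile(loss="categorical_crossentropy", optimizer="sgd", metrics=["accuracy"])
```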
This is an implementation of a smaller version of the VGG neural network. The VGG architecture:
- uses only 3×3 convolutional layers stacked on top of each other in increasing depth,
- reduces volume size by using max pooling,
- ends with 2 FC (fully-connected) layers followed by a softmax classifier.
The layers are (a Keras sketch of this stack follows the list below):
INPUT -> [CONV -> RELU -> POOL] -> [(CONV -> RELU) * 2 -> POOL] -> [(CONV -> RELU) * 3 -> POOL] -> [FC -> RELU] -> FC -> softmax classifier
With:
- RELU (Rectified Linear Unit): the activation function used in this network architecture.
- Batch Normalization: used to normalize the activations of a given input volume before passing it to the next layer in the network. It has proven very effective at reducing the number of epochs required to train a CNN, as well as at stabilizing training itself.
- POOL: used to progressively reduce the size of the input volume to a layer.
- Dropout: the process of disconnecting random neurons between layers. It reduces overfitting, increases accuracy, and allows our network to generalize better to unfamiliar images. Here, 25% of the node connections are randomly disconnected between layers during each training iteration. Source
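To make this stack concrete, here is a minimal Keras sketch of such a smaller VGG-style network (the 96x96x3 input size and the filter counts are illustrative assumptions, not necessarily the values used in this project):

```python
from keras.models import Sequential
from keras.layers import (Activation, BatchNormalization, Conv2D, Dense,
                          Dropout, Flatten, MaxPooling2D)

# Illustrative input size and number of classes
height, width, depth, num_classes = 96, 96, 3, 2

model = Sequential()

# CONV -> RELU -> POOL
model.add(Conv2D(32, (3, 3), padding="same", input_shape=(height, width, depth)))
model.add(Activation("relu"))
model.add(BatchNormalization())
model.add(MaxPooling2D(pool_size=(3, 3)))
model.add(Dropout(0.25))  # randomly disconnect 25% of node connections

# (CONV -> RELU) * 2 -> POOL, with increasing depth
model.add(Conv2D(64, (3, 3), padding="same"))
model.add(Activation("relu"))
model.add(BatchNormalization())
model.add(Conv2D(64, (3, 3), padding="same"))
model.add(Activation("relu"))
model.add(BatchNormalization())
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Dropout(0.25))

# (CONV -> RELU) * 3 -> POOL, deeper still
model.add(Conv2D(128, (3, 3), padding="same"))
model.add(Activation("relu"))
model.add(BatchNormalization())
model.add(Conv2D(128, (3, 3), padding="same"))
model.add(Activation("relu"))
model.add(BatchNormalization())
model.add(Conv2D(128, (3, 3), padding="same"))
model.add(Activation("relu"))
model.add(BatchNormalization())
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Dropout(0.25))

# FC -> RELU
model.add(Flatten())
model.add(Dense(1024))
model.add(Activation("relu"))
model.add(BatchNormalization())
model.add(Dropout(0.5))

# FC -> softmax classifier
model.add(Dense(num_classes))
model.add(Activation("softmax"))
```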