Live Facial Recognition

Live facial recognition of people using a webcam. The framework can detect multiple people in a single frame. The model was developed as part of a team project to explore the integration of different ML and deep learning tools in a single application.

Model

Every i-th frame of the video is passed to the MTCNN module, which detects the faces and provides a bounding box for each.
The aligned face image is then passed to the FaceNet model, which produces an embedding.
The embedding is then passed to a classifier, which predicts the person's identity.
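
The frame-skipping step described above can be sketched as follows. This is a minimal illustration, not the code in face_recog_live.py: the function name `sample_frames` and the parameter `every` are hypothetical, and a list of strings stands in for the webcam stream.

```python
from typing import Iterable, Iterator, Tuple, TypeVar

Frame = TypeVar("Frame")


def sample_frames(frames: Iterable[Frame], every: int) -> Iterator[Tuple[int, Frame]]:
    """Yield (index, frame) for every `every`-th frame, starting at index 0.

    `every` is a hypothetical name for the "i" in "every i-th frame".
    """
    if every < 1:
        raise ValueError("every must be >= 1")
    for index, frame in enumerate(frames):
        if index % every == 0:
            yield index, frame


# Stand-in for a webcam stream: ten dummy "frames".
dummy_stream = [f"frame-{n}" for n in range(10)]

# Only the selected frames would be handed to the face detector.
selected = list(sample_frames(dummy_stream, every=3))
print([index for index, _ in selected])
```

Skipping frames trades detection latency for throughput: a larger interval reduces the number of detector calls per second, at the cost of reacting more slowly when a face enters the frame.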

Face Detection (MTCNN)

Multi-task Cascaded Convolutional Networks (MTCNN) is a three-stage cascade of deep neural networks that predicts faces and their locations (bounding boxes). The framework exploits the inherent correlation between detection and alignment.

1 (Proposal Network, P-Net): A fully convolutional network, called the Proposal Network (P-Net), is used to obtain candidate facial windows and their bounding box regression vectors. The candidates are then calibrated using the estimated bounding box regression vectors, followed by non-maximum suppression (NMS).

2 (Refine Network, R-Net): All candidates are fed to a Refine Network (R-Net), which rejects a large number of false candidates and performs calibration with bounding box regression, followed by non-maximum suppression.

3 (Output Network, O-Net): Similar to the second stage, but in this stage face regions are identified with more supervision. In particular, the network outputs the positions of five facial landmarks.
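
The non-maximum suppression step used after each stage can be sketched as below. This is a generic greedy NMS written in plain Python for illustration, not the routine in detect_face.py; the function names, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are assumptions.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed layout: (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(
    boxes: Sequence[Box], scores: Sequence[float], iou_threshold: float = 0.5
) -> List[int]:
    """Return indices of boxes to keep, highest score first.

    A box is dropped when it overlaps an already kept, higher-scoring box
    by more than `iou_threshold` (an illustrative value).
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


# Two heavily overlapping candidates for one face, plus a separate face.
candidates = [(10, 10, 50, 50), (12, 12, 52, 52), (100, 100, 140, 140)]
confidences = [0.9, 0.8, 0.7]
kept = non_max_suppression(candidates, confidences)
print(kept)
```

In this toy input the second box overlaps the first with an IoU of about 0.82, so only the higher-scoring one survives, while the third box is kept because it overlaps nothing.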

FaceNet (Feature Extraction)

FaceNet learns a mapping from face images to a compact 128-D Euclidean space. Once this space has been produced, tasks such as face recognition, verification and clustering can be implemented using standard techniques, with FaceNet embeddings as feature vectors. The pre-trained Inception ResNet v1 model, trained on the CASIA-WebFace dataset, is used; it has a reported accuracy of 98.72%.
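
Because the embeddings live in a Euclidean space, comparing two faces reduces to measuring the distance between two vectors. The sketch below illustrates this with toy 4-D vectors instead of real 128-D embeddings. The helper names and the 1.1 threshold are hypothetical; a real threshold would have to be tuned on validation data.

```python
import math
from typing import List, Sequence


def l2_normalize(vector: Sequence[float]) -> List[float]:
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vector))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vector]


def euclidean_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean (L2) distance between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def same_identity(a: Sequence[float], b: Sequence[float], threshold: float = 1.1) -> bool:
    """Treat two embeddings as the same person if they are close enough.

    The threshold is an illustrative assumption, not a value from this project.
    """
    return euclidean_distance(l2_normalize(a), l2_normalize(b)) < threshold


# Toy vectors standing in for embeddings of three face images.
anchor = [1.0, 0.0, 0.0, 0.0]
similar = [0.9, 0.1, 0.0, 0.0]
different = [0.0, 0.0, 1.0, 0.0]

print(same_identity(anchor, similar))
print(same_identity(anchor, different))
```

Thresholding distances like this is enough for verification (is this the same person?). For recognition among several known people, this project instead trains a classifier on the embeddings.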

Classification

The features extracted by FaceNet are used to train random forest (RF) and SVM classifiers on the created dataset.

Preparing the Dataset

To prepare the dataset, create a separate folder for each person. Each folder should contain roughly 15-25 images of the person in different positions, with the camera kept in front of the person.
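
Before training, it can help to check that the folder layout matches the description above. The following is a small helper sketch, not part of this repository: the function names, the accepted file extensions, and the example path `dataset/` are assumptions.

```python
from pathlib import Path
from typing import Dict

# Assumed set of image extensions; adjust to match the actual dataset.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def count_images_per_person(dataset_dir: str) -> Dict[str, int]:
    """Map each person's folder name to the number of image files it holds."""
    counts: Dict[str, int] = {}
    for person_dir in sorted(Path(dataset_dir).iterdir()):
        if person_dir.is_dir():
            counts[person_dir.name] = sum(
                1
                for item in person_dir.iterdir()
                if item.is_file() and item.suffix.lower() in IMAGE_SUFFIXES
            )
    return counts


def folders_outside_range(counts: Dict[str, int], low: int = 15, high: int = 25) -> Dict[str, int]:
    """Return the folders whose image count falls outside [low, high]."""
    return {name: n for name, n in counts.items() if not low <= n <= high}


if __name__ == "__main__":
    dataset = Path("dataset")  # hypothetical location of the per-person folders
    if dataset.is_dir():
        for name, n in folders_outside_range(count_images_per_person(str(dataset))).items():
            print(f"{name}: {n} images (expected roughly 15-25)")
```

The 15-25 range is the guideline given above, not a hard limit, so the output is best read as a list of folders worth reviewing.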

Running the Code

python face_recog_live.py

For Training the Classifier with the dataset

python train_classifier.py --input-dir 'INPUT IMAGE DIRECTORY' --model-path models/20170511-185253.pb --classifier-path 'PATH TO SAVE THE TRAINED MODEL' --num-threads 10 --num-epochs 5 --min-num-images-per-class 1

