TY - GEN
T1 - Facial Mask Detection using Semantic Segmentation
AU - Meenpal, Toshanlal
AU - Balakrishnan, Ashutosh
AU - Verma, Amit
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/10/1
Y1 - 2019/10/1
N2 - Face Detection has evolved as a very popular problem in Image processing and Computer Vision. Many new algorithms are being devised using convolutional architectures to make the algorithm as accurate as possible. These convolutional architectures have made it possible to extract even the pixel details. We aim to design a binary face classifier which can detect any face present in the frame irrespective of its alignment. We present a method to generate accurate face segmentation masks from any arbitrary size input image. Beginning from the RGB image of any size, the method uses Predefined Training Weights of VGG-16 Architecture for feature extraction. Training is performed through Fully Convolutional Networks to semantically segment out the faces present in that image. Gradient Descent is used for training while Binomial Cross Entropy is used as a loss function. Further the output image from the FCN is processed to remove the unwanted noise and avoid the false predictions if any and make bounding box around the faces. Furthermore, proposed model has also shown great results in recognizing non-frontal faces. Along with this it is also able to detect multiple facial masks in a single frame. Experiments were performed on Multi Parsing Human Dataset obtaining mean pixel level accuracy of 93.884 % for the segmented face masks.
AB - Face Detection has evolved as a very popular problem in Image processing and Computer Vision. Many new algorithms are being devised using convolutional architectures to make the algorithm as accurate as possible. These convolutional architectures have made it possible to extract even the pixel details. We aim to design a binary face classifier which can detect any face present in the frame irrespective of its alignment. We present a method to generate accurate face segmentation masks from any arbitrary size input image. Beginning from the RGB image of any size, the method uses Predefined Training Weights of VGG-16 Architecture for feature extraction. Training is performed through Fully Convolutional Networks to semantically segment out the faces present in that image. Gradient Descent is used for training while Binomial Cross Entropy is used as a loss function. Further the output image from the FCN is processed to remove the unwanted noise and avoid the false predictions if any and make bounding box around the faces. Furthermore, proposed model has also shown great results in recognizing non-frontal faces. Along with this it is also able to detect multiple facial masks in a single frame. Experiments were performed on Multi Parsing Human Dataset obtaining mean pixel level accuracy of 93.884 % for the segmented face masks.
KW - Face Segmentation and Detection
KW - Fully Convolutional Network
KW - Semantic Segmentation
UR - https://www.scopus.com/pages/publications/85075349374
U2 - 10.1109/CCCS.2019.8888092
DO - 10.1109/CCCS.2019.8888092
M3 - Conference contribution
AN - SCOPUS:85075349374
T3 - 2019 4th International Conference on Computing, Communications and Security, ICCCS 2019
BT - 2019 4th International Conference on Computing, Communications and Security, ICCCS 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Computing, Communications and Security, ICCCS 2019
Y2 - 10 October 2019 through 12 October 2019
ER -