A Hybrid ROI Extraction Approach for Mask and Unmask
Facial Recognition System using Light-CNN

Ahmed, Ahmed; S. Alghareb, Faris

doi:http://dx.doi.org/10.12785/ijcds/160190

Journals About us Ethics and Policies Objectives Values Contact us

UOB Journals
→
02. International Journal of Computing and Digital Systems
→
Preprint
→
View Item

A Hybrid ROI Extraction Approach for Mask and Unmask Facial Recognition System using Light-CNN

Ahmed, Ahmed; S. Alghareb, Faris

DOI: http://dx.doi.org/10.12785/ijcds/160190

ISSN: 2210-142X

Date: 2024-03-06

Abstract:

In recent years, deep learning-based algorithms have been immensely employed and tested in a variety of real-world applications. The efficacy of such algorithms has been thoroughly examined in a practical setting. In this paper, CNN-based deep learning approaches are utilized to recognize faces in real-time to identify faces with and without mask. We employ pre-trained algorithms (YOLOv2 and SSD) to identify people wearing a face mask, which enables a machine to perform recognition tasks while evolving through a learning method. Meanwhile, if there is more than one person in the scene, the one with the max score will be selected for classification. Thus, a hybrid approach that combines YOLOv2 and SSD algorithms to work in parallel is developed for masked-face extraction. Likewise, the Viola-Jones algorithm is used here to detect faces without mask and randomly select a single region of interest (ROI) to be stored for classification. All pre-processing algorithms work separately in parallel as reconstruction steps for preprocessing to crop the ROI and store images for training and testing dataset. Followed by developing a lightweight computational complexity CNN model for face mask recognition to identify whether the selected person’s face is wearing a mask or not. The dataset contains numerous variations in appearance and viewpoint to capture different scenarios with and without mask faces. On average, the proposed face mask detection architecture realizes recall and F1 score of 98.3 and 98.31, respectively. The training performance, on the other hand, has improved by 19.7% and 95.9% for training time and storage space (model size) compared to AlexNet. The presented framework architecture is an efficient face mask and unmask detector and can be employed as a robust medical assistant face detector in the healthcare sector for automated tracking of a patient, visitor, or staff member wearing a mask or not.

Show full item record