AccScience Publishing / IJOSI / Volume 9 / Issue 1 / DOI: 10.6977/IJoSI.202502_9(1).0003
Cite this article
9
Download
16
Citations
34
Views
Journal Browser
Volume | Year
Issue
Search
News and Announcements
View All
ARTICLE

Brightness Augmentation Implementation to Evaluate Perfor-mance Classification of Face Masked Base on CNN Model

Desty Mustika Ramadhan1 Husni Mubarok1 Rianto .1*
Show Less
1 Department of Informatics, Siliwangi University, Tasikmalaya 46115, West Java, Indonesia
Submitted: 7 June 2024 | Revised: 20 September 2024 | Accepted: 21 October 2024 | Published: 19 February 2025
© 2025 by the Author(s). Licensee AccScience Publishing, USA. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution -Noncommercial 4.0 International License (CC BY-NC 4.0) ( https://creativecommons.org/licenses/by-nc/4.0/ )
Abstract

Deep learning methods with convolutional neural network (CNN) models have increasingly been applied to facial expression recognition. However, due to the recent pandemic, many individuals wear masks for work or health reasons, obstructing the complete visibility of their faces. This can impact social interactions, particularly in areas involving facial expression cues like the mouth. This study explores the application of CNNs in identifying facial expressions obscured by masks, focusing on the VGG16 and MobileNet architectures. Additionally, the research investigates the effects of data augmentation, including geometric and brightness augmentation, on the accuracy of facial expression classification. The findings indicate that the VGG16 architecture with cross-validation (VGG16-FLCV) outperforms MobileNet-FLCV in recognizing and classifying masked facial expressions. Data augmentation, particularly brightness augmentation, significantly enhances CNN model performance. For the VGG16-FLCV architecture, the brightness range (1.00, 1.25) yields the best accuracy, with a training accuracy of 81.73% and a validation accuracy of 70.71%. The most optimal brightness ranges for VGG16-FLCV are in the dark category (0.25, 0.50), (0.50, 0.75), and (0.75, 1.00), as well asthe bright category (1.00, 1.25). Meanwhile, MobileNet-FLCV with brightness ranges (0.25, 0.50), (0.50, 0.75), (0.75, 1.00), (1.00, 1.25), and (1.25, 1.50) can be used as alternative brightness ranges without significant accuracy degradation. These findings provide valuable insights for improving the accuracy of masked facial expression recognition by applying appropriate data augmentation techniques.

Keywords
Brightness Augmentation
CNN
Cross-validation
Masked Facial Expressions
References

Agrawal, A. & Mittal, N. (2019). Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy. The Visual Computer, 2. https://doi.org/10.1007/s00371-019-01630-9

Berrar, D. (2018). Cross-validation. Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics, 1–3(April), 542–545. https://doi.org/10.1016/B978-0-12-809633-8.20349-X

Castellano, G., De Carolis, B. & Macchiarulo, N. (2021). Automatic emotion recognition from facial expressions when wearing a mask. ACM International Conference Proceeding Series. https://doi.org/10.1145/3464385.3464730

Cheng, X., Lu, J., Member, S. & Yuan, B. (n.d.). Face Segmentor-Enhanced Deep Feature Learning for Face Recognition. 1–14.

Choe, J. & Shim, H. (n.d.). Attention-based Dropout Layer for Weakly Supervised Object Localization.2219–2228.

Cotter, S.F. (n.d.). MobiExpressNet: A Deep Learning Network for Face Expression Recognition on Smart Phones. 2020 IEEE International Conference on Consumer Electronics (ICCE), 1–4.

Ding, H., Zhou, P. & Chellappa, R. (2020). Occlusion-adaptive deep network for robust facial expression recognition. IJCB 2020 -IEEE/IAPR International Joint Conference on Biometrics. https://doi.org/10.1109/IJCB48548.2020.9304923

Farkhod, A. & Abdusalomov, A.B. (2022). Development of Real-Time Landmark-Based Emotion Recognition CNN for Masked Faces.

Genç, Ç., Colley, A., Löchtefeld, M. & Häkkilä, J. (2020). Face mask design to mitigate facial expression occlusion. Proceedings -International Symposium on Wearable Computers, ISWC, 40–44. https://doi.org/10.1145/3410531.3414303.

Grundmann, F., Epstude, K. & Id, S.S. (2021). Face masks reduce emotion-recognition accuracy and perceived closeness. 1–18. https://doi.org/10.1371/journal.pone.0249792.

Kandel, I., Castelli, M., & Manzoni, L. (2022). Brightness as an Augmentation Technique for Image Classification. Emerging Science Journal, 6(4), 881–892. https://doi.org/10.28991/ESJ-2022-06-04-015.

Li, J., Jin, K., Zhou, D., Kubota, N., & Ju, Z. (2020). Attention Mechanism-based CNN for Facial Expression. Neurocomputing. https://doi.org/10.1016/j.neucom.2020.06.014

Li, Z., Kamnitsas, K., & Glocker, B. (2021). Analyzing Overfitting Under Class Imbalance in Neural Networks for Image Segmentation. 40(3), 1065–1077.

Merghani, W., & Yap, M. H. (n.d.). Adaptive Mask for Region-based Facial Micro-Expression Recognition. 2–7.

Pei, Z., Xu, H., Zhang, Y., Guo, M., & Yang, Y. (2019). Face Recognition via Deep Learning Using Data Augmentation Based on Orthogonal Experiments. 1–16. https://doi.org/10.3390/electronics8101088.

Rice, L., Wong, E., & Kolter, J. Z. (2020). Overfitting in adversarially robust deep learning. 37th International Conference on Machine Learning, ICML 2020, PartF16814, 8049–8074.

Shinde, P.P. & Shah S. (2018). A Review of Machine Learning and Deep Learning Applications. 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA). IEEE, 1–6. https://doi.org/10.1109/ICCUBEA.2018.8697857.

Waheed, A., Goyal, M., Gupta, D., Khanna, A., Al-Turjman, F. & Pinheiro, P.R. (2020). CovidGAN: Data Augmentation Using Auxiliary Classifier GAN for Improved Covid-19 Detection. IEEE Access, 8, 91916–91923. https://doi.org/10.1109/ACCESS.2020.2994762.

Yang, B. (2021). Face Mask Aware Robust Facial Expression Recognition. 240–244.

Yang, B., Wu, J., Ikeda, K., Hattori, G., Sugano, M., Iwasawa, Y., & Matsuo, Y. (2022). Face-mask-aware Facial Expression Recognition based on Face Parsing and Vision Transformer. Pattern Recognition Letters, 164, 173–182. https://doi.org/10.1016/j.patrec.2022.11.004.

Zhang, H., Zhang, L., & Jiang, Y. (n.d.). Overfitting and Underfitting Analysis for Deep Learning Based End-to-end Communication Systems. 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP), 1–6.

 

Share
Back to top
International Journal of Systematic Innovation, Electronic ISSN: 2077-8767 Print ISSN: 2077-7973, Published by AccScience Publishing