skip to main content
10.1145/3650400.3650567acmotherconferencesArticle/Chapter ViewAbstractPublication PageseitceConference Proceedingsconference-collections
research-article

Multi-scale Face Detection Algorithm with Texture Feature Enhancement

Published: 17 April 2024 Publication History

Abstract

A multi-scale face detection algorithm with texture feature enhancement is proposed for the difficult problem of tiny and occluded face detection in natural scenes. In order to obtain more fine-grained face texture features and enhance the ability of tiny face detection, the backbone network structure of the baseline algorithm was adjusted. The feature enhancement module was introduced to obtain rich contextual feature information and improve local semantic information characterization, which facilitated detection of tiny and occluded faces by fusing the feature of different receptive fields. To increase the neck network's ability, an improved shuffle attention module ISAM was adopted to perceive face detail information and facilitate the performance of occlusion and tiny face detection. Multi-task Associative Detector Head (MTADH) is used to learn the correlation between the classification task and the regression task to better utilize regression features and classification features to improve face detection results. By controlling the width and depth of the detection networks, two detection networks with different sizes, MFDTFE-M and MFDTFE-S, are constructed. On the WIDER FACE validation dataset, the average precision of the two detection networks on the Hard subset is improved by 2.99% and 2.77% relative to the m-model and s-model of the baseline algorithm, and the performance of the tiny and occluded face detection is significantly improved.

References

[1]
Qilin He and Pingan Mu. 2021. Occlusion face detection based on VGG network and multi-feature fusion. Electronic Measurement Technology, 44(18): 150-154. https://doi.org/10.19651/j.cnki.emt.2107045
[2]
Rouhollah Kian Ara, Andrzej Matiolanski, Michał Grega, Andrzej Dziech and Remigiusz Baran. 2023. Efficient Face Detection Based Crowd Density Estimation using Convolutional Neural Networks and an Improved Sliding Window Strategy. International Journal of Applied Mathematics and Computer Science, 33(1): 7-20. https://doi.org/10.34768/amcs-2023-0001
[3]
Ziping Yu, Hongbo Huang, Weijun Chen, Yongxin Su, Yahui Liu, and Xiuying Wang. 2022. YOLO-FaceV2: A Scale and Occlusion Aware Face Detector. arXiv:2208.02019. https://arxiv.org/abs/2208.02019
[4]
Asher Trockman and J. Zico Kolter. 2022. Patches Are All You Need? arXiv:2201.09792. https://arxiv.org/abs/2201.09792
[5]
Jian Li, Yabiao Wang, Changan Wang, Ying Tai, Jianjun Qian, Jian Yang, Chengjie Wang, Jilin Li and Feiyue Huang. 2019. DSFD: dual shot face detector. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, Long Beach, CA, USA, 5055-5064. https://doi.org/10.1109/CVPR.2019.00520
[6]
Jiankang Deng, Jia Guo, Evangelos Ververas, Irene Kotsia and Stefanos Zafeiriou. 2020. RetinaFace: single-shot multi-level face localisation in the wild. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, Seattle, WA, USA, 5202-5211. https://doi.org/10.1109/CVPR42600.2020.00525
[7]
Jian Li, Bin Zhang, Yabiao Wang, Ying Tai, Zhenyu Zhang, Chengjie Wang, Jilin Li, Xiaoming Huang and Yili Xia. 2022. ASFD: Automatic and scalable face detector. arXiv:2201.10781. https://arxiv.org/abs/2201.10781
[8]
Yanjia Zhu, Hongxiang Cai, Shuhan Zhang, Chenhao Wang and Yichao Xiong. 2020. Tinaface: Strong but Simple Baseline for Face Detection. arXiv:2011.13183. https://arxiv.org/abs/2011.13183
[9]
Zhi Zhang, Jin Wang, Jie Wang and Jin Zheng. 2021. Multi-scale and texture feature enhancement for small face detection. Application Research of Computers, 38(03): 914-918. https://doi.org/10.19734/j.issn.1001-3695.2019.12.0696
[10]
Delong Qi, Weijun Tan and Qi Yao. 2022. YOLO5Face: Why reinventing a face detector. arXiv:2105.12931. https://arxiv.org/abs/2105.12931
[11]
Glenn Jocher. 2021. YOLOv5. https://github.com/ultralytics/yolov5.
[12]
Youngwan Lee, Joong-Won Hwang, Sangrok Lee, Yuseok Bae and Jongyoul Park. 2019. An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, Long Beach, CA, USA, 752-760. https://doi.org/10.1109/CVPRW.2019.00103
[13]
Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Xian Sun and Kun Fu. 2020. SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects. 2019 IEEE/CVF International Conference on Computer Vision. IEEE, Seoul, South Korea, 8232-8241. https://doi.org/10.1109/ICCV.2019.00832
[14]
Qinglong Zhang and Yubin Yang. 2021. SA-NET: Shuffle Attention for Deep Convolutional Neural Networks. 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE, Toronto, Canada, 2235-2239. https://doi.org/10.1109/ICASSP39728.2021.9414568
[15]
Sanghyun Woo, Jongchan Park, Joon-Young Lee and In So Kweon. 2018. CBAM: Convolutional Block Attention Module. 15th European Conference on Computer Vision. Springer, Munich, Germany, 3-19. https://doi.org/10.1007/978-3-030-01234-2_1
[16]
Xiaozhong Tong, Junyu Wei, Shaojing Su, Bei Sun and Zhen Zuo. 2023. Typical small target detection on water surfaces fusing attention and multi-scale features, Chinese Journal of Scientific Instrument, 44(01): 212-222. https://doi.org/10.19650/j.cnki.cjsi.J2210128
[17]
Shuo Yang, Ping Luo, Chen Change Loy and Xiaoou Tang. 2016. Wider Face: A Face Detection Benchmark. 2016 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, Seattle, WA, USA, 5525-5533. https://doi.org/10.1109/CVPR.2016.596
[18]
Mahyar Najibi, Pouya Samangouei, Rama Chellappa and Larry S. Davis. 2017. SSH: Single Stage Headless Face Detector. 16th IEEE International Conference on Computer Vision. IEEE, Venice, Italy, 4885-4894. https://doi.org/10.1109/ICCV.2017.522
[19]
Shunfu Hua, Huijie Fan, Naida Ding, Wei Li and Yandong Tang. 2022. A self-attention network for face detection based on unmanned aerial vehicles. International Conference on Intelligent Robotics and Applications. Springer, Harbin, China, 2022(13456): 440-449. https://doi.org/10.1007/978-3-031-13822-5_39
[20]
Yang Liu, Xu Tang, Jingtuo Liu, Junyu Han, Dinger Rui and Xiang Wu. 2020. HAMBox: Delving into Mining High-Quality Anchors on Face Detection. 2020 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, Seattle, WA, USA, 13043-13051. https://doi.org/10.1109/CVPR42600.2020.01306

Index Terms

  1. Multi-scale Face Detection Algorithm with Texture Feature Enhancement

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    EITCE '23: Proceedings of the 2023 7th International Conference on Electronic Information Technology and Computer Engineering
    October 2023
    1809 pages
    ISBN:9798400708305
    DOI:10.1145/3650400
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 April 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    EITCE 2023

    Acceptance Rates

    Overall Acceptance Rate 508 of 972 submissions, 52%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 6
      Total Downloads
    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 09 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media