skip to main content
10.1145/3562007.3562046acmotherconferencesArticle/Chapter ViewAbstractPublication PagesccrisConference Proceedingsconference-collections
research-article

Pornographic Image Recognition Based on High and Low Level Feature Fusion with Human Body Masking and Attention

Published: 12 October 2022 Publication History

Abstract

Pornographic image recognition is an important task for Internet content providers, especially for the video streaming service providers such as Bilibili, Tiktok and Tencent. However, most of existing pornographic image recognition works treat this task as a binary classification problem, which cannot achieve satisfactory performance in practice. In this paper, we develop a simple yet effective method to recognize pornographic images. First, we construct a dataset for model training and testing via collaboration with a famous video streaming service provider. Different from existing studies, our dataset consists of six classes and 14,885 images, of which 3627 are pornographic images. Then, considering that the focus of pornographic image recognition is human body, we employ mask-rcnn to mask the human bodies in images, and highlight the body areas by attention mechanism. Furthermore, we fuse both high-level and low-level features of bodies for classification. Here, the high-level features are extracted by a ResNet, and the low-level features are extracted by a Spatial Pyramid Pooling network SPP-net. Experiments on our dataset show that our proposed method performs better than the existing methods and achieves a precision of 82.7%. More importantly, our method has been applied to the pornographic image monitoring platform of the famous video streaming service provider mentioned above.

References

[1]
Buddhi Ashan, Hyuk Cho, and Qingzhong Liu. 2019. Performance Evaluation of Transfer Learning for Pornographic Detection. In Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery - Proceedings of the 15th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD 2019), Kunming, China, July 20-22, 2019 - Volume 2(Advances in Intelligent Systems and Computing, Vol. 1075), Yong Liu, Lipo Wang, Liang Zhao, and Zhengtao Yu (Eds.). Springer, 403–414. https://doi.org/10.1007/978-3-030-32591-6_43
[2]
Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, and Dahua Lin. 2019. MMDetection: Open MMLab Detection Toolbox and Benchmark. CoRR abs/1906.07155(2019). arXiv:1906.07155http://arxiv.org/abs/1906.07155
[3]
Tan Xiaoyang Chen Xiao, Jin Xin. 2016. [Single-person bad picture recognition based on torso detection](in Chinese). Journal of Image and Graphics 21, 03 (2016), 348–355. https://doi.org/10.11834/jig.20160309
[4]
Margaret M. Fleck, David A. Forsyth, and Christoph Bregler. 1996. Finding Naked People. In Computer Vision - ECCV’96, 4th European Conference on Computer Vision, Cambridge, UK, April 15-18, 1996, Proceedings, Volume II(Lecture Notes in Computer Science, Vol. 1065), Bernard F. Buxton and Roberto Cipolla (Eds.). Springer, 593–602. https://doi.org/10.1007/3-540-61123-1_173
[5]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross B. Girshick. 2017. Mask R-CNN. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22-29, 2017. IEEE Computer Society, 2980–2988. https://doi.org/10.1109/ICCV.2017.322
[6]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37, 9 (2015), 1904–1916. https://doi.org/10.1109/TPAMI.2015.2389824
[7]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society, 770–778. https://doi.org/10.1109/CVPR.2016.90
[8]
Wu Huishi. 2021. Sensitive image recognition algorithm research and system implementation. Master’s thesis. Xidian Universityy. https://doi.org/10.27389/d.cnki.gxadu.2021.000577
[9]
Feng Jiao, Wen Gao, Lijuan Duan, and Guoqin Cui. 2001. Detecting adult image using multiple features. In 2001 International Conferences on Info-Tech and Info-Net. Proceedings (Cat. No. 01EX479), Vol. 3. IEEE, 378–383.
[10]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2017. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 6 (2017), 84–90. https://doi.org/10.1145/3065386
[11]
Kai Li, Junliang Xing, Bing Li, and Weiming Hu. 2016. Bootstrapping deep feature hierarchy for pornographic image recognition. In 2016 IEEE International Conference on Image Processing, ICIP 2016, Phoenix, AZ, USA, September 25-28, 2016. IEEE, 4423–4427. https://doi.org/10.1109/ICIP.2016.7533196
[12]
Tsung-Yi Lin, Michael Maire, Serge J. Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. In Computer Vision - ECCV 2014 - 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V(Lecture Notes in Computer Science, Vol. 8693), David J. Fleet, Tomás Pajdla, Bernt Schiele, and Tinne Tuytelaars (Eds.). Springer, 740–755. https://doi.org/10.1007/978-3-319-10602-1_48
[13]
Zhao Xi Liu Bingzhi. 2019. Retrospect and Prospect of Research on Youth Internet Pornography in my country. Forward Position 02(2019), 95–104. https://doi.org/10.14129/j.cnki.forwordposition.2019.02.014
[14]
David G. Lowe. 2004. Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis. 60, 2 (2004), 91–110. https://doi.org/10.1023/B:VISI.0000029664.99615.94
[15]
Mohamed N. Moustafa. 2015. Applying deep learning to classify pornographic images and videos. CoRR abs/1511.08899(2015). arXiv:1511.08899http://arxiv.org/abs/1511.08899
[16]
Xinyu Ou, Hefei Ling, Han Yu, Ping Li, Fuhao Zou, and Si Liu. 2017. Adult Image and Video Recognition by a Deep Multicontext Network and Fine-to-Coarse Strategy. ACM Trans. Intell. Syst. Technol. 8, 5 (2017), 68:1–68:25. https://doi.org/10.1145/3057733
[17]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1409.1556
[18]
Josef Sivic and Andrew Zisserman. 2003. Video Google: A Text Retrieval Approach to Object Matching in Videos. In 9th IEEE International Conference on Computer Vision (ICCV 2003), 14-17 October 2003, Nice, France. IEEE Computer Society, 1470–1477. https://doi.org/10.1109/ICCV.2003.1238663
[19]
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott E. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7-12, 2015. IEEE Computer Society, 1–9. https://doi.org/10.1109/CVPR.2015.7298594
[20]
Xie Xin. 2020. Research on Bad Information Detection Technology Based on Deep Learning. Master’s thesis. University of Electronic Science and Technology of China. https://doi.org/10.27005/d.cnki.gdzku.2020.001260
[21]
Yang Xiong. 2018. Research and Application of Deep Learning in Internet Pornographic Image Recognition. Journal of Anyang Institute of Technology 17, 06 (2018), 39–42. https://doi.org/10.19329/j.cnki.1673-2928.2018.06.011
[22]
Wang Yijun Yu Mingyang, Yang Peng. 2018. Pornography image detection based on convolutional neural network. Computer Applications and Software 35, 01 (2018), 232–236+275. https://doi.org/10. 3969 /j. issn. 1000-386x. 2018. 01. 040

Index Terms

  1. Pornographic Image Recognition Based on High and Low Level Feature Fusion with Human Body Masking and Attention

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CCRIS '22: Proceedings of the 2022 3rd International Conference on Control, Robotics and Intelligent System
    August 2022
    253 pages
    ISBN:9781450396851
    DOI:10.1145/3562007
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Attention
    2. Feature fusion.
    3. Human body masking
    4. Pornographic image recognition
    5. ResNet

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    Conference

    CCRIS'22

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 62
      Total Downloads
    • Downloads (Last 12 months)20
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 25 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media