skip to main content
10.1145/3461353.3461369acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiciaiConference Proceedingsconference-collections
research-article

Visible-Thermal Pedestrian Detection via Unsupervised Transfer Learning

Published: 04 September 2021 Publication History

Abstract

Recently, pedestrian detection using visible-thermal pairs plays a key role in around-the-clock applications, such as public surveillance and autonomous driving. However, the performance of a well-trained pedestrian detector may drop significantly when it is applied to a new scenario. Normally, to achieve a good performance on the new scenario, manual annotation of the dataset is necessary, while it is costly and unscalable. In this work, an unsupervised transfer learning framework is proposed for visible-thermal pedestrian detection tasks. Given well-trained detectors from a source dataset, the proposed framework utilizes an iterative process to generate and fuse training labels automatically, with the help of two auxiliary single-modality detectors (visible and thermal). To achieve label fusion, the knowledge of daytime and nighttime is adopted to assign priorities to labels according to their illumination, which improves the quality of generated training labels. After each iteration, the existing detectors are updated using new training labels. Experimental results demonstrate that the proposed method obtains state-of-the-art performance without any manual training labels on the target dataset.

References

[1]
Markus Braun, Sebastian Krebs, Fabian Flohr, and Dariu M Gavrila. 2019. EuroCity Persons: A Novel Benchmark for Person Detection in Traffic Scenes. IEEE Trans. Pattern Anal. Mach. Intell. 41, 8 (2019), 1844–1861.
[2]
Yanpeng Cao, Dayan Guan, Weilin Huang, Jiangxin Yang, Yanlong Cao, and Yu Qiao. 2019. Pedestrian detection with unsupervised multispectral feature learning using deep neural networks. Inf. Fusion 46(2019), 206–217.
[3]
James W Davis and Vinay Sharma. 2007. Background-Subtraction Using Contour-Based Fusion of Thermal and Visible Imagery. Comput. Vis. Image. Underst. 106, 2-3 (2007), 162–182.
[4]
Kevin Fritz, Daniel König, Ulrich Klauck, and Michael Teutsch. 2019. Generalization Ability of Region Proposal Networks for Multispectral Person Detection. In Automatic Target Recognition XXIX, Vol. 10988. International Society for Optics and Photonics, SPIE, 109880Y.
[5]
Alejandro González, Zhijie Fang, Yainuvis Socarras, Joan Serrat, David Vázquez, Jiaolong Xu, and Antonio M López. 2016. Pedestrian Detection at Day/Night Time with Visible and FIR Cameras: A Comparison. Sensors 16, 6 (jun 2016), 820.
[6]
Dayan Guan, Yanpeng Cao, Jiangxin Yang, Yanlong Cao, and Michael Ying Yang. 2019. Fusion of Multispectral Data Through Illumination-Aware Deep Neural Networks for Pedestrian Detection. Inf. Fusion 50(2019), 148–157.
[7]
Dayan Guan, Xing Luo, Yanpeng Cao, Jiangxin Yang, Yanlong Cao, George Vosselman, and Michael Ying Yang. 2019. Unsupervised Domain Adaptation for Multispectral Pedestrian Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, 434–443.
[8]
Jan Hosang, Mohamed Omran, Rodrigo Benenson, and Bernt Schiele. 2015. Taking a Deeper Look at Pedestrians. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 4073–4082.
[9]
Han-Kai Hsu, Chun-Han Yao, Yi-Hsuan Tsai, Wei-Chih Hung, Hung-Yu Tseng, Maneesh Singh, and Ming-Hsuan Yang. 2020. Progressive Domain Adaptation for Object Detection. In The IEEE Winter Conference on Applications of Computer Vision. IEEE, 749–757.
[10]
Soonmin Hwang, Jaesik Park, Namil Kim, Yukyung Choi, and In So Kweon. 2015. Multispectral pedestrian detection: Benchmark dataset and baseline. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 1037–1045.
[11]
Daniel Konig, Michael Adam, Christian Jarvers, Georg Layher, Heiko Neumann, and Michael Teutsch. 2017. Fully Convolutional Region Proposal Networks for Multispectral Person Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, 49–56.
[12]
Chengyang Li, Dan Song, Ruofeng Tong, and Min Tang. 2019. Illumination-Aware Faster R-CNN for Robust Multispectral Pedestrian Detection. Pattern Recognit. 85(2019), 161–171.
[13]
Zuoxin Li and Fuqiang Zhou. 2017. FSSD: Feature Fusion Single Shot Multibox Detector. arxiv:1712.00960
[14]
Jingjing Liu, Shaoting Zhang, Shu Wang, and Dimitris Metaxas. 2016. Multispectral Deep Neural Networks for Pedestrian Detection. In Procedings of the British Machine Vision Conference. BMVC Press, 73.1–73.13.
[15]
Yingwei Pan, Ting Yao, Yehao Li, Yu Wang, Chong-Wah Ngo, and Tao Mei. 2019. Transferrable Prototypical Networks for Unsupervised Domain Adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2239–2247.
[16]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2017. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 39, 6 (2017), 1137–1149.
[17]
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, 2015. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vision 115, 3 (2015), 211–252.
[18]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations.
[19]
Markus D Solbach and John K Tsotsos. 2017. Vision-Based Fallen Person Detection for the Elderly. In Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCVW). IEEE, 1433–1442.
[20]
Xiaogang Wang, Meng Wang, and Wei Li. 2013. Scene-Specific Pedestrian Detection for Static Video Surveillance. IEEE Trans. Pattern Anal. Mach. Intell. 36, 2 (2013), 361–374.
[21]
Karl Weiss, Taghi M Khoshgoftaar, and DingDing Wang. 2016. A Survey of Transfer Learning. J. Big Data 3, 1 (2016), 9.
[22]
Zhiheng Yang, Jun Li, and Huiyun Li. 2018. Real-time Pedestrian and Vehicle Detection for Autonomous Driving. In IEEE Intelligent Vehicles Symposium (IV). IEEE, 179–184.
[23]
Heng Zhang, Elisa Fromont, Sébastien Lefèvre, and Bruno Avignon. 2020. Multispectral Fusion for Object Detection with Cyclic Fuse-and-Refine Blocks. In IEEE International Conference on Image Processing. IEEE, 276–280.
[24]
Lu Zhang, Zhiyong Liu, Shifeng Zhang, Xu Yang, Hong Qiao, Kaizhu Huang, and Amir Hussain. 2019. Cross-Modality Interactive Attention Network for Multispectral Pedestrian Detection. Inf. Fusion 50(2019), 20–29.
[25]
Yongtao Zhang, Zhishuai Yin, Linzhen Nie, and Song Huang. 2020. Attention Based Multi-Layer Fusion of Multispectral Images for Pedestrian Detection. IEEE Access 8(2020), 165071–165084.

Cited By

View all

Index Terms

  1. Visible-Thermal Pedestrian Detection via Unsupervised Transfer Learning
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        ICIAI '21: Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence
        March 2021
        246 pages
        ISBN:9781450388634
        DOI:10.1145/3461353
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 04 September 2021

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Deep neural networks
        2. Domain adaption
        3. Pedestrian detection
        4. Unsupervised transfer learning

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Funding Sources

        Conference

        ICIAI 2021

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)7
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 17 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media