skip to main content
survey

Human Body Pose Estimation for Gait Identification: A Comprehensive Survey of Datasets and Models

Published: 07 December 2022 Publication History

Abstract

Person identification is a problem that has received substantial attention, particularly in security domains. Gait recognition is one of the most convenient approaches enabling person identification at a distance without the need for high-quality images. There are several review studies addressing person identification such as the utilization of facial images, silhouette images, and wearable sensor. Despite skeleton-based person identification gaining popularity while overcoming the challenges of traditional approaches, existing survey studies lack the comprehensive review of skeleton-based approaches to gait identification. We present a detailed review of the human pose estimation and gait analysis that makes the skeleton-based approaches possible. The study covers various types of related datasets, tools, methodologies, and evaluation metrics with associated challenges, limitations, and application domains. Detailed comparisons are presented for each of these aspects with recommendations for potential research and alternatives. A common trend throughout this paper is the positive impact that deep learning techniques are beginning to have on topics such as human pose estimation and gait identification. The survey outcomes might be useful for the related research community and other stakeholders in terms of performance analysis of existing methodologies, potential research gaps, application domains, and possible contributions in the future.

Supplementary Material

3533384.supp (3533384.supp.pdf)
Supplementary material

References

[1]
Virginia Andersson, Rafael Dutra, and Ricardo Araújo. 2014. Anthropometric and human gait identification using skeleton data from Kinect sensor. In Proceedings of the ACM SympOsium on Applied Computing. 60–61. DOI:
[2]
Mykhaylo Andriluka, Umar Iqbal, Eldar Insafutdinov, Leonid Pishchulin, Anton Milan, Juergen Gall, and Bernt Schiele. 2018. PoseTrack: A benchmark for human pose estimation and tracking. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 5167–5176. DOI:
[3]
Mykhaylo Andriluka, Leonid Pishchulin, Peter Gehler, and Bernt Schiele. 2014. 2D human pose estimation: New benchmark and state of the art analysis. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, 3686–3693. DOI:
[4]
Anurag Arnab, Carl Doersch, and Andrew Zisserman. 2019. Exploiting temporal context for 3D human pose estimation in the wild. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 3390–3399. DOI:
[5]
Bruno Artacho and Andreas Savakis. 2020. UniPose: Unified human pose estimation in single images and videos. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 7033–7042. DOI:
[6]
Emrah Aydemir. 2020. Gait sounds. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/emrahaydemr/gait-sounds/metadata.
[7]
Emrah Aydemir, Turker Tuncer, Sengul Dogan, and Musa Unsal. 2021. A novel biometric recognition method based on multi kernelled bijection octal pattern using gait sound. Appl. Acoust. 173, (2021), 107701. DOI:
[8]
Michal Balazia and Petr Sojka. 2018. You are how you walk: Uncooperative MoCap gait identification for video surveillance with incomplete and noisy data. In Proceedings of the IEEE International Joint Conference on Biometrics, (IJCB 2017). 208–215. DOI:
[9]
Adrian Ball, David Rye, Fabio Ramos, and Mari Velonaki. 2012. Unsupervised clustering of people from “skeleton” data. In HRI’12 - Proceedings of the 7th Annual ACM/IEEE International Conference on Human-Robot Interaction. 225–226. DOI:
[10]
Mark Barnard, Matti Matilainen, and Janne Heikkilä. 2008. Body part segmentation of noisy human silhouette images. 2008 IEEE Int. Conf. Multimed. Expo, ICME 2008 - Proc. (2008), 1189–1192. DOI:
[11]
Francesco Battistone and Alfredo Petrosino. 2019. TGLSTM: A time based graph deep learning approach to gait recognition. Pattern Recognit. Lett. 126, (2019), 132–138. DOI:
[12]
R. Bednarski and A. Bielak. 2018. Use of motion capture in assisted of knee ligament injury diagnosis. J. Appl. Comput. Sci. 26, 1 (2018), 7.
[13]
Vasileios Belagiannis and Andrew Zisserman. 2017. Recurrent human pose estimation. In Proceedings of the 12th IEEE International Conference on Automated Face Gesture Recognition (FG 2017) - 1st Int. Work. Adapt. Shot Learn. Gesture Underst. Prod. ASL4GUP 2017, Biometrics Wild, Bwild 2017, Heteroge (2017), 468–475. DOI:
[14]
Abdallah Benzine, Florian Chabot, Bertrand Luvison, Quoc Cuong Pham, and Catherine Achard. 2020. Pandanet: Anchor-based single-shot multi-person 3D pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 6855–6864. DOI:
[15]
Imed Bouchrika, John N. Carter, and Mark S. Nixon. 2016. Towards automated visual surveillance using gait for identity recognition and tracking across multiple non-intersecting cameras. Multimed. Tools Appl. 75, 2 (2016), 1201–1221. DOI:
[16]
Joan Bruna, Wojciech Zaremba, Arthur Szlam, and Yann LeCun. 2014. Spectral networks and deep locally connected networks on graphs. In Proceedings of the 2nd International Conference on Learning Representations (ICLR 2014) - Conference Track Proceedings. 1–14.
[17]
Adrian Bulat and Georgios Tzimiropoulos. 2016. Human pose estimation via convolutional part heatmap regression. In Proceedings of the European Conference on Computer Vision (ECCV). 717–732. DOI:
[18]
Matt Burgess. 2018. Facial recognition tech used by UK police is making a ton of mistakes. WIRED. Retrieved March 24, 2022 from https://www.wired.co.uk/article/face-recognition-police-uk-south-wales-met-notting-hill-carnival.
[19]
Yuanhao Cai, Zhicheng Wang, Zhengxiong Luo, Binyi Yin, Angang Du, Haoqian Wang, Xiangyu Zhang, Xinyu Zhou, Erjin Zhou, and Jian Sun. 2020. Learning delicate local representations for multi-person pose estimation. In Proceedings of the European Conference on Computer Vision (ECCV). 455–472. DOI:
[20]
Zhe Cao, Gines Hidalgo, Tomas Simon, Shih En Wei, and Yaser Sheikh. 2021. OpenPose: Realtime multi-person 2D pose estimation using part affinity fields. IEEE Trans. Pattern Anal. Mach. Intell. 43, 1 (2021), 172–186. DOI:
[21]
Joao Carreira, Pulkit Agrawal, Katerina Fragkiadaki, and Jitendra Malik. 2016. Human pose estimation with iterative error feedback. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 4733–4742. DOI:
[22]
Rafael Lima De Carvalho and Paulo Fernando Ferreira Rosa. 2010. Identification system for smart homes using footstep sounds. In Proceedings of the IEEE International Symposium on Industrial Electronics. IEEE, 1639–1644. DOI:
[23]
Aakarshan Chauhan. 2020. Gait Silhouette Dataset. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/watermasterz/gaitsilhouettedataset?select=GaitDatasetC-silh.
[24]
Ching Hang Chen and Deva Ramanan. 2017. 3D human pose estimation = 2D pose estimation + matching. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). 5759–5767. DOI:
[25]
Yucheng Chen, Yingli Tian, and Mingyi He. 2020. Monocular human pose estimation: A survey of deep learning-based methods. Comput. Vis. Image Underst. 192, January (2020), 102897. DOI:
[26]
Yu Cheng, Bo Yang, Bo Wang, Yan Wending, and Robby Tan. 2019. Occlusion-aware networks for 3D human pose estimation in video. In Proceedings of the IEEE International Conference on Computer Vision. IEEE, 723–732. DOI:
[27]
Roman Chereshnev. 2020. HuGaDB: Human Gait Database. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/romanchereshnev/hugadb-human-gait-database.
[28]
Roman Chereshnev and Attila Kertész-Farkas. 2017. HuGaDB: Human gait database for activity recognition from wearable inertial sensor networks. In Proceedings of the International Conference on Analysis of Images, Social Networks and Texts. 131–141. DOI:
[29]
Patrick Connor and Arun Ross. 2018. Biometric recognition by gait: A survey of modalities and features. Comput. Vis. Image Underst. 167, January (2018), 1–27. DOI:
[30]
James E. Cutting and Lynn T. Kozlowski. 1977. Recognizing friends by their walk: Gait perception without familiarity cues. Bull. Psychon. Soc. 9, 5 (1977), 353–356. DOI:
[31]
Qi Dang, Jianqin Yin, Bin Wang, and Wenqing Zheng. 2019. Deep learning-based 2D human pose estimation: A survey. Tsinghua Sci. Technol. 24, 6 (2019), 663–676. DOI:
[32]
Abir Das, Anirban Chakraborty, and Amit K. Roy-Chowdhury. 2014. Consistent re-identification in a camera network. In Proceedings of the European Conference on Computer Vision (ECCV). 330–345. DOI:
[33]
Mehdi Ozel. 2019. Human gait phase dataset. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/dasmehdixtr/human-gait-phase-dataset/metadata.
[34]
Bappaditya Debnath, Mary O'brien, Motonori Yamaguchi, and Ardhendu Behera. 2019. Adapting MobileNets for mobile based upper body pose estimation. In Proceedings of the 2018 15th IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS'18). 1--6. DOI:
[35]
Fani Deligianni, Yao Guo, and Guang Zhong Yang. 2019. From emotions to mood disorders: A survey on gait analysis methodology. IEEE J. Biomed. Heal. Informatics 23, 6 (2019), 2302–2316. DOI:
[36]
T. A. Exell, M. J. R. Gittoes, G. Irwin, and D. G. Kerwin. 2012. Gait asymmetry: Composite scores for mechanical analyses of sprint running. J. Biomech. 45, 6 (2012), 1108–1111. DOI:
[37]
Matteo Fabbri, Fabio Lanzi, Simone Calderara, Andrea Palazzi, Roberto Vezzani, and Rita Cucchiara. 2018. Learning to detect and track visible and occluded body joints in a virtual world. In Proccedings of the European Conference on Computer Vision (ECCV). 450–466. DOI:
[38]
Xiaochuan Fan, Kang Zheng, Yuewei Lin, and Song Wang. 2015. Combining local appearance and holistic view: Dual-Source deep neural networks for human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 1347–1355. DOI:
[39]
Hao Shu Fang, Shuqin Xie, Yu Wing Tai, and Cewu Lu. 2017. RMPE: Regional multi-person pose estimation. In Proceedings of the IEEE International Conference on Computer Vision 2017-(October 2017), 2353–2362. DOI:
[40]
Mihai Fieraru, Anna Khoreva, Leonid Pishchulin, and Bernt Schiele. 2018. Learning to refine human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. 318–327. DOI:
[41]
Claudio Filipi Gonçalves dos Santos, Diego de Souza Oliveira, Leandro A. Passos, Rafael Gonçalves Pires, Daniel Felipe Silva Santos, Lucas Pascotti Valem, Thierry P. Moreira, Marcos Cleison S. Santana, Mateus Roder, Jo Paulo Papa, and Danilo Colombo. 2022. Gait recognition based on deep learning: A survey. ACM Comput. Surv. 55, 2 (2022), 1–34. DOI:
[42]
Chris Fox. 2018. Face recognition police tools “staggeringly inaccurate.” BBC News. Retrieved March 24, 2022 from https://www.bbc.co.uk/news/technology-44089161.
[43]
Davrondzhon Gafurov. 2007. A survey of biometric gait recognition: Approaches, security and challenges. Annu. Nor. Comput. Sci. Conf. June (2007), 19–21.
[44]
Giorgia. 2018. Simulation of Parkinson Movement Disorders. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/giofra/simulation-of-parkinson-movement-disorders.
[45]
Georgia Gkioxari, Alexander Toshev, and Navdeep Jaitly. 2016. Chained predictions using convolutional neural networks. In Proceedings of the European Conference on Computer Vision (ECCV). DOI:
[46]
Wenjuan Gong, Xuena Zhang, Jordi Gonzàlez, Andrew Sobral, Thierry Bouwmans, Changhe Tu, and El Hadi Zahzah. 2016. Human pose estimation from monocular images: A comprehensive survey. Sensors (Switzerland) 16, 12 (2016), 1–39. DOI:
[47]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. Adv. Neural Inf. Process. Syst. (2014), 4089–4099.
[48]
Ramiz Görkem Birdal, Ahmit Sertbaş, and Bilgisayar Mïhendisliği. 2018. Human identification based on gait analysis: A survey. In Proceedings of the 2018 3rd International Conference on Computer Science and Engineering (UBMK). 6–10.
[49]
Monica Gruosso, Nicola Capece, and Ugo Erra. 2020. Human segmentation in surveillance video with deep learning. Multimed. Tools Appl. 80, 1 (2020), 1175–1199. DOI:
[50]
Ju Han and Bir Bhanu. 2006. Individual recognition using gait energy image. IEEE Trans. Pattern Anal. Mach. Intell. 28, 2 (2006), 316–322. DOI:
[51]
Albert Haque, Alexandre Alahi, and Fei Fei Li. 2016. Recurrent attention models for depth-based person identification. In Proceedings of the IEEE Computer Society Conference on Computer Vision Pattern Recognition. Comput. Vis. Pattern Recognit. 2016-Decem, (2016), 1229–1238. DOI:
[52]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2020. Mask R-CNN. IEEE Trans. Pattern Anal. Mach. Intell. 42, 2 (2020), 386–397. DOI:
[53]
Martin Hirzer, Csaba Beleznai, Peter M. Roth, and Horst Bischof. 2011. Person re-identification by descriptive and discriminative classification. In Scandinavian conference on Image analysis. 91–102. DOI:
[54]
Martin Hofmann, Sebastian Bachmann, and Gerhard Rigoll. 2012. 2. 5D Gait biometrics using the depth gradient histogram energy image. In Proceedings of the 2012 IEEE 5th International Conference on Biometrics: Theory, Applications and Systems (BTAS). IEEE, 399–403.
[55]
Martin Hofmann, Jürgen Geiger, Sebastian Bachmann, Björn Schuller, and Gerhard Rigoll. 2014. The TUM gait from audio, image and depth (GAID) database: Multimodal recognition of subjects and traits. J. Vis. Commun. Image Represent. 25, 1 (2014), 195–206. DOI:
[56]
Michael B. Holte, Cuong Tran, Mohan M. Trivedi, and Thomas B. Moeslund. 2012. Human pose estimation and activity recognition from multi-view videos: Comparative explorations of recent developments. IEEE J. Sel. Top. Signal Process. 6, 5 (2012), 538–552. DOI:
[57]
Huang Huan. 2018. Walk or run. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/huan9huan/walk-or-run.
[58]
Junjie Huang, Zheng Zhu, Feng Guo, and Guan Huang. 2020. The devil is in the details: Delving into unbiased data processing for human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 5699–5708. DOI:
[59]
Shaoli Huang, Mingming Gong, and Dacheng Tao. 2017. A coarse-fine network for keypoint localization. In Proceedings of the IEEE International Conference on Computer Vision. 3047–3056. DOI:
[60]
Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, and Bernt Schiele. 2017. ArtTrack: Articulated multi-person tracking in the wild. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, (CVPR 2017). 1293–1301. DOI:
[61]
Eldar Insafutdinov, Leonid Pishchulin, Bjoern Andres, Mykhaylo Andriluka, and Bernt Schiele. 2016. Deepercut: A deeper, stronger, and faster multi-person pose estimation model. In Proceedings of the European Conference on Computer Vision (ECCV). 34–50. DOI:
[62]
Catalin Ionescu, Dragos Papava, Vlad Olaru, and Cristian Sminchisescu. 2014. Human3.6M: Large scale datasets and predictive methods for 3D human sensing in natural environments. IEEE Trans. Pattern Anal. Mach. Intell. 36, 7 (2014), 1325–1339. DOI:
[63]
Khawar Islam. 2020. Person search: New paradigm of person re-identification: A survey and outlook of recent works. Image Vis. Comput. 101, (2020), 103970. DOI:
[64]
Haruyuki Iwama, Mayu Okumura, Yasushi Makihara, and Yaushi Yagi. 2012. The OU-ISIR gait database comprising the large population dataset and performance evaluation of gait recognition. IEEE Trans. Inf. FORENSICS Secur. 7, 1 (2012), 1511–1521. DOI:
[65]
Arjun Jain, Jonathan Tompson, Yann LeCun, and Christoph Bregler. 2015. MoDeep: A deep learning framework using motion features for human pose estimation. In Proceedings of the European Conference on Computer Vision (ECCV). 302–315. DOI:
[66]
Ahmad Jalal, Amir Nadeem, and Satoshi Bobasu. 2019. Human body parts estimation and detection for physical sports movements. In Proceedings of the 2019 2nd International Conference on Communications and Computer Digital Systems. (C-CODE 2019). 104–109. DOI:
[67]
Nataraj Jammalamadaka, Andrew Zisserman, Marcin Eichner, Vittorio Ferrari, and C. V. Jawahar. 2012. Has my algorithm succeeded? An evaluator for human pose estimators. In Proceedings of the European Conference on Computer Vision (ECCV). 114–128. DOI:
[68]
Hueihan Jhuang, Juergen Gall, Silvia Zuffi, Cordelia Schmid, and Michael J. Black. 2013. Towards understanding action recognition. In Proceedings of the IEEE International Conference on Computer Vision. 3192–3199. DOI:
[69]
Xiaofei Ji and Honghai Liu. 2010. Advances in view-invariant human motion analysis: A review. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 40, 1 (2010), 13–24. DOI:
[70]
Xiaopeng Ji, Qi Fang, Junting Dong, Qing Shuai, Wen Jiang, and Xiaowei Zhou. 2020. A survey on monocular 3D human pose estimation. Virtual Real. Intell. Hardw. 2, 6 (2020), 471–500. DOI:
[71]
Sam Johnson and Mark Everingham. 2010. Clustered pose and nonlinear appearance models for human pose estimation. In Proceedings of the British Visual Conference. (BMVC 2010) - ii (2010), 1–11. DOI:
[72]
Hanbyul Joo, Tomas Simon, Xulong Li, Hao Liu, Lei Tan, Lin Gui, Sean Banerjee, Timothy Godisart, Bart Nabbe, Iain Matthews, Takeo Kanade, Shohei Nobuhara, and Yaser Sheikh. 2019. Panoptic Studio: A massively multiview system for social interaction capture. IEEE Trans. Pattern Anal. Mach. Intell. 41, 1 (2019), 190–204. DOI:
[73]
Ketan Joshi, Vikas Tripathi, Chitransh Bose, and Chaitanya Bhardwaj. 2020. Robust sports image classification using inceptionV3 and neural networks. In International Conference on Computational Intelligence and Data Science (ICCIDS 2019). Elsevier B.V., 2374–2381. DOI:
[74]
Muhammad Hassan Khan, Muhammad Shahid Farid, and Marcin Grzegorzek. 2021. Vision-based approaches towards person identification using gait. Comput. Sci. Rev. 42, (2021), 100432. DOI:
[75]
Wasiq Khan and Atta Badii. 2019. Pathological gait abnormality detection and segmentation by processing the hip joints motion data to support mobile gait rehabilitation. Res. Med. Eng. Sci. 7, 3 (2019). DOI:
[76]
Tae Soo Kim and Austin Reiter. 2017. Interpretable 3D human action analysis with temporal convolutional networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision Pattern Recognition. Work. 2017 (July, 2017), 1623–1631. DOI:
[77]
Natasa Kleanthous, Abir Jaafar Hussain, Wasiq Khan, and Panos Liatsis. 2020. A new machine learning based approach to predict Freezing of Gait. Pattern Recognit. Lett. 140, (2020), 119–126. DOI:
[78]
Sven Kreiss, Lorenzo Bertoni, and Alexandre Alahi. 2019. PifPaf: Composite fields for human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 11969–11978. DOI:
[79]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In NeurIPS. DOI:
[80]
Munish Kumar, Navdeep Singh, Ravinder Kumar, Shubham Goel, and Krishan Kumar. 2021. Gait recognition based on vision systems: A systematic survey. J. Vis. Commun. Image Represent. 75, August 2020 (2021), 103052. DOI:
[81]
Giorgia Gkioxari. 2018. Simulation of Parkinson Movement Disorders. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/giofra/simulation-of-parkinson-movement-disorders.
[82]
Kyoungoh Lee, Inwoong Lee, and Sanghoon Lee. 2018. Propagating LSTM: 3D pose estimation based on joint interdependency. In Proceedings of the European Conference on Computer Vision (ECCV). 123–141. DOI:
[83]
Chen Li and Gim Hee Lee. 2019. Generating multiple hypotheses for 3D human pose estimation with mixture density network. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 9879–9887. DOI:
[84]
Jia Li, Wen Su, and Zengfu Wang. 2020. Simple pose: Rethinking and improving a bottom-up approach for multi-person pose estimation. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI-2020). 11354–11361. DOI:
[85]
Sijin Li, Zhi Qiang Liu, and Antoni B. Chan. 2014. Heterogeneous multi-task learning for human pose estimation with deep convolutional neural network. Int. J. Comput. Vis. 113, 1 (2014), 19–36. DOI:
[86]
Sijin Li, Weichen Zhang, and Antoni B. Chan. 2015. Maximum-margin structured learning with deep networks for 3D human pose estimation. In Proceedings of the IEEE International Conference on Computer Vision. 2848–2856. DOI:
[87]
Shuang Liang, Xiao Sun, and Yichen Wei. 2018. Compositional human pose regression. Comput. Vis. Image Underst. 176–177, (2018), 1–8. DOI:
[88]
Rijun Liao, Shiqi Yu, Weizhi An, and Yongzhen Huang. 2020. A model-based gait recognition method with body pose and human prior knowledge. Pattern Recognit. 98, (2020), 107069. DOI:
[89]
Ita Lifshitz, Ethan Fetaya, and Shimon Ullman. 2016. Human pose estimation using deep consensus voting. In Proceedings of the European Conference on Computer Vision (ECCV). 246–260. DOI:
[90]
Tsung-yi Lin, Genevieve Patterson, Matteo R. Ronchi, Yin Cui, Michael Maire, Serge Belongie, Lobomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, Larry Zitnick, and Piotr Dollar. 2020. COCO. Common Objects in Context. Retrieved May 11, 2021 from https://cocodataset.org/#home.
[91]
Tsung Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision (ECCV). 740–755. DOI:
[92]
Mengyuan Liu, Hong Liu, and Chen Chen. 2017. Enhanced skeleton visualization for view invariant human action recognition. Pattern Recognit. 68, (2017), 346–362. DOI:
[93]
Zhao Liu, Jianke Zhu, Jiajun Bu, and Chun Chen. 2015. A survey of human pose estimation: The body parts parsing based methods. J. Vis. Commun. Image Represent. 32, (2015), 10–19. DOI:
[94]
Zongyi Liu and Sudeep Sarkar. 2005. Effect of silhouette quality on hard problems in gait recognition. IEEE Trans. Syst. Man, Cybern. Part B Cybern. 35, 2 (2005), 170–183. DOI:
[95]
Matthew Loper, Naureen Mahmood, Javier Romero, Gerard Pons-Moll, and Michael J. Black. 2015. SMPL: A skinned multi-person linear model. ACM Trans. Graph. 34, 6 (2015), 1–16. DOI:
[96]
Jing Luo, Chunyuan Zi, Jianliang Zhang, and Yue Liu. 2017. Gait recognition using GEI and AFDEI. Int. J. Opt. 44, 4 (2017), 400–404. DOI:
[97]
Yue Luo, Jimmy Ren, Zhouxia Wang, Wenxiu Sun, Jinshan Pan, Jianbo Liu, Jiahao Pang, and Liang Lin. 2018. LSTM pose machines. In Proceedings of the European Conference on Computer Vision (ECCV). 5207–5215.
[98]
Diogo C. Luvizon, Hedi Tabia, and David Picard. 2019. Human pose regression by combining indirect part detection and contextual information. Comput. Graph. 85, (2019), 15–22. DOI:
[99]
Diogo C. Luvizon, David Picard, and Hedi Tabia. 2018. 2D /3D Pose estimation and action recognition using multitask deep learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 5137–5146.
[100]
Timo von Marcard, Roberto Henschel, Michael J. Black, Bodo Rosenhahn, and Gerard Pons-Moll. 2018. Recovering accurate 3D human pose in the wild using IMUs and a moving camera. In Proceedings of the European Conference on Computer Vision. Lecture Notes in Computer Science (LNCS). vol. 11214. Springer, (2018), 614–631. DOI:
[101]
Timo von Marcard, Roberto Henschel, Michael Black, Bodo Rosenhahn, and Gerard Pons-Moll. 2012. 3D poses in the wild dataset. Virtual Humans. Retrieved July 23, 2021 from https://virtualhumans.mpi-inf.mpg.de/3DPW/.
[102]
Eric Marchand, Hideaki Uchiyama, and Fabien Spindler. 2016. Pose estimation for augmented reality: A hands-On Survey. IEEE Trans. Vis. Comput. Graph. 22, 12 (2016), 2633–2651. DOI:
[103]
Maria De Marsico and Alessio Mecca. 2019. A survey on gait recognition via wearable sensors. ACM Comput. Surv. 52, 4 (2019). DOI:
[104]
Niki Martinel and Christian Micheloni. 2012. Re-identify people in wide area camera network. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. IEEE, 31–36. DOI:
[105]
Darko S. Matovski, Mark S. Nixon, Sasan Mahmoodi, and John N. Carter. 2012. The effect of time on gait recognition performance. IEEE Trans. Inf. Forensics Secur. 7, 2 (2012), 543–552. DOI:
[106]
Dushyant Mehta, Helge Rhodin, Dan Casas, Pascal Fua, Oleksandr Sotnychenko, Weipeng Xu, and Christian Theobalt. 2018. Monocular 3D human pose estimation in the wild using improved CNN supervision. In Proceedings of the 2017 International Conference on 3D Vision (3DV 2017). 506–516. DOI:
[107]
Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans Peter Seidel, Helge Rhodin, Gerard Pons-Moll, and Christian Theobalt. 2020. XNect: Real-time multi-person 3D motion capture with a single RGB camera. ACM Trans. Graph. 39, 4 (2020). DOI:
[108]
Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Srinath Sridhar, Gerard Pons-Moll, and Christian Theobalt. 2018. Single-shot multi-person 3D pose estimation from monocular RGB. In Proceedings of the 2018 International Conference on 3D Vision (3DV 2018). 120–130. DOI:
[109]
Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Srinath Sridhar, Gerard Pons-Moll, and Christian Theobalt. 2018. Single-shot multi-person 3D pose estimation from monocular RGB. In Proceedings of the 2018 International Conference on 3D Vision (3DV 2018). IEEE, 120–130. DOI:
[110]
Antonio S. Micilotta, Eng Jon Ong, and Richard Bowden. 2006. Real-time upper body detection and 3D pose estimation in monoscopic images. In Proceedings of the European Conference on Computer Vision (LNCS), vol. 3953. Springer, (2006), 139–150. DOI:
[111]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529–533. DOI:
[112]
Thomas B. Moeslund and Erik Granum. 2001. A survey of computer vision-based human motion capture. Comput. Vis. Image Underst. 81, 3 (2001), 231–268. DOI:
[113]
Thomas B. Moeslund, Adrian Hilton, and Volker Krüger. 2006. A survey of advances in vision-based human motion capture and analysis. Comput. Vis. Image Underst. 104, 2-3 SPEC. ISS. (2006), 90–126. DOI:
[114]
Gyeongsik Moon, Ju Yong Chang, and Kyoung Mu Lee. 2019. Posefix: Model-agnostic general human pose refinement network. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 7765–7773. DOI:
[115]
Gyeongsik Moon, Ju Yong Chang, and Kyoung Mu Lee. 2019. Camera distance-aware top-down approach for 3D multi-person pose estimation from a single RGB image. In Proceedings of the IEEE International Conference on Computer Vision. 10132–10141. DOI:
[116]
Jason K. Moore, Sandra K. Hnat, and Antonie J. van den Bogert. 2015. An elaborate data set on human gait and the effect of mechanical perturbations. PeerJ 2015, 3 (2015). DOI:
[117]
Francesc Moreno-Noguer. 2017. 3d human pose estimation from a single image via distance matrix regression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2823–2832. Retrieved from http://www.sciencedirect.com/science/article/pii/B008044854202229X.
[118]
Tewodros Legesse Munea, Yalew Zelalem Jembre, Halefom Tekle Weldegebriel, Longbiao Chen, Chenxi Huang, and Chenhui Yang. 2020. The progress of human pose estimation: A survey and taxonomy of models applied in 2D human pose estimation. IEEE Access 8, (2020), 133330–133348. DOI:
[119]
Erik Murphy-Chutorian and Mohan Manubhai Trivedi. 2009. Head pose estimation in computer vision: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 31, 4 (2009), 607–626. DOI:
[120]
Athira Nambiar, Alexandre Bernardino, and Jacinto C. Nascimento. 2019. Gait-based person Re-identification : A survey. ACM Comput. Surv. 52, 2 (2019). DOI:
[121]
Athira Nambiar, Alexandre Bernardino, Jacinto C. Nascimento, and Ana Fred. 2017. Towards view-point invariant person re-identification via fusion of anthropometric and gait features from kinect measurements. In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging Comput. Graph. Theory Appl. 5 (VISIGRAPP 2017). 108–119. DOI:
[122]
Athira Nambiar, Alexandre Bernardino, Jacinto C. Nascimento, and Ana Fred. 2017. Context-aware person re-identification in the wild via fusion of gait and anthropometric features. In Proceedings of the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017) - 1st International Workshop on Adaptive Shot Learning for Gesture Understanding and Production (ASL4GUP 2017), Biometrics in the Wild (BWILD 2017). Heteroge, IEEE, 973–980. DOI:
[123]
Alejandro Newell, Zhiao Huang, and Jia Deng. 2017. Associative embedding: End-to-end learning for joint detection and grouping. Adv. Neural Inf. Process. Syst. 2017-Decem, (2017), 2278–2288.
[124]
Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked hourglass networks for human pose estimation. In Proceedings of the European Conference on Computer Vision (ECCV). 483–499. DOI:
[125]
Bruce Xiaohan Nie, Ping Wei, and Song Chun Zhu. 2017. Monocular 3D human pose estimation by predicting depth on joints. In Proceedings of the IEEE International Conference on Computer Vision. 3467–3475. DOI:
[126]
Xuecheng Nie, Jiashi Feng, Jianfeng Zhang, and Shuicheng Yan. 2019. Single-stage multi-person pose machines. In Proceedings of the IEEE International Conference on Computer Vision. 6950–6959. DOI:
[127]
Sergey I. Nikolenko. 2019. Synthetic data for deep learning. In Springer Optimization and Its Applications. 1–156. Retrieved from http://arxiv.org/abs/1909.11512.
[128]
Guanghan Ning, Zhi Zhang, and Zhiquan He. 2018. Knowledge-Guided deep fractal neural networks for human pose estimation. IEEE Trans. Multimed. 20, 5 (2018), 1246–1259. DOI:
[129]
P. Nithyakani, A. Shanthini, and Godwin Ponsam. 2019. Human gait recognition using deep convolutional neural network. In Proceedings of the 2019 3rd International Conference on Computing and Communications Technologies (ICCCT 2019). IEEE, 208–211. DOI:
[130]
Mark S. Nixon and John N. Carter. 2004. Advances in automatic gait recognition. In Proceedings of the 6th IEEE International Conference on Automatic Face and Gesture Recognition. 139–144. DOI:
[131]
João Ferreira Nunes, Pedro Miguel Moreira, and João Manuel R. S. Tavares. 2018. GRIDDS - A gait recognition image and depth dataset. Instituto Politécnico de Viana do Castelo. Retrieved April 6, 2021 from www.gridds.ipvc.pt.
[132]
João Ferreira Nunes, Pedro Miguel Moreira, and João Manuel R. S. Tavares. 2019. Benchmark RGB-D gait datasets: A systematic review. In Proceedings of the ECCOMAS Thematic Conference on ComputerVision Medical Image Process. 34, September (2019), 366–372. DOI:
[133]
João Ferreira Nunes, Pedro Miguel Moreira, and João Manuel R. S. Tavares. 2019. GRIDDS - A gait recognition image and depth dataset. In ECCOMAS Thematic Conference on Computational Vision and Medical Image Processing. 343–352. DOI:
[134]
Kemal Oksuz, Baris Can Cam, Emre Akbas, and Sinan Kalkan. 2020. Generating positive bounding boxes for balanced training of object detectors. In Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (WACV 2020). 883–892. DOI:
[135]
Mohamed Omran, Christoph Lassner, Gerard Pons-Moll, Peter Gehler, and Bernt Schiele. 2018. Neural body fitting: Unifying deep learning and model based human pose and shape estimation. In Proceedings of the 2018 International Conference on 3D Vision (3DV 2018). IEEE, 484–494. DOI:
[136]
Robert J. Orr and Gregory D. Abowd. 2000. The Smart Floor: A mechanism for natural user identification and tracking. In Proceedings of the CHI’00 Extended Abstracts on Human Factors in Computing Systems. 275. DOI:
[137]
Daniil Osokin. 2019. Real-time 2D multi-person pose estimation on CPU: Lightweight openpose. In Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2019). 744–748. DOI:
[138]
Federico Pala, Riccardo Satta, Giorgio Fumera, and Fabio Roli. 2016. Multimodal person reidentification using RGB-D cameras. IEEE Trans. Circuits Syst. Video Technol. 26, 4 (2016), 788–799. DOI:
[139]
Pietro Pala, Lorenzo Seidenari, Stefano Berretti, and Alberto Del Bimbo. 2019. Enhanced skeleton and face 3D data for person re-identification from depth cameras. Comput. Graph. 79, (2019), 69–80. DOI:
[140]
George Papandreou, Tyler Zhu, Liang Chieh Chen, Spyros Gidaris, Jonathan Tompson, and Kevin Murphy. 2018. Personlab: Person pose estimation and instance segmentation with a bottom-up, part-based, geometric embedding model. In Proceedings of the European Conference on. Computer Vision, Lecture Notes in Computer Science (LNCS). vol. 11218. Springer, (2018), 282–299. DOI:
[141]
George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, and Kevin Murphy. 2017. Towards accurate multi-person pose estimation in the wild. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). 3711–3719. DOI:
[142]
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, and Kostas Daniilidis. 2017. Coarse-to-fine volumetric prediction for single-image 3D human pose. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). 1263–1272. DOI:
[143]
Xavier Perez-Sala, Sergio Escalera, Cecilio Angulo, and Jordi Gonzàlez. 2014. A survey on model based approaches for 2D and 3D visual human pose recovery. Sensors (Switzerland) 14, 3 (2014), 4189–4210. DOI:
[144]
Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter Gehler, and Bernt Schiele. 2016. DeepCut: Joint subset partition and labeling for multi person pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 4929–4937. DOI:
[145]
G. Qian, J. Zhang, and A. Kidane. 2010. People identification using floor pressure sensors. IEEE Sens. J. 10, 9 (2010), 1447–1460.
[146]
Varun Ramakrishna, Daniel Munoz, Martial Hebert, James Andrew Bagnell, and Yaser Sheikh. 2014. Pose machines: Articulated pose estimation via inference machines. In Proceedings of the European Conference on Computer Vision (ECCV). 33–47. DOI:
[147]
Haocong Rao, Siqi Wang, Xiping Hu, Mingkui Tan, Yi Guo, Jun Cheng, Bin Hu, and Xinwang Liu. 2021. A self-supervised gait encoding approach with locality-awareness for 3D skeleton based person re-identification. IEEE Trans. Pattern Anal. Mach. Intell. (Early Access) (2021). DOI:
[148]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2017. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39, 6 (2017), 1137–1149. DOI:
[149]
Helge Rhodin, Mathieu Salzmann, and Pascal Fua. 2018. Unsupervised geometry-aware representation for 3D human pose estimation. In Proceedings of the European Conference on Computer Vision (ECCV). 765–782. DOI:
[150]
Gregory Rogez, Philippe Weinzaepfel, and Cordelia Schmid. 2020. LCR-Net++: Multi-Person 2D and 3D pose detection in natural images. IEEE Trans. Pattern Anal. Mach. Intell. 42, 5 (2020), 1146–1161. DOI:
[151]
Grégory Rogez, Philippe Weinzaepfel, and Cordelia Schmid. 2017. LCR-Net: Localization-classification-regression for human pose. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). 1216–1224. DOI:
[152]
Mickaël Ropars, Armel Cretual, Hervé Thomazeau, Rajiv Kaila, and Isabelle Bonan. 2015. Volumetric definition of shoulder range of motion and its correlation with clinical signs of shoulder hyperlaxity. A motion capture study. J. Shoulder Elb. Surg. 24, 2 (2015), 310–316. DOI:
[153]
Noor Saffazura Ahmad Safuan, Marina Ismail, and Nursuriati Jamil. 2017. Feature extraction technique for human gait video analysis. J. Eng. Appl. Sci. 12, 3 (2017), 534–541. DOI:
[154]
Ben Sapp and Ben Taskar. 2013. MODEC: Multimodal decomposable models for human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 3674–3681. DOI:
[155]
Nikolaos Sarafianos, Bogdan Boteanu, Bogdan Ionescu, and Ioannis A. Kakadiaris. 2016. 3D Human pose estimation: A review of the literature and analysis of covariates. Comput. Vis. Image Underst. 152, (2016), 1–20. DOI:
[156]
Céline Schreiber and Florent Moissenet. 2019. A multimodal dataset of human gait at different walking speeds established on injury-free adult participants. Sci. Data 6, 1 (2019), 1–7. DOI:
[157]
Alireza Sepas-Moghaddam and Ali Etemad. 2022. Deep gait recognition: A survey. IEEE Trans. Pattern Anal. Mach. Intell. (2022), 1–20. DOI:
[158]
Xiaofeng Shao, Zhenping Qiang, Hong Lin, Yueyu Dong, and Xiaorui Wang. 2020. A survey of head pose estimation methods. In Proceedings of the IEEE Congress on Cybermatics: 2020 IEEE International Conferences on Internet of Things (iThings 2020), IEEE Green Computing and Communications (GreenCom 2020), IEEE Conference on Cyber, Physical and Social Computing (CPSCom 2020), and IEEE Conference on Smart Data (SmartD). 787–796. DOI:
[159]
Jon Sharman. 2018. Metropolitan Police's facial recognition technology 98% inaccurate, figures show. Independent. Retrieved March 24, 2022 from https://www.independent.co.uk/news/uk/home-news/met-police-facial-recognition-success-south-wales-trial-home-office-false-positive-a8345036.html.
[160]
Weijie Sheng and Xinde Li. 2021. Multi-task learning for gait-based identity recognition and emotion recognition using attention enhanced temporal graph convolutional network. Pattern Recognit. 114, (2021). DOI:
[161]
Kohei Shiraga, Yasushi Makihara, Daigo Muramatsu, Tomio Echigo, and Yasushi Yagi. 2016. GEINet: View-invariant gait recognition using a convolutional neural network. In Proceedings of the 2016 International Conference on Biometrics (ICB 2016). IEEE. DOI:
[162]
Y. Shoji, T. Takasuka, and H. Yasukawa. 2004. Personal identification using footstep detection. In Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems. IEEE, 43–47.
[163]
Jamie Shutler. 2004. On a large sequence-based human gait database. Appl. Sci. Soft Comput. January 2004 (2004). DOI:
[164]
Chenyang Si, Wentao Chen, Wei Wang, Liang Wang, and Tieniu Tan. 2019. An attention enhanced graph convolutional LSTM network for skeleton-based action recognition. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 1227–1236. DOI:
[165]
Leonid Sigal, Alexandru O. Balan, and Michael J. Black. 2010. HumanEva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. Int. J. Comput. Vis. 87, 1–2 (2010), 4–27. DOI:
[166]
Jasvinder Pal Singh, Sanjeev Jain, Sakshi Arora, and Uday Pratap Singh. 2018. Vision-based gait recognition: A survey. IEEE Access 6, (2018), 70497–70527. DOI:
[167]
Sabesan Sivapalan, Daniel Chen, Simon Denman, Sridha Sridharan, and Clinton Fookes. 2011. Gait energy volumes and frontal gait recognition using depth images. In Proceedings of the 2011 International Joint Conference on Biometrics (IJCB 2011). IEEE. DOI:
[168]
Eduardo de Souza Reis, Lucas Adams Seewald, Rodolfo Stoffel Antunes, Vinicius Facco Rodrigues, Rodrigo da Rosa Righi, Cristiano André da Costa, Luiz Gonzaga da Silveira Jr., Bjoern Eskofier, Andreas Maier, Tim Horz, and Rebecca Fahrig. 2021. Monocular multi-person pose estimation: A survey. Pattern Recognit. 118, (2021), 108046. DOI:
[169]
Ke Sun, Bin Xiao, Dong Liu, and Jingdong Wang. 2019. Deep high-resolution representation learning for human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 5686–5696. DOI:
[170]
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1–9. DOI:
[171]
Kazunari Takeichi, Masaru Ichikawa, Ryota Shinayama, and Takehiro Tagawa. 2018. A mobile application for running form analysis based on pose estimation technique. In Proceedings of the 2018 IEEE International Conference on Multimedia and Expo Workshops (ICMEW 2018). IEEE, 8–11. DOI:
[172]
Noriko Takemura, Yasushi Makihara, Daigo Muramatsu, Tomio Echigo, and Yasushi Yagi. 2018. Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition. IPSJ Trans. Comput. Vis. Appl. 10, 1 (2018). DOI:
[173]
Torben Teepe, Ali Khan, Johannes Gilg, Fabian Herzog, Stefan Hörmann, and Gerhard Rigoll. 2021. GaitGraph: Graph Convolutional Network for Skeleton-Based Gait Recognition. Retrieved from http://arxiv.org/abs/2101.11228.
[174]
Bugra Tekin, Pablo Márquez-Neila, Mathieu Salzmann, and Pascal Fua EPFL. 2017. Learning to fusing 2D uncertainty and 3D cues for monocular body pose estimation. In Proceedings of the IEEE International Conference on Computer Vision (ICCV).
[175]
Atima Tharatipyakul, Kenny T. W. Choo, and Simon T. Perrault. 2020. Pose estimation for facilitating movement learning from online videos. In Proceedings of the ACM International Conference Proceeding Series. 1–5. DOI:
[176]
Yonghong Tian, Lan Wei, Shijian Lu, and Tiejun Huang. 2019. Free-view gait recognition. PLoS One 14, 4 (2019), 1–24. DOI:
[177]
Denis Tome, Chris Russell, and Lourdes Agapito. 2017. Lifting from the deep: Convolutional 3D pose estimation from a single image. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). 5689–5698. DOI:
[178]
J. Tompson, R. Goroshin, A. Jain, and Y. LeCun. 2015. Efficient object localization using convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 648–656.
[179]
Jonathan Tompson, Arjun Jain, Yann LeCun, and Christoph Bregler. 2014. Joint training of a convolutional network and a graphical model for human pose estimation. Adv. Neural Inf. Process. Syst. 2, January (2014), 1799–1807.
[180]
Alexander Toshev and Christian Szegedy. 2014. DeepPose: Human pose estimation via deep neural networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 1653–1660. DOI:
[181]
Kohei Toyoda, Michinari Kono, and Jun Rekimoto. 2019. Post-data augmentation to improve deep pose estimation of extreme and wild motions. In Proceedings of the 26th IEEE Conference on Virtual Reality and 3D User Interfaces, VR 2019 - Proceedings. 1570–1574. DOI:
[182]
UK Home Office. 2008. i-LIDS Multiple Camera Tracking Scenario Definition.
[183]
University of Southampton. 2003. Automatic gait recognition for human ID at a distance. Gait. Retrieved July 23, 2021 from http://www.gait.ecs.soton.ac.uk/.
[184]
Jaychand Upadhyay, Rohan Paranjpe, Hiralal Purohit, and Rohan Joshi. 2020. Biometric identification using gait analysis by deep learning. In Proceedings of the 2020 IEEE Pune Section International Conference (PuneCon 2020). 152–156. DOI:
[185]
Abhishek Vadnerkar, Sabrina Figueiredo, Nancy E. Mayo, and Robert E. Kearney. 2014. Classification of gait quality for biofeedback to improve heel-to-toe gait. In Proceedings of the 2014 36th Annual International Conference IEEE England Medical Biological Society (EMBC 2014). (2014), 3626–3629. DOI:
[186]
Amir Vajdi. 2019. 93 Human Gait (walking) Database. Kaggle. Retrieved March 30, 2021 from https://www.kaggle.com/drdataboston/93-human-gait-database/metadata.
[187]
Amir Vajdi, Mohammad Reza Zaghian, Saman Farahmand, Elham Rastegar, Kian Maroofi, Shaohua Jia, Marc Pomplun, Nurit Haspel, and Akram Bayat. 2019. Human Gait Database for Normal Walk Collected by Smart Phone Accelerometer. arXiv:1905.03109. Retrieved from https://arxiv.org/abs/1905.03109
[188]
Gül Varol, Duygu Ceylan, Bryan Russell, Jimei Yang, Ersin Yumer, Ivan Laptev, and Cordelia Schmid. 2018. BodyNet: Volumetric inference of 3D human body shapes. In Proceedings of the European Conference on Computer Vision (ECCV). 20–38. DOI:
[189]
Gül Varol, Javier Romero, Xavier Martin, Naureen Mahmood, Michael J. Black, Ivan Laptev, and Cordelia Schmid. 2017. Learning from synthetic humans. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). 4627–4635. DOI:
[190]
Tanmay Tulsidas Verlekar, Luís Ducla Soares, and Paulo Lobato Correia. 2018. Gait recognition in the wild using shadow silhouettes. Image Vis. Comput. 76, (2018), 1–13. DOI:
[191]
Paul Viola and Michael Jones. 2001. Rapid object detection using a boosted cascade of simple features. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. DOI:
[192]
Changsheng Wan, Li Wang, and Vir V. Phoha. 2018. A survey on gait recognition. ACM Comput. Surv. 51, 5 (2018). DOI:
[193]
Can Wang, Jiefeng Li, Wentao Liu, Chen Qian, and Cewu Lu. 2020. HMOR: Hierarchical multi-person ordinal relations for monocular multi-person 3D pose estimation. In Proceedings of the European Conference on Computer Vision (ECCV). 242–259. DOI:
[194]
Chen Wang, Feng Zhang, and Shuzhi Sam Ge. 2021. A comprehensive survey on 2D multi-person pose estimation methods. Eng. Appl. Artif. Intell. 102, March (2021), 104260. DOI:
[195]
Jianbo Wang, Kai Qiu, Houwen Peng, Jianlong Fu, and Jianke Zhu. 2019. AI coach: Deep human pose estimation and analysis for personalized athletic training assistance. In Proceedings of the27th ACM International Conference on Multimedia. 374–382. DOI:
[196]
Liang Wang, Tieniu Tan, Huazhong Ning, and Weiming Hu. 2003. Silhouette analysis-based gait recognition for human identification. IEEE Trans. Pattern Anal. Mach. Intell. 25, 12 (2003), 1505–1518. DOI:
[197]
Pichao Wang, Wanqing Li, Philip Ogunbona, Jun Wan, and Sergio Escalera. 2018. RGB-D-based human motion recognition with deep learning: A survey. Comput. Vis. Image Underst. 171, April (2018), 118–139. DOI:
[198]
Taiqing Wang, Shaogang Gong, Xiatian Zhu, and Shengjin Wang. 2016. Person re-identification by discriminative selection in video ranking. IEEE Trans. Pattern Anal. Mach. Intell. 38, 12 (2016), 2501–2514. DOI:
[199]
Yiming Wang, Lingchao Guo, Zhaoming Lu, Xiangming Wen, Shuang Zhou, and Wanyu Meng. 2021. From point to space: 3D moving human pose estimation using commodity WiFi. IEEE Commun. Lett. 25, 7 (2021), 2235–2239. DOI:
[200]
Lan Wei, Yonghong Tian, Yaowei Wang, and Tiejun Huang. 2015. Swiss-system based cascade ranking for gait-based person re-identification. In Proceedings of the National Conference on Artificial Intelligence. 1882–1888.
[201]
Shih-En Wei, Varun Ramakrishna, Takeo Kanada, and Yaser Sheikh. 2016. Convolutional pose machines. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. (2016), 4724–4732.
[202]
Philippe Weinzaepfel, Jerome Revaud, Zaid Harchaoui, and Cordelia Schmid. 2013. DeepFlow: Large displacement optical flow with deep matching. In Proceedings of the IEEE International Conference on Computer Vision. 1385–1392. DOI:
[203]
Jiamin Wen, Yongliang Shen, and Jun Yang. 2022. Multi-view gait recognition based on generative adversarial network. Neural Process. Lett. (2022). DOI:
[204]
Zifeng Wu, Yongzhen Huang, Liang Wang, and Senior Member. 2015. Learning representative deep features for image set analysis. IEEE Trans. Multimed. 17, 11 (2015), 1960–1968.
[205]
Fangting Xia, Peng Wang, Xianjie Chen, and Alan Yuille. 2017. Joint multi-person pose estimation and semantic part segmentation. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). 6080–6089. DOI:
[206]
Bin Xiao, Haiping Wu, and Yichen Wei. 2018. Simple baselines for human pose estimation and tracking. In Proceedings of the European Conference on Computer Vision (ECCV). 472–487. DOI:
[207]
Zhongwei Xie, Lin Li, Xian Zhong, Luo Zhong, and Jianwen Xiang. 2020. Image-to-video person re-identification with cross-modal embeddings. Pattern Recognit. Lett. 133, (2020), 70–76. DOI:
[208]
Sijie Yan, Yuanjun Xiong, and Dahua Lin. 2018. Spatial temporal graph convolutional networks for skeleton-based action recognition. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence 1 (2018). DOI:
[209]
Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, and Xiaogang Wang. 2017. Learning feature pyramids for human pose estimation. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 1281–1290. DOI:
[210]
Lingxiang Yao, Worapan Kusakunniran, Qiang Wu, Jian Zhang, and Zhenmin Tang. 2018. Robust CNN-based gait verification and identification using skeleton gait energy image. In Proceedings of the 2018 International Conference on Digital Image Computing: Techniques and Applications (DICTA 2018). IEEE. DOI:
[211]
Lingxiang Yao, Worapan Kusakunniran, Qiang Wu, Jian Zhang, Zhenmin Tang, and Wankou Yang. 2019. Robust gait recognition using hybrid descriptors based on skeleton gait energy image. Pattern Recognit. Lett. (2019). DOI:
[212]
Shiqi Yu, Daoliang Tan, and Tieniu Tan. 2006. A framework for evaluating the effect of view angle, clothing and carrying condition on gait recognition. In Proceedings of the International Conference on Pattern Recognition. 441–444. DOI:
[213]
Andrei Zanfir, Elisabeta Marinoiu, and Cristian Sminchisescu. 2018. Monocular 3D pose and shape estimation of multiple people in natural scenes: The importance of multiple scene constraints. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, 2148–2157. DOI:
[214]
Feng Zhang, Xiatian Zhu, Hanbin Dai, Mao Ye, and Ce Zhu. 2020. Distribution-Aware coordinate representation for human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 7091–7100. DOI:
[215]
Feng Zhang, Xiatian Zhu, and Mao Ye. 2019. Fast human pose estimation. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. (June 2019), 3512–3521. DOI:
[216]
Jing Zhang, Zhe Chen, and Dacheng Tao. 2021. Towards high performance human keypoint detection. Int. J. Comput. Vis. (2021), 1–24. Retrieved from http://arxiv.org/abs/2002.00537.
[217]
Rui Zhang, Zheng Zhu, Peng Li, Rui Wu, Chaoxu Guo, Guan Huang, and Hailun Xia. 2019. Exploiting offset-guided network for pose estimation and tracking. In Proceedings of the CVPR Workshops. 20–28. Retrieved from http://arxiv.org/abs/1906.01344.
[218]
Weiyu Zhang, Menglong Zhu, and Konstantinos G. Derpanis. 2013. From actemes to action: A strongly-supervised representation for detailed action understanding. In Proceedings of the IEEE International Conference on Computer Vision. 2248–2255. DOI:
[219]
Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, and Xiaowei Zhou. 2020. SMAP: Single-Shot multi-person absolute 3D pose estimation. In Proceedings of the European Conference on Computer Vision (ECCV). 550–566. DOI:
[220]
Ce Zheng, Wenhan Wu, Taojiannan Yang, Sijie Zhu, Chen Chen, Ruixu Liu, Ju Shen, Nasser Kehtarnavaz, and Mubarak Shah. 2020. Deep learning-based human pose estimation: A survey. Tsinghua Sci. Technol. 24, 6 (2020), 663–676. DOI:
[221]
Liang Zheng, Zhi Bie, Yifan Sun, Jingdong Wang, Chi Su, Shengjin Wang, and Qi Tian. 2016. MARS: A video benchmark for large-scale person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV). 868–884. DOI:
[222]
Zhou, Xingyi, Xiao Sun, Wei Zhang, Shuang Liang, and Yichen Wei. 2016. Deep kinematic pose regression. In Proceedings of the European Conference on Computer Vision. 186–201. DOI:
[223]
Kun Zhou, Xiaoguang Han, Nianjuan Jiang, Kui Jia, and Jiangbo Lu. 2019. HEMlets pose: Learning part-centric heatmap triplets for accurate 3D human pose estimation. In Proceedings of the IEEE International Conference on Computer Vision. 2344–2353. DOI:
[224]
Xingyi Zhou, Qixing Huang, Xiao Sun, Xiangyang Xue, and Yichen Wei. 2017. Weakly supervised transfer for 3d human pose estimation in the wild. In Proceedings of the IEEE International Conference on Computer Vision. 3.

Cited By

View all

Index Terms

  1. Human Body Pose Estimation for Gait Identification: A Comprehensive Survey of Datasets and Models

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Computing Surveys
        ACM Computing Surveys  Volume 55, Issue 6
        June 2023
        781 pages
        ISSN:0360-0300
        EISSN:1557-7341
        DOI:10.1145/3567471
        Issue’s Table of Contents

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 07 December 2022
        Online AM: 15 May 2022
        Accepted: 23 April 2022
        Received: 01 November 2021
        Published in CSUR Volume 55, Issue 6

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Human pose estimation
        2. gait re-identification
        3. face matching
        4. deep learning
        5. human pose datasets
        6. crime suspect identification

        Qualifiers

        • Survey
        • Refereed

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)671
        • Downloads (Last 6 weeks)73
        Reflects downloads up to 15 Sep 2024

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Get Access

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        Full Text

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media