Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning

Madhu, Prathmesh; Villar-Corrales, Angel; Kosti, Ronak; Bendschus, Torsten; Reinhardt, Corinna; Bell, Peter; Maier, Andreas; Christlein, Vincent

doi:10.1145/3569089

Computer Science > Computer Vision and Pattern Recognition

arXiv:2012.05616 (cs)

[Submitted on 10 Dec 2020 (v1), last revised 25 Feb 2024 (this version, v2)]

Title:Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning

Authors:Prathmesh Madhu, Angel Villar-Corrales, Ronak Kosti, Torsten Bendschus, Corinna Reinhardt, Peter Bell, Andreas Maier, Vincent Christlein

View PDF HTML (experimental)

Abstract:Human pose estimation (HPE) is a central part of understanding the visual narration and body movements of characters depicted in artwork collections, such as Greek vase paintings. Unfortunately, existing HPE methods do not generalise well across domains resulting in poorly recognized poses. Therefore, we propose a two step approach: (1) adapting a dataset of natural images of known person and pose annotations to the style of Greek vase paintings by means of image style-transfer. We introduce a perceptually-grounded style transfer training to enforce perceptual consistency. Then, we fine-tune the base model with this newly created dataset. We show that using style-transfer learning significantly improves the SOTA performance on unlabelled data by more than 6% mean average precision (mAP) as well as mean average recall (mAR). (2) To improve the already strong results further, we created a small dataset (ClassArch) consisting of ancient Greek vase paintings from the 6-5th century BCE with person and pose annotations. We show that fine-tuning on this data with a style-transferred model improves the performance further. In a thorough ablation study, we give a targeted analysis of the influence of style intensities, revealing that the model learns generic domain styles. Additionally, we provide a pose-based image retrieval to demonstrate the effectiveness of our method.

Comments:	Link to the repository containing the code to reproduce the experiments. For further details, please read the README. Link: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2012.05616 [cs.CV]
	(or arXiv:2012.05616v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2012.05616
Journal reference:	J. Comput. Cult. Herit. 16, 1, Article 16 (March 2023), 17 pages
Related DOI:	https://doi.org/10.1145/3569089

Submission history

From: Vincent Christlein [view email]
[v1] Thu, 10 Dec 2020 12:08:03 UTC (15,258 KB)
[v2] Sun, 25 Feb 2024 21:07:14 UTC (17,393 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators