In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains

Cao, Ting; Armin, Mohammad Ali; Denman, Simon; Petersson, Lars; Ahmedt-Aristizabal, David

doi:10.1109/ISBI52829.2022.9761598

Computer Science > Computer Vision and Pattern Recognition

arXiv:2111.15124 (cs)

[Submitted on 30 Nov 2021 (v1), last revised 24 Jan 2022 (this version, v2)]

Title:In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains

Authors:Ting Cao, Mohammad Ali Armin, Simon Denman, Lars Petersson, David Ahmedt-Aristizabal

View PDF

Abstract:Medical applications have benefited greatly from the rapid advancement in computer vision. Considering patient monitoring in particular, in-bed human posture estimation offers important health-related metrics with potential value in medical condition assessments. Despite great progress in this domain, it remains challenging due to substantial ambiguity during occlusions, and the lack of large corpora of manually labeled data for model training, particularly with domains such as thermal infrared imaging which are privacy-preserving, and thus of great interest. Motivated by the effectiveness of self-supervised methods in learning features directly from data, we propose a multi-modal conditional variational autoencoder (MC-VAE) capable of reconstructing features from missing modalities seen during training. This approach is used with HRNet to enable single modality inference for in-bed pose estimation. Through extensive evaluations, we demonstrate that body positions can be effectively recognized from the available modality, achieving on par results with baseline models that are highly dependent on having access to multiple modes at inference time. The proposed framework supports future research towards self-supervised learning that generates a robust model from a single source, and expects it to generalize over many unknown distributions in clinical environments.

Comments:	In the IEEE International Symposium on Biomedical Imaging (ISBI)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2111.15124 [cs.CV]
	(or arXiv:2111.15124v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2111.15124
Journal reference:	ISBI 2022
Related DOI:	https://doi.org/10.1109/ISBI52829.2022.9761598

Submission history

From: David Ahmedt-Aristizabal [view email]
[v1] Tue, 30 Nov 2021 04:56:16 UTC (11,601 KB)
[v2] Mon, 24 Jan 2022 12:56:50 UTC (11,557 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators