skip to main content
10.1145/3560905.3568304acmconferencesArticle/Chapter ViewAbstractPublication PagessensysConference Proceedingsconference-collections
research-article
Open access

CIPhy: Causal Intervention with Physical Confounder from IoT Sensor Data for Robust Occupant Information Inference

Published: 24 January 2023 Publication History

Abstract

Occupant information inference with IoT sensor data enables many smart applications, such as patients'/older adults' in-home monitoring. The difficulty of collecting labeled real-world IoT sensor data often leads to reliability and scalability issues for those systems. Extensive prior works (e.g., domain adaptation) focus on the domain shift issues, i.e., the inconsistent data feature and label relationship, and dataset bias is often neglected. Dataset bias is commonly caused by limited and varied accessibility to labeled data for each class, and it is inevitable for real-world datasets. The model trained with a biased dataset fits into the bias, hence cannot further generalize to the testing data for accurate inference.
We propose CIPhy, a causal intervention scheme with physical confounders measured from the sensor data to achieve robust occupant information inference. We model the dataset bias as a confounding problem. There exists a confounder directly impacts both data feature and label, and each class's accessibility to labeled data varies when the confounder's condition changes. The model trained with biased data learns a spurious feature-label correlation conditioned on the confounder's condition in the training data. When testing data has a different condition, i.e., confounding shift, this correlation can not be applied. By using the causal intervention, e.g., backdoor adjustment, the confounding shift's negative impact on the data-driven models can be mitigated. The CIPhy decouples the sensor data to measure the confounder, then conducts the causal intervention for a de-biased occupant information inference. We use a public dataset on occupant identification as a case study, to investigate the feasibility of applying causal intervention to resolve the dataset bias issue. From the experiment, CIPhy achieves up to 11.42% identification accuracy improvement compared to baselines given the biased training data and confounding shift.

References

[1]
Bharathan Balaji, Jian Xu, Anthony Nwokafor, Rajesh Gupta, and Yuvraj Agarwal. 2013. Sentinel: occupancy based HVAC actuation using existing WiFi infrastructure within commercial buildings. In Proceedings of the 11th ACM Conference on Embedded Networked Sensor Systems. 1--14.
[2]
Huigang Chen, Totte Harinen, Jeong-Yoon Lee, Mike Yung, and Zhenyu Zhao. 2020. Causalml: Python package for causal machine learning. arXiv preprint arXiv:2002.11631 (2020).
[3]
Paul D Cleary and Ronald Angel. 1984. The analysis of relationships involving dichotomous dependent variables. Journal of Health and Social Behavior (1984), 334--348.
[4]
Yiwen Dong, Shijia Pan, Tong Yu, Mostafa Mirshekari, Jonathon Fagert, Amelie Bonde, Ole J. Mengshoel, Pei Zhang, and Hae Young Noh. 2021. The FootprintID Dataset: Footstep-Induced Structural Vibration Data for Indoor Person Identification with Different Walking Speeds.
[5]
Jonathan Francis, Matias Quintana, Nadine Von Frankenberg, Sirajum Munir, and Mario Bergés. 2019. Occutherm: Occupant thermal comfort inference using body shape information. In Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation. 81--90.
[6]
Davrondzhon Gafurov, Einar Snekkenes, and Patrick Bours. 2007. Gait authentication and identification using wearable accelerometer sensor. In 2007 IEEE workshop on automatic identification advanced technologies. IEEE, 220--225.
[7]
Sahaj Garg, Vincent Perot, Nicole Limtiaco, Ankur Taly, Ed H Chi, and Alex Beutel. 2019. Counterfactual fairness in text classification through robustness. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society. 219--226.
[8]
Zhizhang Hu, Yue Zhang, and Shijia Pan. 2021. Footstep-Induced Floor Vibration Dataset: Reusability and Transferability Analysis. In Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems. 546--551.
[9]
Zhijing Jin, Julius von Kügelgen, Jingwei Ni, Tejas Vaidhya, Ayush Kaushal, Mrinmaya Sachan, and Bernhard Schölkopf. 2021. Causal direction of data collection matters: Implications of causal and anticausal learning for NLP. arXiv preprint arXiv:2110.03618 (2021).
[10]
Virgile Landeiro and Aron Culotta. 2016. Robust text classification in the presence of confounding bias. In Thirtieth AAAI Conference on Artificial Intelligence.
[11]
Virgile Landeiro and Aron Culotta. 2018. Robust text classification under confounding shift. Journal of Artificial Intelligence Research 63 (2018), 391--419.
[12]
Gierad Laput, Karan Ahuja, Mayank Goel, and Chris Harrison. 2018. Ubicoustics: Plug-and-play acoustic activity recognition. In Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology. 213--224.
[13]
Lung-Fei Lee. 1982. Specification error in multinomial logit models: Analysis of the omitted variable bias. Journal of Econometrics 20, 2 (1982), 197--209.
[14]
Scott Menard. 2002. Applied logistic regression analysis. Number 106. Sage.
[15]
Vimal Mollyn, Karan Ahuja, Dhruv Verma, Chris Harrison, and Mayank Goel. 2022. SAMoSA: Sensing Activities with Motion and Subsampled Audio. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 6, 3 (2022), 1--19.
[16]
Shijia Pan, Tong Yu, Mostafa Mirshekari, Jonathon Fagert, Amelie Bonde, Ole J Mengshoel, Hae Young Noh, and Pei Zhang. 2017. Footprintid: Indoor pedestrian identification through ambient structural vibration sensing. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 1, 3 (2017), 1--31.
[17]
Judea Pearl. 1998. Why there is no statistical test for confounding, why many think there is, and why they are almost right. (1998).
[18]
Judea Pearl. 2009. Causality. Cambridge university press.
[19]
Chao-Ying Joanne Peng, Kuk Lida Lee, and Gary M Ingersoll. 2002. An introduction to logistic regression analysis and reporting. The journal of educational research 96, 1 (2002), 3--14.
[20]
Bernhard Schölkopf. 2022. Causality for machine learning. In Probabilistic and Causal Inference: The Works of Judea Pearl. 765--804.
[21]
Edward H Simpson. 1951. The interpretation of interaction in contingency tables. Journal of the Royal Statistical Society: Series B (Methodological) 13, 2 (1951), 238--241.
[22]
Baochen Sun, Jiashi Feng, and Kate Saenko. 2017. Correlation alignment for unsupervised domain adaptation. In Domain Adaptation in Computer Vision Applications. Springer, 153--171.
[23]
Dipti Trivedi and Venkataramana Badarla. 2020. Occupancy detection systems for indoor environments: A survey of approaches and methods. Indoor and Built Environment 29, 8 (2020), 1053--1069.
[24]
Tan Wang, Jianqiang Huang, Hanwang Zhang, and Qianru Sun. 2020. Visual commonsense r-cnn. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10760--10770.
[25]
Yunze Zeng, Parth H Pathak, and Prasant Mohapatra. 2016. WiWho: WiFi-based person identification in smart spaces. In 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN). IEEE, 1--12.
[26]
Beier Zhu, Yulei Niu, Xian-Sheng Hua, and Hanwang Zhang. 2022. Cross-domain empirical risk minimization for unbiased long-tailed classification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 3589--3597.

Cited By

View all

Index Terms

  1. CIPhy: Causal Intervention with Physical Confounder from IoT Sensor Data for Robust Occupant Information Inference

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SenSys '22: Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems
      November 2022
      1280 pages
      ISBN:9781450398862
      DOI:10.1145/3560905
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 January 2023

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. causal intervention
      2. dataset bias
      3. occupant information inference

      Qualifiers

      • Research-article

      Conference

      Acceptance Rates

      SenSys '22 Paper Acceptance Rate 52 of 187 submissions, 28%;
      Overall Acceptance Rate 174 of 867 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)123
      • Downloads (Last 6 weeks)13
      Reflects downloads up to 06 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media