Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation

Xu, Liyuan; Kanagawa, Heishiro; Gretton, Arthur

Computer Science > Machine Learning

arXiv:2106.03907 (cs)

[Submitted on 7 Jun 2021 (v1), last revised 18 Jun 2024 (this version, v5)]

Title:Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation

Authors:Liyuan Xu, Heishiro Kanagawa, Arthur Gretton

View PDF HTML (experimental)

Abstract:Proxy causal learning (PCL) is a method for estimating the causal effect of treatments on outcomes in the presence of unobserved confounding, using proxies (structured side information) for the confounder. This is achieved via two-stage regression: in the first stage, we model relations among the treatment and proxies; in the second stage, we use this model to learn the effect of treatment on the outcome, given the context provided by the proxies. PCL guarantees recovery of the true causal effect, subject to identifiability conditions. We propose a novel method for PCL, the deep feature proxy variable method (DFPV), to address the case where the proxies, treatments, and outcomes are high-dimensional and have nonlinear complex relationships, as represented by deep neural network features. We show that DFPV outperforms recent state-of-the-art PCL methods on challenging synthetic benchmarks, including settings involving high dimensional image data. Furthermore, we show that PCL can be applied to off-policy evaluation for the confounded bandit problem, in which DFPV also exhibits competitive performance.

Comments:	arXiv admin note: text overlap with arXiv:2010.07154
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2106.03907 [cs.LG]
	(or arXiv:2106.03907v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.03907

Submission history

From: Liyuan Xu [view email]
[v1] Mon, 7 Jun 2021 18:36:13 UTC (87 KB)
[v2] Tue, 7 Dec 2021 02:16:39 UTC (100 KB)
[v3] Sun, 2 Jul 2023 15:56:15 UTC (131 KB)
[v4] Mon, 19 Feb 2024 23:35:03 UTC (147 KB)
[v5] Tue, 18 Jun 2024 08:40:30 UTC (151 KB)

Computer Science > Machine Learning

Title:Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators