Causal Decision-Making

Decision Theory

Statistical Decision Theory

Bayes: How

Bayes: Why

Relative Comparisons: (Statistical) Regret

Alternate perspective: (Ex post) Regret

Example applications

Finding Optimal Procedures

Plug-in methods

Decision Rule Asymptotics

Empirical optimization

Uniform convergence of risk functional

Application: Binary Treatment Assignment

Decision Rule Sets

Constraints on Policy Targeting

Extensions

Simulation Example

library(policytree)
set.seed(1234)
n <- 2000
p <- 5
# Rounding down continuous covariates decreases runtime.
Z <- round(matrix(rnorm(n * p), n, p), 2)
colnames(Z) <- make.names(1:p)
X <- rbinom(n, 1, 1 / (1 + exp(Z[, 3])))
tau <- 1 / (1 + exp((Z[, 1] + Z[, 2]) / 2)) - 0.5
Y <- Z[, 3] + X * tau + rnorm(n)
c.forest <- grf::causal_forest(Z, Y, X) #Get nuisance functions
dr.scores <- double_robust_scores(c.forest) #Get scores functions
tree <- policy_tree(Z, dr.scores, 2) #Optimize over tree policies of depth 2
tree
## policy_tree object 
## Tree depth:  2 
## Actions:  1: control 2: treated 
## Variable splits: 
## (1) split_variable: X1  split_value: -0.56 
##   (2) split_variable: X3  split_value: -0.88 
##     (4) * action: 1 
##     (5) * action: 2 
##   (3) split_variable: X2  split_value: -0.72 
##     (6) * action: 2 
##     (7) * action: 1

Estimated optimal tree policy

plot(tree)

Conclusions

References

Adusumilli, Karun, Friedrich Geiecke, and Claudio Schilter. 2020. “Dynamically Optimal Treatment Allocation Using Reinforcement Learning.” http://arxiv.org/abs/1904.01047.
Andrews, Isaiah, and Jesse M. Shapiro. 2021. “A Model of Scientific Communication.” Econometrica 89 (5): 2117–42.
Assunção, Juliano, Robert McMillan, Joshua Murphy, and Eduardo Souza-Rodrigues. 2019. “Optimal Environmental Targeting in the Amazon Rainforest.” National Bureau of Economic Research.
Athey, Susan, and Stefan Wager. 2021. “Policy Learning with Observational Data.” Econometrica 89 (1): 133–61.
Brainard, William C. 1967. “Uncertainty and the Effectiveness of Policy.” The American Economic Review 57 (2): 411–25.
Cesa-Bianchi, Nicolo, and Gábor Lugosi. 2006. Prediction, Learning, and Games. Cambridge university press.
Hazan, Elad. 2019. “Introduction to Online Convex Optimization.” http://arxiv.org/abs/1909.05207.
Hirano, Keisuke, and Jack R. Porter. 2020. “Chapter 4 - Asymptotic Analysis of Statistical Decision Rules in Econometrics.” In Handbook of Econometrics, Volume 7a, edited by Steven N. Durlauf, Lars Peter Hansen, James J. Heckman, and Rosa L. Matzkin, 7:283–354. Handbook of Econometrics. Elsevier. https://doi.org/https://doi.org/10.1016/bs.hoe.2020.09.001.
Kallus, Nathan, and Angela Zhou. 2018. “Confounding-Robust Policy Improvement.” arXiv Preprint arXiv:1805.08593.
Kasy, Maximilian. 2018. “Optimal Taxation and Insurance Using Machine Learning—Sufficient Statistics and Beyond.” Journal of Public Economics 167: 205–19.
Kitagawa, Toru, and Aleksey Tetenov. 2018. “Who Should Be Treated? Empirical Welfare Maximization Methods for Treatment Choice.” Econometrica 86 (2): 591–616.
Manski, Charles F. 2021. “Econometrics for Decision Making: Building Foundations Sketched by Haavelmo and Wald.” Econometrica 89 (6): 2827–53.
Mohri, Mehryar, Afshin Rostamizadeh, and Ameet Talwalkar. 2018. Foundations of Machine Learning. MIT press.
Müller, Ulrich K. 2013. “Risk of Bayesian Inference in Misspecified Models, and the Sandwich Covariance Matrix.” Econometrica 81 (5): 1805–49.
Pearl, Judea. 2009. Causality. Cambridge university press.
Shalev-Shwartz, Shai, and others. 2011. “Online Learning and Online Convex Optimization.” Foundations and Trends in Machine Learning 4 (2): 107–94.
Sverdrup, Erik, Ayush Kanodia, Zhengyuan Zhou, Susan Athey, and Stefan Wager. 2020. “Policytree: Policy Learning via Doubly Robust Empirical Welfare Maximization over Trees.” Journal of Open Source Software 5 (50): 2232.
Van Der Vaart, Aad W, and Jon Wellner. 1996. Weak Convergence and Empirical Processes: With Applications to Statistics. Springer Science & Business Media.
Wald, Abraham. 1950. “Statistical Decision Functions.”
Zhou, Zhengyuan, Susan Athey, and Stefan Wager. 2018. “Offline Multi-Action Policy Learning: Generalization and Optimization.” http://arxiv.org/abs/1810.04778.