Fischer-Schultz Lecture: Generic Machine Learning Inference on Heterogenous Treatment Effects in Randomized Experiments, with an Application to Immunization in India

Machine Learning · 估计/估计量 · Learning · 推断 · 预测器/决策函数 ·

2023 年 6 月 13 日

翻译：Fischer-Schultz讲座：随机实验中异质性处理效应的通用机器学习推断——以印度免疫接种应用为例

Victor Chernozhukov,Mert Demirer,Esther Duflo,Iván Fernández-Val

from arxiv, 81 pages, 8 figures, 17 tables, includes Online Appendix

We propose strategies to estimate and make inference on key features of heterogeneous effects in randomized experiments. These key features include best linear predictors of the effects using machine learning proxies, average effects sorted by impact groups, and average characteristics of most and least impacted units. The approach is valid in high dimensional settings, where the effects are proxied (but not necessarily consistently estimated) by predictive and causal machine learning methods. We post-process these proxies into estimates of the key features. Our approach is generic, it can be used in conjunction with penalized methods, neural networks, random forests, boosted trees, and ensemble methods, both predictive and causal. Estimation and inference are based on repeated data splitting to avoid overfitting and achieve validity. We use quantile aggregation of the results across many potential splits, in particular taking medians of p-values and medians and other quantiles of confidence intervals. We show that quantile aggregation lowers estimation risks over a single split procedure, and establish its principal inferential properties. Finally, our analysis reveals ways to build provably better machine learning proxies through causal learning: we can use the objective functions that we develop to construct the best linear predictors of the effects, to obtain better machine learning proxies in the initial step. We illustrate the use of both inferential tools and causal learners with a randomized field experiment that evaluates a combination of nudges to stimulate demand for immunization in India.

翻译：本文提出在随机实验中估计异质性效应关键特征并进行推断的策略。这些关键特征包括：使用机器学习代理变量的效应最佳线性预测、按影响组别排序的平均效应、以及受影响最大和最小单元的平均特征。该方法适用于高维场景，其中效应可由预测性和因果性机器学习方法代理（但无需一致估计）。我们将这些代理变量后处理为关键特征的估计值。该方法具有通用性，可与惩罚方法、神经网络、随机森林、提升树及集成方法（包括预测性与因果性）结合使用。估计与推断基于重复数据分割以避免过拟合并确保有效性。我们采用跨多次潜在分割的分位数聚合结果，特别是取p值的中位数以及置信区间的中位数与其他分位数。研究表明，分位数聚合相较于单次分割程序可降低估计风险，并建立了其主要推断性质。最后，分析揭示了通过因果学习构建可证明更优的机器学习代理变量的途径：可利用我们开发的目标函数构建效应的最佳线性预测器，从而在初始步骤获得更优的机器学习代理变量。我们通过一项评估多种助推组合以刺激印度免疫接种需求的随机实地实验，展示了推断工具与因果学习器的实际应用。

相关内容

Machine Learning

关注 2251

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日