Identifying Causes of Test Unfairness: Manipulability and Separability


Youmi Suk, Weicong Lyu

From arXiv; the main text is 20 pages.

Differential item functioning (DIF) is a widely used statistical notion for identifying items that may disadvantage specific groups of test-takers. These groups are often defined by non-manipulable characteristics, e.g., gender, race/ethnicity, or English-language learner (ELL) status. While DIF can be framed as a causal fairness problem by treating group membership as the treatment variable, this invokes the long-standing controversy over the interpretation of causal effects for non-manipulable treatments. To better identify and interpret causal sources of DIF, this study leverages an interventionist approach using treatment decomposition proposed by Robins and Richardson (2010). Under this framework, we can decompose a non-manipulable treatment into intervening variables. For example, ELL status can be decomposed into English vocabulary unfamiliarity and classroom learning barriers, each of which influences the outcome through different causal pathways. We formally define separable DIF effects associated with these decomposed components, depending on the absence or presence of item impact, and provide causal identification strategies for each effect. We then apply the framework to biased test items in the SAT and Regents exams. We also provide formal detection methods using causal machine learning methods, namely causal forests and Bayesian additive regression trees, and demonstrate their performance through a simulation study. Finally, we discuss the implications of adopting interventionist approaches in educational testing practices.
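The treatment-decomposition idea can be illustrated with a toy simulation. The sketch below is not the authors' estimator or data: the structural model, its coefficients, and the names `simulate_item_mean`, `n_vocab`, and `o_class` are all assumptions made for illustration. It supposes that ELL status splits into a vocabulary-unfamiliarity component that acts on the item response directly and a classroom-barrier component that acts only through ability, then compares hypothetical interventions that set each component separately.

```python
import math
import random


def sigmoid(x):
    """Logistic function mapping a logit to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def simulate_item_mean(n_vocab, o_class, n_students=5000, seed=0):
    """Mean probability of a correct response under a hypothetical
    intervention that sets the two components of ELL status separately.

    n_vocab: 1 if English vocabulary unfamiliarity is present, else 0
    o_class: 1 if classroom learning barriers are present, else 0
    All coefficients below are illustrative assumptions.
    """
    rng = random.Random(seed)  # same seed in every arm: common random numbers
    total = 0.0
    for _ in range(n_students):
        # Assumed pathway 1: classroom barriers lower ability.
        theta = rng.gauss(0.0, 1.0) - 0.5 * o_class
        # Assumed pathway 2: vocabulary unfamiliarity affects the item directly,
        # over and above ability.
        logit = 1.2 * theta - 0.2 - 0.8 * n_vocab
        total += sigmoid(logit)
    return total / n_students


# In observed data both components equal group membership (1,1 or 0,0).
m11 = simulate_item_mean(n_vocab=1, o_class=1)
m00 = simulate_item_mean(n_vocab=0, o_class=0)
# Hypothetical "cross-world" arm: barriers present, vocabulary issue removed.
m01 = simulate_item_mean(n_vocab=0, o_class=1)

total_effect = m11 - m00
via_vocab = m11 - m01   # effect of the vocabulary component, barriers held at 1
via_class = m01 - m00   # effect of the classroom component, vocabulary held at 0

print(f"total effect:        {total_effect:.3f}")
print(f"via vocabulary:      {via_vocab:.3f}")
print(f"via classroom/abil.: {via_class:.3f}")
```

In this toy model the two component effects sum to the total group difference by construction. In real data the mixed arm (`n_vocab=0`, `o_class=1`) is never observed directly, which is why the abstract pairs the definitions with causal identification strategies.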

