Decision Support for Marketplace Policies under Incomplete Evidence: From Replay to Launch Readiness

Marketplace platforms routinely evaluate pricing and allocation policies using logged observational data, yet strong offline performance does not imply that a policy is safe to deploy. In real-time bidding (RTB) marketplaces, reserve-price and floor-policy changes affect not only revenue but also fill, advertiser value, budget pacing, and competition across auctions, creating feedback and interference. The central problem is therefore not to estimate whether a policy improves an offline metric, but to determine whether the available evidence justifies direct launch or only further validation. In this regard, we propose a support-aware decision-support system (DSS) that distinguishes promising from actionable evidence. The framework integrates replay, support-aware off-policy evaluation (OPE), conservative lower-bound ranking, multi-sided guardrails, out-of-time validation, sensitivity analysis, and interference-aware validation design into a claim-preserving pipeline that outputs a launch-readiness classification rather than a single performance estimate. Applying the framework to iPinYou-style RTB logs, we identify a margin-gated floor policy as the leading candidate, with a 47.7% replay yield lift, a 45.8% conservative lower-tail lift, and stable out-of-time performance. However, the framework does not recommend direct launch. A decision-rule ablation shows that simplified pipelines select the same policy but incorrectly recommend deployment, leaving key causal assumptions unresolved. In contrast, the proposed DSS selects the same policy but changes the action to online validation, reflecting missing evidence on propensities, bidder response, and interference. Overall, the contribution is a reproducible DSS protocol that prevents decision overclaim under partial identification and converts offline evaluation into an auditable, action-oriented recommendation.

翻译：市场平台通常利用记录的观测数据评估定价和分配策略，但强劲的离线性能并不意味该策略可安全部署。在实时竞价（RTB）市场中，保留价与底价策略的变动不仅影响收入，还会波及填充率、广告主价值、预算节奏及跨竞拍竞争，产生反馈与干扰效应。因此，核心问题并非评估策略是否提升离线指标，而是判定现有证据足以支持直接发布，抑或仅能支撑进一步验证。基于此，我们提出一种觉察支持力的决策支持系统（DSS），能够区分有前景证据与可行动证据。该框架将回放、支持感知型离线策略评估（OPE）、保守下限排序、多边防护栏、时外验证、敏感性分析及干扰感知验证设计整合为一条遵循声明保留的流水线，最终输出发布准备就绪分类而非单一性能估计。在iPinYou风格RTB日志上的应用表明，边际门控底价策略以47.7%的回放收益提升率、45.8%的保守下限提升率及稳定的时外性能成为首要候选策略。然而，该框架并不推荐直接发布。决策规则消融实验显示，简化流水线虽选中相同策略，但错误建议部署，遗留关键因果假设未解。相比之下，所提出的DSS虽选择同一策略，但将行动转变为在线验证，反映出关于倾向性、竞标者响应及干扰的证据缺失。总体而言，本研究的贡献在于提出一个可复现的DSS协议，可在部分识别条件下防止决策过度声明，将离线评估转化为可审计、面向行动的建议。

相关内容

DSS

关注 479

决策支持系统（Decision Support Systems）期刊中发表的文章的共同主线是它们与支持增强决策制定的理论和技术问题的相关性。所涉及的领域可能包括基础、功能、接口、实现、影响和决策支持系统(DSS)的评估。手稿可以从不同的方法和方法学中获得，包括决策理论、经济学、计量经济学、统计学、计算机支持的协作工作、数据库管理、语言学、管理科学、数学建模、运营管理、认知科学、心理学、用户界面管理等。但是，一份侧重于对任何这些相关领域的直接贡献的手稿应提交给适合于特定领域的机构。官网地址：http://dblp.uni-trier.de/db/journals/dss/

《结构化部分可观测下的序贯决策研究》2026年300页

专知会员服务

16+阅读 · 7月11日

《将战术决策游戏作为认知评估与发展工具》最新26页报告

专知会员服务

23+阅读 · 2025年5月5日

支持作战决策的战略预见

专知会员服务

18+阅读 · 2024年12月5日

【CMU博士论文】不完美信息与实际非理性下的算法决策制定

专知会员服务

27+阅读 · 2024年10月9日