Automatic Generation of Attention Rules For Containment of Machine Learning Model Errors

Machine learning (ML) solutions are prevalent in many applications. However, many challenges exist in making these solutions business-grade. For instance, maintaining the error rate of the underlying ML models at an acceptably low level. Typically, the true relationship between feature inputs and the target feature to be predicted is uncertain, and hence statistical in nature. The approach we propose is to separate the observations that are the most likely to be predicted incorrectly into 'attention sets'. These can directly aid model diagnosis and improvement, and be used to decide on alternative courses of action for these problematic observations. We present several algorithms (`strategies') for determining optimal rules to separate these observations. In particular, we prefer strategies that use feature-based slicing because they are human-interpretable, model-agnostic, and require minimal supplementary inputs or knowledge. In addition, we show that these strategies outperform several common baselines, such as selecting observations with prediction confidence below a threshold. To evaluate strategies, we introduce metrics to measure various desired qualities, such as their performance, stability, and generalizability to unseen data; the strategies are evaluated on several publicly-available datasets. We use TOPSIS, a Multiple Criteria Decision Making method, to aggregate these metrics into a single quality score for each strategy, to allow comparison.

翻译：机器学习（ML）解决方案在众多应用中广泛存在。然而，要使这些解决方案达到企业级标准仍面临诸多挑战，例如将底层ML模型的错误率维持在可接受的低水平。通常，特征输入与待预测目标特征之间的真实关系存在不确定性，因此具有统计本质。我们提出的方法是将最可能被错误预测的观测值分离为"注意力集"。这些集合可直接辅助模型诊断与改进，并用于为这些有问题的观测值制定替代行动方案。我们提出了多种算法（"策略"）以确定分离这些观测值的最优规则。特别地，我们偏好采用基于特征切片的策略，因其具有人类可解释性、模型无关性，且仅需极少的补充输入或知识。此外，我们证明这些策略的性能优于多种常见基线方法（如选择预测置信度低于阈值的观测值）。为评估策略，我们引入多项指标来度量所需的各种品质，包括性能、稳定性及对未见数据的泛化能力；并在多个公开数据集上对这些策略进行评价。我们采用多准则决策方法TOPSIS将各策略的指标聚合为单一质量评分，以便进行比较。

相关内容

Machine Learning

关注 2251

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日