Everyday life is increasingly influenced by artificial intelligence, and there is no question that machine learning algorithms must be designed to be reliable and trustworthy for everyone. Specifically, computer scientists consider an artificial intelligence system safe and trustworthy if it fulfills five pillars: explainability, robustness, transparency, fairness, and privacy. In addition to these five, we propose a sixth fundamental aspect: conformity, that is, the probabilistic assurance that the system will behave as the machine learner expects. In this paper, we propose a methodology to link conformal prediction with explainable machine learning by defining CONFIDERAI, a new score function for rule-based models that leverages both rules predictive ability and points geometrical position within rules boundaries. We also address the problem of defining regions in the feature space where conformal guarantees are satisfied by exploiting techniques to control the number of non-conformal samples in conformal regions based on support vector data description (SVDD). The overall methodology is tested with promising results on benchmark and real datasets, such as DNS tunneling detection or cardiovascular disease prediction.
翻译:日常生活正日益受到人工智能的影响,毋庸置疑,机器学习算法必须设计得对每个人都可靠且值得信赖。具体而言,计算机科学家认为,一个人工智能系统若满足可解释性、鲁棒性、透明度、公平性和隐私性这五大支柱,便是安全可信的。在此五项基础之上,我们提出第六项关键要素:一致性,即系统将按机器学习者预期方式运行的概率性保证。本文提出了一种将符合预测与可解释机器学习相结合的方法论,通过定义CONFIDERAI(一种针对规则模型的新型评分函数),该函数同时利用了规则的预测能力以及点在规则边界内的几何位置。此外,我们还通过基于支持向量数据描述(SVDD)的技术来控制符合区域中非符合样本数量,从而解决了在特征空间中定义满足符合性保证区域的问题。整体方法论在基准数据集和真实数据集(如DNS隧道检测或心血管疾病预测)上进行了测试,取得了令人满意的结果。