The increasing advancements in the field of machine learning have led to the development of numerous applications that effectively address a wide range of problems with accurate predictions. However, in certain cases, accuracy alone may not be sufficient. Many real-world problems also demand explanations and interpretability behind the predictions. One of the most popular interpretable models that are classification rules. This work aims to propose an incremental model for learning interpretable and balanced rules based on MaxSAT, called IMLIB. This new model was based on two other approaches, one based on SAT and the other on MaxSAT. The one based on SAT limits the size of each generated rule, making it possible to balance them. We suggest that such a set of rules seem more natural to be understood compared to a mixture of large and small rules. The approach based on MaxSAT, called IMLI, presents a technique to increase performance that involves learning a set of rules by incrementally applying the model in a dataset. Finally, IMLIB and IMLI are compared using diverse databases. IMLIB obtained results comparable to IMLI in terms of accuracy, generating more balanced rules with smaller sizes.
翻译:机器学习领域的不断进步催生了众多应用,这些应用通过准确的预测有效解决了广泛的问题。然而,在某些情况下,仅凭准确性可能并不足够。许多现实世界的问题还要求对预测结果提供解释和可解释性。分类规则是最受欢迎的可解释模型之一。本文旨在提出一种基于MaxSAT的增量模型,用于学习可解释且平衡的规则,称为IMLIB。该新模型基于另外两种方法,一种基于SAT,另一种基于MaxSAT。基于SAT的方法限制了每条生成规则的大小,从而能够实现规则的平衡。我们认为,与混合大小规则相比,这样的一组规则似乎更自然且易于理解。基于MaxSAT的方法称为IMLI,它通过一种增量应用模型在数据集上学习一组规则的技巧来提高性能。最后,使用多种数据库对IMLIB和IMLI进行了比较。在准确性方面,IMLIB获得了与IMLI相当的结果,同时生成了更平衡且规模更小的规则。