The kappa statistic is the most widely used measure of inter-rater agreement for categorical data. Despite its popularity, applied researchers often encounter two major hurdles: (i) determining the sample size required to achieve a desired level of agreement with given power, and (ii) computing appropriate kappa coefficients with proper interpretation. Existing R packages such as irr and kappaSize provide these functionalities but require programming skills and lack an integrated, user-friendly interface. We present CATEKAPPA, an R package that bridges this gap by combining sample size planning (via kappaSize) and agreement analysis (via irr) into a single Shiny-based web application. The package supports Cohen's kappa for two raters, Fleiss' kappa for three or more raters, and Light's kappa, and provides automatic interpretation using the Landis & Koch scale. Users can either launch an interactive graphical interface or use command-line functions for scripting. The package is freely available on CRAN.
翻译:Kappa统计量是分类数据中最广泛使用的一致性度量指标。尽管其应用普遍,但实际研究者常面临两大障碍:(i)确定在给定统计功效下达到期望一致性水平所需的样本量;(ii)计算合适的Kappa系数并进行准确解读。现有R包如irr和kappaSize虽能提供这些功能,但要求用户具备编程技能,且缺乏集成化的用户友好界面。我们提出CATEKAPPA这一R包,通过将样本量规划(经由kappaSize包)与一致性分析(经由irr包)整合至基于Shiny的单一网络应用程序,填补了这一空白。该包支持两位评估者的Cohen's Kappa、三位及以上评估者的Fleiss' Kappa及Light's Kappa,并依据Landis & Koch分级标准提供自动解读。用户可通过交互式图形界面启动程序,也可使用命令行函数进行脚本化操作。该R包已在CRAN上免费发布。