Scorio.jl is a Julia package for evaluating and ranking systems from repeated responses to shared tasks. It provides a common tensor-based interface for direct score-based, pairwise, psychometric, voting, graph, and listwise methods, so the same benchmark can be analyzed under multiple ranking assumptions. We describe the package design, position it relative to existing Julia tools, and report pilot experiments on synthetic rank recovery, stability under limited trials, and runtime scaling.