Objective: This research explores using crowdsourcing for software usability evaluation. Background: Usability studies are essential for designing user-friendly software, but traditional methods are often costly and time-consuming. Crowdsourcing offers a quicker, cost-effective alternative for remote usability evaluation, though ensuring quality feedback remains a challenge. Method: A systematic mapping study was conducted to review current usability evaluation research. Subsequently, multi-experiments were performed, comparing novice crowd usability inspectors to experts using expert heuristic evaluation as a benchmark. These results were used to create and validate a framework for crowd usability inspection through a case study. Results: The mapping study identified expert heuristic evaluation as a prevalent method, especially for websites. Experimental findings showed that novice crowd usability inspections, guided by expert heuristics, can match experts in identifying usability issues in content, quality, severity, and time efficiency. The case study demonstrated that the framework allows effective usability inspections, leading to successful software redesigns. Iterations of 3-5 novice inspections effectively resolved key usability issues within three cycles. Conclusion: Crowdsourcing is an effective alternative to expert heuristic evaluation for usability assessment. The proposed framework for crowd usability inspection is a viable solution for budget-constrained software companies. Keywords: crowdsourcing, crowd usability evaluation, expert heuristic evaluation, framework.
翻译:目的:本研究探讨利用众包进行软件可用性评估。背景:可用性研究对于设计用户友好型软件至关重要,但传统方法通常成本高昂且耗时。众包为远程可用性评估提供了一种更快捷、更具成本效益的替代方案,但确保反馈质量仍是一个挑战。方法:通过系统性映射研究回顾当前可用性评估研究。随后,进行了多项实验,以专家启发式评估为基准,将新手众包可用性检查员与专家进行比较。这些结果被用于通过案例研究创建并验证一个众包可用性检查框架。结果:映射研究确定专家启发式评估是一种普遍使用的方法,尤其适用于网站。实验结果表明,在专家启发式原则指导下,新手众包可用性检查在识别内容、质量、严重性和时间效率方面的可用性问题上能够达到专家水平。案例研究表明,该框架支持有效的可用性检查,从而成功实现软件重新设计。经过3-5轮新手检查迭代,关键可用性问题在三个周期内得到有效解决。结论:对于可用性评估,众包是专家启发式评估的有效替代方案。所提出的众包可用性检查框架是预算有限软件公司的可行解决方案。关键词:crowdsourcing, crowd usability evaluation, expert heuristic evaluation, framework.