Handwritten character recognition (HCR) is a challenging problem for machine learning researchers. Unlike printed text data, handwritten character datasets have more variation due to human-introduced bias. With numerous unique character classes present, some data, such as Logographic Scripts or Sino-Korean character sequences, bring new complications to the HCR problem. The classification task on such datasets requires the model to learn high-complexity details of the images that share similar features. With recent advances in computational resource availability and further computer vision theory development, some research teams have effectively addressed the arising challenges. Although known for achieving high efficiency, many common approaches are still not generalizable and use dataset-specific solutions to achieve better results. Due to complex structure and high computing demands, existing methods frequently prevent the solutions from gaining popularity. This paper proposes a straightforward, generalizable, and highly effective approach (CharNet) for detailed character image classification and compares its performance to that of existing approaches.
翻译:手写字符识别(HCR)是机器学习研究者面临的一项具有挑战性的问题。与印刷文本数据不同,手写字符数据集由于人为引入的偏差而具有更大的变异性。由于存在大量独特的字符类别,某些数据(如表意文字或朝韩字符序列)为HCR问题带来了新的复杂性。对此类数据集的分类任务要求模型学习图像中共享相似特征的高复杂度细节。随着计算资源可用性的最新进展以及计算机视觉理论的进一步发展,一些研究团队已有效应对了这些新兴挑战。尽管常见方法以高效著称,但许多方法仍缺乏泛化性,并采用特定于数据集的解决方案来获得更优结果。由于结构复杂且计算需求高,现有方法往往阻碍了解决方案的推广应用。本文提出了一种直接、通用且高效的方法(CharNet)用于精细字符图像分类,并将其性能与现有方法进行了比较。