Annotation tools are the starting point for creating Natural Language Processing (NLP) datasets. A wide variety of tools is available; setting them up, however, is a hindrance. We propose EEVEE, an annotation tool focused on simplicity, efficiency, and ease of use. It can run directly in the browser (no setup required) and uses tab-separated files (as opposed to character offsets or task-specific formats) for annotation. It allows multiple tasks to be annotated on a single dataset and supports four task types: sequence labeling, span labeling, text classification, and sequence-to-sequence (seq2seq).
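To illustrate what a tab-separated annotation file might look like in practice, the following minimal sketch reads a token-per-line file in which each additional column holds the labels for one task. The column layout, the blank-line sentence separator, the function name `read_tsv_annotations`, and the example labels are all assumptions made for illustration; the exact file layout used by EEVEE is not specified here.

```python
from typing import List, Tuple

# One sentence is a list of (token, labels) pairs, with one label per task column.
Sentence = List[Tuple[str, List[str]]]


def read_tsv_annotations(text: str) -> List[Sentence]:
    """Parse token-per-line, tab-separated annotations (hypothetical layout).

    Assumed layout: first column is the token, every further column is the
    label for one annotation task; a blank line ends a sentence.
    """
    sentences: List[Sentence] = []
    current: Sentence = []
    for line in text.splitlines():
        if not line.strip():
            # Blank line: close the current sentence, if any.
            if current:
                sentences.append(current)
                current = []
            continue
        token, *labels = line.split("\t")
        current.append((token, labels))
    if current:
        sentences.append(current)
    return sentences


# Invented example with two task columns (part-of-speech and a span tag).
example = (
    "EEVEE\tPROPN\tB-TOOL\n"
    "runs\tVERB\tO\n"
    "\n"
    "No\tDET\tO\n"
    "setup\tNOUN\tO\n"
)
sentences = read_tsv_annotations(example)
```

Because every task occupies its own column, adding another task to the same dataset only adds a column, and no character offsets have to be recomputed.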