Scene text removal (STR) is the image transformation task to remove text regions in scene images. The conventional STR methods remove all scene text. This means that the existing methods cannot select text to be removed. In this paper, we propose a novel task setting named selective scene text removal (SSTR) that removes only target words specified by the user. Although SSTR is a more complex task than STR, the proposed multi-module structure enables efficient training for SSTR. Experimental results show that the proposed method can remove target words as expected.
翻译:场景文字移除(STR)是一种从场景图像中移除文字区域的图像变换任务。传统的STR方法会移除所有场景文字,这意味着现有方法无法选择要移除的文字。本文提出了一种名为选择性场景文字移除(SSTR)的新型任务设定,该设定仅移除用户指定的目标词汇。尽管SSTR比STR更复杂,但所提出的多模块结构能够实现SSTR的高效训练。实验结果表明,所提方法能够按预期移除目标词汇。