Short-form videos on platforms such as TikTok, Instagram Reels, and YouTube Shorts have become a primary source of information and entertainment. However, many short-form videos are inaccessible to blind and low vision (BLV) viewers because of their rapid visual changes, on-screen text, and music or meme-audio overlays. In our formative study, 7 BLV viewers who regularly watched short-form videos reported frequently skipping such inaccessible content. We present ShortScribe, a system that provides hierarchical visual summaries of short-form videos at three levels of detail to support BLV viewers in selecting and understanding short-form videos. ShortScribe lets BLV users navigate between these descriptions according to their level of interest. To evaluate ShortScribe, we assessed description accuracy and conducted a user study with 10 BLV participants comparing ShortScribe to a baseline interface. When using ShortScribe, participants reported higher comprehension and provided more accurate summaries of video content.