Using artificial-intelligence tools to make LaTeX content accessible to blind readers

Screen-reader software enables blind users to access large segments of electronic content, particularly if accessibility standards are followed. Unfortunately, this is not true for much of the content written in physics, mathematics, and other STEM disciplines, because it relies heavily on mathematical symbols and expressions, which screen-reader software generally fails to process correctly. A large portion of such content is based on source documents written in LaTeX, which are rendered to PDF or HTML for online distribution. The resulting PDF documents are essentially inaccessible, and the HTML documents vary greatly in accessibility, since rendering them with standard tools is cumbersome at best. This paper explores the possibility of generating standards-compliant, accessible HTML from LaTeX sources using Large Language Models. It is found that the resulting documents are highly accessible, with possible complications occurring when the artificial intelligence tool starts to interpret the content.

