We report on the ongoing development of arXiv's HTML Papers offering, available on every new TeX/LaTeX submission since its initial release in 2023. The main highlights from 2025 and early 2026 are: (i) community-driven improvements to HTML fidelity and service health, with roughly half of 6,000 user reports resolved; (ii) corpus-scale conversion work aimed at 90% error-free HTML (currently 75%); (iii) initial MathML 4 Intent annotations for accessible speech output; (iv) an in-progress Rust port of LaTeXML, reducing compute costs and enabling faster previews on submission. The arXiv HTML Papers project remains experimental, but is gradually maturing as we better understand the needs of arXiv's readers and the technical opportunities presented by new standards and by advances in programming languages and AI.
翻译:我们报告了arXiv HTML论文服务(自2023年首次发布起应用于每份新TeX/LaTeX投稿)的持续开发进展。2025年至2026年初的主要亮点包括:(i)社区驱动的HTML保真度与服务健康改进,目前约半数(6000份)用户报告已得到解决;(ii)面向语料库级别的转换工作,目标实现90%无错误HTML(当前为75%);(iii)初步的MathML 4意图标注,以支持无障碍语音输出;(iv)正在进行中的LaTeXML Rust移植,可降低计算成本并加快投稿预览生成。arXiv HTML论文项目仍处于实验阶段,但随着我们对arXiv读者需求以及新标准、编程语言与人工智能技术进步带来的技术机遇的深入理解,该项目正逐步走向成熟。