Studying Illustrations in Manuscripts: An Efficient Deep-Learning Approach

Yoav Evron, Michal Bar-Asher Siegal, Michael Fire

From arXiv, 17 pages, 5 figures

The recent Artificial Intelligence (AI) revolution has opened transformative possibilities for the humanities, particularly in unlocking the visual-artistic content embedded in historical illuminated manuscripts. While digital archives now offer unprecedented access to these materials, systematically locating, extracting, and analyzing illustrations at scale remains a major challenge. We present a general and scalable AI-based pipeline for large-scale visual analysis of illuminated manuscripts. The framework integrates modern deep-learning models for page-level illustration detection, illustration extraction, and multimodal description, enabling scholars to search, cluster, and study visual materials and artistic trends across entire corpora. We demonstrate the applicability of this approach on large heterogeneous collections, including the Vatican Library, and on richly illuminated manuscripts such as the Bible of Borso d'Este. The system reveals meaningful visual patterns and cross-manuscript relationships by embedding illustrations into a shared representation space and analyzing their similarity structure (see Figure 4). By harnessing recent advances in computer vision and vision-language models, our framework enables new forms of large-scale visual scholarship in historical studies, art history, and cultural heritage, making it possible to explore iconography, stylistic trends, and cultural connections in ways that were previously impractical.
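The similarity analysis mentioned in the abstract can be illustrated with a minimal sketch. It assumes each extracted illustration has already been mapped to a fixed-length embedding vector by some vision model. The abstract does not name the embedding model or the similarity measure, so cosine similarity, the toy vectors, and the function names `cosine_similarity` and `most_similar` are assumptions made for illustration only, not the authors' implementation.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(
    embeddings: Sequence[Sequence[float]], query_index: int, k: int = 3
) -> List[Tuple[int, float]]:
    """Return the k illustrations closest to the query as (index, similarity) pairs."""
    query = embeddings[query_index]
    scored = [
        (i, cosine_similarity(query, vector))
        for i, vector in enumerate(embeddings)
        if i != query_index  # skip the query itself
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical 3-dimensional embeddings for four illustrations (assumed values).
toy_embeddings = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.9, 0.1],
]

if __name__ == "__main__":
    for index, score in most_similar(toy_embeddings, query_index=0, k=2):
        print(f"illustration {index}: similarity {score:.3f}")
```

With these toy vectors, illustration 1 is ranked closest to illustration 0, because the two vectors point in nearly the same direction. In a real corpus, the same ranking step would run over embeddings of illustrations from different manuscripts, which is one plausible way to surface the cross-manuscript relationships the abstract describes.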

