ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

Automatic chart to text summarization is an effective tool for the visually impaired people along with providing precise insights of tabular data in natural language to the user. A large and well-structured dataset is always a key part for data driven models. In this paper, we propose ChartSumm: a large-scale benchmark dataset consisting of a total of 84,363 charts along with their metadata and descriptions covering a wide range of topics and chart types to generate short and long summaries. Extensive experiments with strong baseline models show that even though these models generate fluent and informative summaries by achieving decent scores in various automatic evaluation metrics, they often face issues like suffering from hallucination, missing out important data points, in addition to incorrect explanation of complex trends in the charts. We also investigated the potential of expanding ChartSumm to other languages using automated translation tools. These make our dataset a challenging benchmark for future research.

翻译：图表到文本的自动总结是一种有效工具，可为视障人士提供帮助，同时以自然语言向用户呈现表格数据的精确见解。大规模且结构良好的数据集始终是数据驱动模型的关键要素。本文提出ChartSumm：一个大规模基准数据集，包含总计84,363张图表及其元数据和描述，涵盖广泛的主题与图表类型，用于生成短摘要和长摘要。基于强基线的广泛实验表明，尽管这些模型在各种自动评估指标上取得了可观分数，能够生成流畅且信息丰富的摘要，但它们常面临幻觉问题、遗漏重要数据点，以及对图表中复杂趋势的错误解释等挑战。我们还探索了利用自动翻译工具将ChartSumm扩展至其他语言的潜力。这些特性使我们的数据集成为未来研究中一个富有挑战性的基准。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日