Semantic Analysis of Macro Usage for Portability

from arxiv, 12 pages. 4 figures. 2 tables. To appear in the 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE '24), April 14-20, 2024, Lisbon, Portugal. See https://zenodo.org/doi/10.5281/zenodo.7783131 for the latest version of the artifact associated with this paper

C is an unsafe language. Researchers have been developing tools to port C to safer languages such as Rust, Checked C, or Go. Existing tools, however, resort to preprocessing the source file first, then porting the resulting code, leaving barely recognizable code that loses macro abstractions. To preserve macro usage, porting tools need analyses that understand macro behavior to port to equivalent constructs. But macro semantics differ from typical functions, precluding simple syntactic transformations to port them. We introduce the first comprehensive framework for analyzing the portability of macro usage. We decompose macro behavior into 26 fine-grained properties and implement a program analysis tool, called Maki, that identifies them in real-world code with 94% accuracy. We apply Maki to 21 programs containing a total of 86,199 macro definitions. We found that real-world macros are much more portable than previously known. More than a third (37%) are easy-to-port, and Maki provides hints for porting more complicated macros. We find, on average, 2x more easy-to-port macros and up to 7x more in the best case compared to prior work. Guided by Maki's output, we found and hand-ported macros in four real-world programs. We submitted patches to Linux maintainers that transform eleven macros, nine of which have been accepted.

翻译：C 语言是一种不安全的语言。研究人员一直致力于开发工具，将C语言移植到更安全的语言，如Rust、Checked C或Go。然而，现有工具通常先对源文件进行预处理，然后移植生成的代码，导致产生几乎无法识别的代码，并丢失了宏抽象。为了保留宏的使用，移植工具需要能够理解宏行为的分析，以便将其移植到等效的构件。但宏的语义与典型函数不同，排除了通过简单语法转换进行移植的可能性。我们首次提出了一个用于分析宏使用可移植性的综合框架。我们将宏行为分解为26个细粒度属性，并实现了一个名为Maki的程序分析工具，该工具在实际代码中识别这些属性的准确率达到94%。我们将Maki应用于包含总计86,199个宏定义的21个程序。我们发现，实际代码中的宏的可移植性远超以往认知。超过三分之一（37%）的宏易于移植，且Maki为移植更复杂的宏提供了提示。与先前工作相比，我们平均发现的易于移植宏数量是原来的两倍，最佳情况下可达七倍。在Maki输出的指导下，我们为四个实际程序中的宏进行了手动移植。我们向Linux维护者提交了补丁，转化了11个宏，其中9个已被接受。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

专知会员服务

32+阅读 · 2022年3月12日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日