AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based Accelerators

This paper addresses the need for automatic and efficient generation of host driver code for arbitrary custom AXI-based accelerators targeting linear algebra algorithms, an important workload in various applications, including machine learning and scientific computing. While existing tools have focused on automating accelerator prototyping, little attention has been paid to the host-accelerator interaction. This paper introduces AXI4MLIR, an extension of the MLIR compiler framework designed to facilitate the automated generation of host-accelerator driver code. With new MLIR attributes and transformations, AXI4MLIR empowers users to specify accelerator features (including their instructions) and communication patterns and exploit the host memory hierarchy. We demonstrate AXI4MLIR's versatility across different types of accelerators and problems, showcasing significant CPU cache reference reductions (up to 56%) and up to a 1.65x speedup compared to manually optimized driver code implementations. AXI4MLIR implementation is open-source and available at: https://github.com/AXI4MLIR/axi4mlir.

翻译：本文针对线性代数算法领域任意自定义AXI加速器的主机驱动代码自动高效生成需求展开研究。线性代数是机器学习与科学计算等应用场景中的核心工作负载。现有工具多聚焦于加速器原型自动化设计，却鲜有关注主机-加速器交互问题。为此，本文提出AXI4MLIR——一种基于MLIR编译器框架的扩展方案，旨在实现主机-加速器驱动代码的自动生成。通过引入新型MLIR属性与变换机制，AXI4MLIR允许用户自主定义加速器特性（包含指令集）、通信模式，并充分利用主机内存层级体系。我们在多种加速器类型与不同规模问题上验证了AXI4MLIR的通用性：相较于人工优化的驱动代码实现，该方案可实现高达56%的CPU缓存引用缩减以及1.65倍加速比。AXI4MLIR实现已开源，代码仓库地址：https://github.com/AXI4MLIR/axi4mlir

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日