High-performance Vector-length Agnostic Quantum Circuit Simulations on ARM Processors - 专知论文

会员服务 ·

0

ARM · 向量化 · 设计 · 支持向量 · 可理解性 ·

High-performance Vector-length Agnostic Quantum Circuit Simulations on ARM Processors

翻译：暂无翻译

Ruimin Shi,Gabin Schieffer,Pei-Hung Lin,Maya Gokhale,Andreas Herten,Ivy Peng

from arxiv, To be published in IPDPS2026

ARM SVE and RISC-V RVV are emerging vector architectures in high-end processors that support vectorization of flexible vector length. In this work, we leverage an important workload for quantum computing, quantum state-vector simulations, to understand whether high-performance portability can be achieved in a vector-length agnostic (VLA) design. We propose a VLA design and optimization techniques critical for achieving high performance, including VLEN-adaptive memory layout adjustment, load buffering, fine-grained loop control, and gate fusion-based arithmetic intensity adaptation. We provide an implementation in Google's Qsim and evaluate five quantum circuits of up to 36 qubits on three ARM processors, including NVIDIA Grace, AWS Graviton3, and Fujitsu A64FX. By defining new metrics and PMU events to quantify vectorization activities, we draw generic insights for future VLA designs. Our single-source implementation of VLA quantum simulations achieves up to 4.5x speedup on A64FX, 2.5x speedup on Grace, and 1.5x speedup on Graviton.

翻译：暂无翻译

0

相关内容

ARM

安谋控股公司，又称ARM公司，跨国性半导体设计与软件公司，总部位于英国英格兰剑桥。主要的产品是ARM架构处理器的设计，将其以知识产权的形式向客户进行授权，同时也提供软件开发工具。维基百科

《用于适应性、任务就绪型军用仿生机器人的合成数据管道》

《用于适应性、任务就绪型军用仿生机器人的合成数据管道》

专知会员服务

19+阅读 · 2025年12月29日

仿生机器人技术的军事应用

仿生机器人技术的军事应用

专知会员服务

12+阅读 · 2025年12月4日

人工智能与仿真协同增强军事决策支持能力

人工智能与仿真协同增强军事决策支持能力

专知会员服务

69+阅读 · 2024年10月2日

【剑桥大学博士论文】针对微控制器和应用级处理器的高效空间和时间安全性，192页pdf

【剑桥大学博士论文】针对微控制器和应用级处理器的高效空间和时间安全性，192页pdf

专知会员服务

17+阅读 · 2023年7月7日

军工迈向高质量发展阶段（附报告）

军工迈向高质量发展阶段（附报告）

专知会员服务

42+阅读 · 2023年1月18日

「AI芯片/GPU/NPU/DSP专用处理器」技术特征比较分析最新2022综述论文

「AI芯片/GPU/NPU/DSP专用处理器」技术特征比较分析最新2022综述论文

专知会员服务

65+阅读 · 2022年3月6日

处理器芯片敏捷设计方法：问题与挑战

专知会员服务

19+阅读 · 2021年6月29日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【泡泡图灵智库】基于上采样预积分测量值的3D Lidar-IMU校准来矫正运动失真

【泡泡图灵智库】基于上采样预积分测量值的3D Lidar-IMU校准来矫正运动失真

泡泡机器人SLAM

11+阅读 · 2019年9月17日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

【泡泡一分钟】PIRVS：一个具有灵活传感器融合和硬件协同设计的先进视觉-惯性SLAM系统

【泡泡一分钟】PIRVS：一个具有灵活传感器融合和硬件协同设计的先进视觉-惯性SLAM系统

泡泡机器人SLAM

11+阅读 · 2019年9月11日

【泡泡图灵智库】多传感器深度连续融合的三维目标检测方法

【泡泡图灵智库】多传感器深度连续融合的三维目标检测方法

泡泡机器人SLAM

23+阅读 · 2019年9月7日

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

泡泡机器人SLAM

53+阅读 · 2019年7月9日

【泡泡一分钟】GOMSF——基于多传感器融合的图优化无人机鲁棒位姿估计方法

【泡泡一分钟】GOMSF——基于多传感器融合的图优化无人机鲁棒位姿估计方法

泡泡机器人SLAM

25+阅读 · 2019年7月2日

未来集群智能战争对我国武器装备体系建设的要求和挑战

未来集群智能战争对我国武器装备体系建设的要求和挑战

无人机

24+阅读 · 2019年6月26日

【泡泡图灵智库】基于CPU的实时6D物体姿态估计（arXiv）

【泡泡图灵智库】基于CPU的实时6D物体姿态估计（arXiv）

泡泡机器人SLAM

12+阅读 · 2019年1月26日

最新基于FPGA的深度学习加速器综述论文（附下载）

最新基于FPGA的深度学习加速器综述论文（附下载）

专知

23+阅读 · 2019年1月17日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

纳米尺度自旋电子器件参数化电路模型建立方法的研究

国家自然科学基金

0+阅读 · 2017年12月31日

高精度片上抖动测量关键技术及电路实现研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向数万处理器的有限元线性方程组与模态多级算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

高精度模拟信号处理前端关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

大功率柔顺驱动器的设计方法及能量优化和交互安全机理研究

国家自然科学基金

1+阅读 · 2015年12月31日

嵌入式异构多核系统应用程序自动并行化过程关键技术研究

国家自然科学基金

1+阅读 · 2015年12月31日

多元质量特性下兵器装备协同研制能力网络形成与动态演化机理

国家自然科学基金

2+阅读 · 2015年12月31日

超高速CMOS数模转换器关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

三维连续集成集成电路关键工艺技术和机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

压电智能作动器的高保真完整非线性动力学建模和高精度多通道运动协同同步控制系统一体化优化设计

国家自然科学基金

0+阅读 · 2014年12月31日

Vectorization of Verilog Designs and its Effects on Verification and Synthesis

Arxiv

0+阅读 · 3月17日

ODIN-Based CPU-GPU Architecture with Replay-Driven Simulation and Emulation

Arxiv

0+阅读 · 3月17日

Work Sharing and Offloading for Efficient Approximate Threshold-based Vector Join

Arxiv

0+阅读 · 3月17日

A Unified Calibration Framework for Coordinate and Kinematic Parameters in Dual-Arm Robots

Arxiv

0+阅读 · 3月16日

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Arxiv

0+阅读 · 2月27日

Dimension Reduction in Multivariate Extremes via Latent Linear Factor Models

Arxiv

0+阅读 · 2月26日

Convex Loss Functions for Support Vector Machines (SVMs) and Neural Networks

Arxiv

0+阅读 · 2月25日

A Logic-Reuse Approach to Nibble-based Multiplier Design for Low Power Vector Computing

Arxiv

0+阅读 · 2月22日

Pareto Optimal Benchmarking of AI Models on ARM Cortex Processors for Sustainable Embedded Systems

Arxiv

0+阅读 · 2月20日

High-performance Vector-length Agnostic Quantum Circuit Simulations on ARM Processors

Arxiv

0+阅读 · 2月18日

VIP会员

文章信息

相关主题

相关VIP内容

《用于适应性、任务就绪型军用仿生机器人的合成数据管道》

《用于适应性、任务就绪型军用仿生机器人的合成数据管道》

专知会员服务

19+阅读 · 2025年12月29日

仿生机器人技术的军事应用

仿生机器人技术的军事应用

专知会员服务

12+阅读 · 2025年12月4日

人工智能与仿真协同增强军事决策支持能力

人工智能与仿真协同增强军事决策支持能力

专知会员服务

69+阅读 · 2024年10月2日

【剑桥大学博士论文】针对微控制器和应用级处理器的高效空间和时间安全性，192页pdf

【剑桥大学博士论文】针对微控制器和应用级处理器的高效空间和时间安全性，192页pdf

专知会员服务

17+阅读 · 2023年7月7日

军工迈向高质量发展阶段（附报告）

军工迈向高质量发展阶段（附报告）

专知会员服务

42+阅读 · 2023年1月18日

「AI芯片/GPU/NPU/DSP专用处理器」技术特征比较分析最新2022综述论文

「AI芯片/GPU/NPU/DSP专用处理器」技术特征比较分析最新2022综述论文

专知会员服务

65+阅读 · 2022年3月6日

处理器芯片敏捷设计方法：问题与挑战

专知会员服务

19+阅读 · 2021年6月29日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

边缘侧具身基础模型：部署约束与缓解策略综述

《认知战：定义、框架与案例研究》最新29页

【斯坦福博士论文】利用在线交互经验提升机器人学习稳健性的算法研究

土耳其以新型集成化C5ISR架构拓展人工智能赋能水下作战

相关资讯

【泡泡图灵智库】基于上采样预积分测量值的3D Lidar-IMU校准来矫正运动失真

【泡泡图灵智库】基于上采样预积分测量值的3D Lidar-IMU校准来矫正运动失真

泡泡机器人SLAM

11+阅读 · 2019年9月17日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

【泡泡一分钟】PIRVS：一个具有灵活传感器融合和硬件协同设计的先进视觉-惯性SLAM系统

【泡泡一分钟】PIRVS：一个具有灵活传感器融合和硬件协同设计的先进视觉-惯性SLAM系统

泡泡机器人SLAM

11+阅读 · 2019年9月11日

【泡泡图灵智库】多传感器深度连续融合的三维目标检测方法

【泡泡图灵智库】多传感器深度连续融合的三维目标检测方法

泡泡机器人SLAM

23+阅读 · 2019年9月7日

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

【泡泡图灵智库】PL-VIO：使用点和线特征的紧耦合单目视觉惯性里程计

泡泡机器人SLAM

53+阅读 · 2019年7月9日

【泡泡一分钟】GOMSF——基于多传感器融合的图优化无人机鲁棒位姿估计方法

【泡泡一分钟】GOMSF——基于多传感器融合的图优化无人机鲁棒位姿估计方法

泡泡机器人SLAM

25+阅读 · 2019年7月2日

未来集群智能战争对我国武器装备体系建设的要求和挑战

未来集群智能战争对我国武器装备体系建设的要求和挑战

无人机

24+阅读 · 2019年6月26日

【泡泡图灵智库】基于CPU的实时6D物体姿态估计（arXiv）

【泡泡图灵智库】基于CPU的实时6D物体姿态估计（arXiv）

泡泡机器人SLAM

12+阅读 · 2019年1月26日

最新基于FPGA的深度学习加速器综述论文（附下载）

最新基于FPGA的深度学习加速器综述论文（附下载）

专知

23+阅读 · 2019年1月17日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

相关论文

Vectorization of Verilog Designs and its Effects on Verification and Synthesis

Arxiv

0+阅读 · 3月17日

ODIN-Based CPU-GPU Architecture with Replay-Driven Simulation and Emulation

Arxiv

0+阅读 · 3月17日

Work Sharing and Offloading for Efficient Approximate Threshold-based Vector Join

Arxiv

0+阅读 · 3月17日

A Unified Calibration Framework for Coordinate and Kinematic Parameters in Dual-Arm Robots

Arxiv

0+阅读 · 3月16日

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Arxiv

0+阅读 · 2月27日

Dimension Reduction in Multivariate Extremes via Latent Linear Factor Models

Arxiv

0+阅读 · 2月26日

Convex Loss Functions for Support Vector Machines (SVMs) and Neural Networks

Arxiv

0+阅读 · 2月25日

A Logic-Reuse Approach to Nibble-based Multiplier Design for Low Power Vector Computing

Arxiv

0+阅读 · 2月22日

Pareto Optimal Benchmarking of AI Models on ARM Cortex Processors for Sustainable Embedded Systems

Arxiv

0+阅读 · 2月20日

High-performance Vector-length Agnostic Quantum Circuit Simulations on ARM Processors

Arxiv

0+阅读 · 2月18日

相关基金

纳米尺度自旋电子器件参数化电路模型建立方法的研究

国家自然科学基金

0+阅读 · 2017年12月31日

高精度片上抖动测量关键技术及电路实现研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向数万处理器的有限元线性方程组与模态多级算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

高精度模拟信号处理前端关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

大功率柔顺驱动器的设计方法及能量优化和交互安全机理研究

国家自然科学基金

1+阅读 · 2015年12月31日

嵌入式异构多核系统应用程序自动并行化过程关键技术研究

国家自然科学基金

1+阅读 · 2015年12月31日

多元质量特性下兵器装备协同研制能力网络形成与动态演化机理

国家自然科学基金

2+阅读 · 2015年12月31日

超高速CMOS数模转换器关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

三维连续集成集成电路关键工艺技术和机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

压电智能作动器的高保真完整非线性动力学建模和高精度多通道运动协同同步控制系统一体化优化设计

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员