We introduce PennyLane's Lightning suite, a collection of high-performance state-vector simulators targeting CPU, GPU, and HPC-native architectures and workloads. Quantum applications such as QAOA, VQE, and synthetic workloads are implemented to demonstrate the supported classical computing architectures and showcase the scale of problems that can be simulated using our tooling. We benchmark the performance of Lightning with backends supporting CPUs, as well as NVidia and AMD GPUs, and compare the results to other commonly used high-performance simulator packages, demonstrating where Lightning's implementations give performance leads. We show improved CPU performance by employing explicit SIMD intrinsics and multi-threading, batched task-based execution across multiple GPUs, and distributed forward and gradient-based quantum circuit executions across multiple nodes. Our data shows we can comfortably simulate a variety of circuits, giving examples with up to 30 qubits on a single device or node, and up to 41 qubits using multiple nodes.
翻译:我们介绍PennyLane的Lightning工具套件,这是一组面向CPU、GPU及HPC原生架构与工作负载的高性能态矢量模拟器集合。通过实现QAOA、VQE及合成工作负载等量子应用,我们展示了所支持的经典计算架构,并展现了利用该工具可模拟的问题规模。我们针对支持CPU、NVIDIA与AMD GPU的后端对Lightning性能进行基准测试,并将结果与其他常用高性能模拟器包比较,证明Lightning在特定实现中的性能优势。通过显式SIMD内建指令与多线程技术,我们实现了CPU性能提升;通过跨多GPU的批处理任务执行,以及跨多节点的分布式前向与梯度量子电路计算,进一步优化性能。数据显示,我们可流畅模拟多种电路:单设备或单节点上最高支持30量子比特,多节点场景下可达41量子比特。