Analog in-memory computing (AiMC) is an emerging technology that shows fantastic performance superiority for neural network acceleration. However, as the computational bit-width and scale increase, high-precision data conversion and long-distance data routing will result in unacceptable energy and latency overheads in the AiMC system. In this work, we focus on the potential of in-charge computing and in-time interconnection and show an innovative AiMC architecture, named AiDAC, with three key contributions: (1) AiDAC enhances multibit computing efficiency and reduces data conversion times by grouping capacitors technology; (2) AiDAC first adopts row drivers and column time accumulators to achieve large-scale AiMC arrays integration while minimizing the energy cost of data movements. (3) AiDAC is the first work to support large-scale all-analog multibit vector-matrix multiplication (VMM) operations. The evaluation shows that AiDAC maintains high-precision calculation (less than 0.79% total computing error) while also possessing excellent performance features, such as high parallelism (up to 26.2TOPS), low latency (<20ns/VMM), and high energy efficiency (123.8TOPS/W), for 8bits VMM with 1024 input channels.
翻译:模拟内存计算(AiMC)是一种新兴技术,在神经网络加速方面展现出卓越的性能优势。然而,随着计算位宽和规模的增加,高精度数据转换与长距离数据传输将导致AiMC系统产生不可接受的能耗与延迟开销。本文聚焦于电荷域计算及时域互连的潜力,提出一种创新的AiMC架构——AiDAC,其核心贡献有三:(1)通过电容器组技术,AiDAC提升了多比特计算效率并减少了数据转换次数;(2)AiDAC首次采用行驱动器和列时间累加器,在降低数据移动能耗的同时实现大规模AiMC阵列集成;(3)AiDAC是首个支持大规模全模拟多比特向量矩阵乘法(VMM)运算的工作。评估表明,针对1024输入通道的8位VMM操作,AiDAC在保持高精度计算(总计算误差低于0.79%)的同时,具备高性能特性——高并行度(高达26.2TOPS)、低延迟(<20ns/VMM)及高能效(123.8TOPS/W)。