Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis

Ismail Emir Yuksel,Yahya Can Tugrul,Ataberk Olgun,F. Nisa Bostanci,A. Giray Yaglikci,Geraldo F. Oliveira,Haocong Luo,Juan Gómez-Luna,Mohammad Sadrosadati,Onur Mutlu

from arxiv, A shorter version of this work is to appear at the 30th IEEE International Symposium on High-Performance Computer Architecture (HPCA-30), 2024

Processing-using-DRAM (PuD) is an emerging paradigm that leverages the analog operational properties of DRAM circuitry to enable massively parallel in-DRAM computation. PuD has the potential to reduce or eliminate costly data movement between processing elements and main memory. Prior works experimentally demonstrate three-input MAJ (MAJ3) and two-input AND and OR operations in commercial off-the-shelf (COTS) DRAM chips. Yet, demonstrations on COTS DRAM chips do not provide a functionally complete set of operations. We experimentally demonstrate that COTS DRAM chips are capable of performing 1) functionally-complete Boolean operations: NOT, NAND, and NOR and 2) many-input (i.e., more than two-input) AND and OR operations. We present an extensive characterization of new bulk bitwise operations in 256 off-the-shelf modern DDR4 DRAM chips. We evaluate the reliability of these operations using a metric called success rate: the fraction of correctly performed bitwise operations. Among our 19 new observations, we highlight four major results. First, we can perform the NOT operation on COTS DRAM chips with a 98.37% success rate on average. Second, we can perform up to 16-input NAND, NOR, AND, and OR operations on COTS DRAM chips with high reliability (e.g., 16-input NAND, NOR, AND, and OR with an average success rate of 94.94%, 95.87%, 94.94%, and 95.85%, respectively). Third, data pattern only slightly affects bitwise operations. Our results show that executing NAND, NOR, AND, and OR operations with random data patterns decreases the success rate compared to all logic-1/logic-0 patterns by 1.39%, 1.97%, 1.43%, and 1.98%, respectively. Fourth, bitwise operations are highly resilient to temperature changes, with small success rate fluctuations of at most 1.66% when the temperature is increased from 50C to 95C. We open-source our infrastructure at https://github.com/CMU-SAFARI/FCDRAM

翻译：利用DRAM处理（Processing-using-DRAM, PuD）是一种新兴范式，通过利用DRAM电路的模拟运算特性实现大规模并行内存计算。该技术有望减少或消除处理单元与主存之间的高成本数据搬运。先前研究已在商用现货（COTS）DRAM芯片上实验验证了三输入MAJ（MAJ3）以及两输入AND与OR运算，但尚未在COTS DRAM芯片上实现功能完备的运算集合。我们通过实验证明，COTS DRAM芯片能够执行：1）功能完备的布尔运算：NOT、NAND与NOR；2）多输入（即超过两输入）的AND与OR运算。我们对256颗现代商用DDR4 DRAM芯片中的新型批量按位运算进行了广泛表征，采用称为“成功率”（成功执行按位运算的比例）的指标评估其可靠性。在19项新发现中，我们重点指出四项主要结果：第一，COTS DRAM芯片上NOT运算的平均成功率达98.37%；第二，可高可靠性执行多达16输入的NAND、NOR、AND与OR运算（例如16输入NAND、NOR、AND与OR的平均成功率分别为94.94%、95.87%、94.94%与95.85%）；第三，数据模式对按位运算影响有限——相较于全逻辑1/逻辑0模式，随机数据模式执行NAND、NOR、AND与OR运算时成功率分别下降1.39%、1.97%、1.43%与1.98%；第四，按位运算对温度变化高度鲁棒，当温度从50°C升至95°C时，成功率波动幅度不超过1.66%。我们已在https://github.com/CMU-SAFARI/FCDRAM开源相关基础设施。