Despite blockchain data being publicly available, practical challenges and high costs often hinder its effective use by researchers, thus limiting data-driven research and exploration in the blockchain space. This is especially true when it comes to Layer~2 (L2) ecosystems, and ZKsync, in particular. To address these issues, we have curated a dataset from 1 year of activity extracted from a ZKsync Era archive node and made it freely available to external parties. In this paper, we provide details on this dataset and how it was created, showcase a few example analyses that can be performed with it, and discuss some future research directions. We also publish and share the code used in our analysis on GitHub to promote reproducibility and to support further research.
翻译:尽管区块链数据是公开可用的,但实际挑战和高昂成本常常阻碍研究人员对其有效利用,从而限制了区块链领域数据驱动的研究与探索。这在Layer~2(L2)生态系统,特别是ZKsync中尤为突出。为解决这些问题,我们从一个ZKsync Era归档节点中提取了一年的活动数据,整理成一个数据集,并免费提供给外部使用。本文详细介绍了该数据集及其创建过程,展示了利用该数据集可执行的若干示例分析,并讨论了一些未来的研究方向。我们还在GitHub上发布并分享了分析中使用的代码,以促进可重复性并支持进一步的研究。