This work presents a detailed evaluation of Rust (software) implementations of several popular sketching solutions, as well as recently proposed optimizations. We compare these solutions in terms of computational speed, memory consumption, and several approximation error metrics. Overall, we find a simple hashing based solution employed with the Nitro sampling technique [22] gives the best trade-off between memory, error and speed. Our findings also include some novel insights about how to best combine sampling with Counting Cuckoo filters depending on the application.
翻译:本文对几种主流草图算法及其近期优化方案的Rust软件实现进行了详细评估。我们从计算速度、内存消耗以及多项近似误差指标出发,对这些方案进行了比较。总体而言,我们发现采用Nitro采样技术[22]的简单哈希方案在内存、误差与速度之间实现了最佳平衡。研究还揭示了一些关于如何根据具体应用场景将采样与Counting Cuckoo过滤器进行最佳组合的新颖见解。