We introduce WritePolicyBench, a benchmark for evaluating memory write policies: decision rules that choose what to store, merge, and evict under a strict byte budget while processing a stream with document/API drift. The benchmark provides (i) task generators with controlled non-stationarity, (ii) an explicit action interface for external memory, (iii) a byte-accurate cost model, and (iv) standardized metrics that measure both task success and budget efficiency.
翻译:我们提出WritePolicyBench,这是一个用于评估内存写入策略的基准测试框架:该框架旨在严格字节预算条件下处理存在文档/API漂移的数据流时,决策规则应如何选择存储、合并和淘汰内容。该基准测试提供以下功能:(i) 具有可控非平稳性的任务生成器,(ii) 面向外部存储的显式操作接口,(iii) 字节级精确的成本模型,以及(iv) 衡量任务成功率与预算效率的标准化评估指标。