Administrative extracts are often exchanged as spreadsheets and may be read as reports in their own right during budgeting, workload review, and governance discussions. When an exported workbook becomes the reference snapshot for such decisions, the transformation can be checked by recomputation against a clearly identified input. A deterministic, rule-governed, file-based workflow is implemented in cad_processor.py. The script ingests a Casual Academic Database (CAD) export workbook and aggregates inclusive on-costs and student counts into subject-year and school-year totals, from which it derives cost-per-student ratios. It writes a processed workbook with four sheets: Processing Summary (run record and counters), Trend Analysis (schoolyear cost-per-student matrix), Report (wide subject-level table), and Fuzzy Bands (per-year anchors, membership weights, and band labels). The run record includes a SHA-256 hash of the input workbook bytes to support snapshot-matched recomputation. For within-year interpretation, the workflow adds a simple fuzzy banding layer that labels finite, positive school-year cost-per-student values as Low, Medium, or High. The per-year anchors are the minimum, median, and maximum of the finite, positive ratios. Membership weights are computed using left-shoulder, triangular, and right-shoulder functions, with deterministic tie-breaking in a fixed priority order (Medium, then Low, then High). These weights are treated as decision-support signals rather than probabilities. A worked example provides a reproducible calculation of a band assignment from the reported anchors and ratios. Supplementary material includes a claim-to-evidence matrix, a reproducibility note, and a short glossary that links selected statements to code and workbook artefacts.
翻译:行政数据提取文件常以电子表格形式交换,并在预算编制、工作量审查及治理讨论中作为独立报告使用。当导出的工作簿成为此类决策的参考快照时,可通过基于明确标识输入的重新计算来校验数据转换过程。cad_processor.py 脚本实现了基于确定性规则的文件工作流。该脚本读取临时学术数据库(CAD)导出工作簿,将包含附加成本的支出与学生人数按科目-年度和学院-年度进行聚合,进而计算生均成本比率。脚本输出包含四个工作表:处理摘要(运行记录与计数器)、趋势分析(学院年度生均成本矩阵)、报告(宽格式科目级数据表)以及模糊分带(年度锚点、隶属权重与分带标签)。运行记录包含输入工作簿字节的SHA-256哈希值,以支持快照匹配的重新计算。为便于年度内数据解读,工作流增加了简易模糊分带层,将有限正值的学院年度生均成本标记为低、中、高三档。年度锚点取有限正比率的最小值、中位数与最大值。隶属权重通过左肩形、三角形和右肩形隶属函数计算,并采用固定优先级顺序(中>低>高)进行确定性平局裁决。这些权重被视为决策支持信号而非概率值。工作示例展示了如何根据报告的锚点与比率进行可复现的分带分配计算。补充材料包含证据对应矩阵、可复现性说明及简明术语表,将特定陈述关联至代码与工作簿实体。