In Polaris, we introduced a cloud-native distributed query processor to perform analytics at scale. In this paper, we extend the underlying Polaris distributed computation framework, which can be thought of as a read-only transaction engine, to execute general transactions (including updates, deletes, inserts and bulk loads, in addition to queries) for Tier 1 warehousing workloads in a highly performant and predictable manner. We take advantage of the immutability of data files in log-structured data stores and build on SQL Server transaction management to deliver full transactional support with Snapshot Isolation semantics, including multi-table and multi-statement transactions. With the enhancements described in this paper, Polaris supports both query processing and transactions for T-SQL in Microsoft Fabric.