Process Faster, Pay Less: Functional Isolation for Stream Processing - 专知论文

会员服务 ·

0

Process Faster, Pay Less: Functional Isolation for Stream Processing

翻译：更快处理，更低成本：用于流处理的函数隔离方法

Eleni Zapridou,Michael Koepf,Panagiotis Sioulas,Ioannis Mytilinis,Anastasia Ailamaki

from arxiv, Accepted to the 42nd IEEE International Conference on Data Engineering (ICDE 2026). 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

Concurrent workloads often extract insights from high-throughput, real-time data streams. Existing stream processing engines isolate each query's resources, ensuring robust performance but incurring high infrastructure costs. In contrast, sharing work reduces the amount of necessary resources but introduces inter-query interference, leading to performance degradation for some queries. We introduce FunShare, a stream-processing system that improves resource efficiency without compromising performance by dynamically grouping queries based on their performance characteristics. FunShare strategically relaxes query interdependencies and minimizes redundant computation while preserving individual query performance. It achieves this by using an adaptive optimization framework that monitors execution metrics, accurately estimates computation overlaps, and reconfigures execution plans on the fly in response to changes in the underlying data streams. Our evaluation demonstrates that FunShare minimizes resource consumption compared to isolated execution while maintaining or improving throughput for all queries.

翻译：并发工作负载通常从高吞吐量、实时数据流中提取信息。现有流处理引擎隔离每个查询的资源，确保了稳健的性能，但导致高昂的基础设施成本。相比之下，工作共享减少了所需资源量，但引入了查询间干扰，导致某些查询性能下降。我们提出了FunShare，一个流处理系统，该系统通过根据查询的性能特征动态分组，在不影响性能的情况下提高资源效率。FunShare策略性地放松查询间的依赖关系，并在保持单个查询性能的同时最小化冗余计算。它通过使用自适应优化框架来实现这一点，该框架监控执行指标，准确估计计算重叠，并根据底层数据流的变化实时重新配置执行计划。我们的评估表明，与隔离执行相比，FunShare在最小化资源消耗的同时，维持或提高了所有查询的吞吐量。

0

相关内容

【博士论文】优化智能体工作流以提升信息获取效率

【博士论文】优化智能体工作流以提升信息获取效率

专知会员服务

19+阅读 · 2025年7月7日

【ICML2022】DepthShrinker:一种新的压缩范式，用于提高紧凑神经网络的实际硬件效率

【ICML2022】DepthShrinker:一种新的压缩范式，用于提高紧凑神经网络的实际硬件效率

专知会员服务

11+阅读 · 2022年6月5日

【Manning新书】掌握流式处理系统-实时事件处理，Grokking Streaming Systems Real-time event processing

【Manning新书】掌握流式处理系统-实时事件处理，Grokking Streaming Systems Real-time event processing

专知会员服务

47+阅读 · 2022年3月22日

【Yoshua Bengio】生成式流网络，Generative Flow Networks

【Yoshua Bengio】生成式流网络，Generative Flow Networks

专知会员服务

32+阅读 · 2022年3月19日

【博士论文】集群系统中的网络流调度

【博士论文】集群系统中的网络流调度

专知会员服务

47+阅读 · 2021年12月7日

最新《流处理系统演化》综述论文，34页pdf

最新《流处理系统演化》综述论文，34页pdf

专知会员服务

21+阅读 · 2020年8月4日

【实用书】流数据处理，Streaming Data，219页pdf

【实用书】流数据处理，Streaming Data，219页pdf

专知会员服务

78+阅读 · 2020年4月24日

腾讯信息流内容理解技术实践，A User-Centered Concept Mining System for Query and Document Understanding at Tencent

腾讯信息流内容理解技术实践，A User-Centered Concept Mining System for Query and Document Understanding at Tencent

专知会员服务

41+阅读 · 2019年12月15日

【O'Reilly TensorFlow Conference 2019】HARP：高效的GPU共享系统（HARP: An efficient and elastic GPU-sharing system），Alibaba | Pengfei Fan，Lingling Jin

【O'Reilly TensorFlow Conference 2019】HARP：高效的GPU共享系统（HARP: An efficient and elastic GPU-sharing system），Alibaba | Pengfei Fan，Lingling Jin

专知会员服务

10+阅读 · 2019年11月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

流程/过程挖掘（Process Mining）最新综述

流程/过程挖掘（Process Mining）最新综述

PaperWeekly

23+阅读 · 2022年9月19日

【Flink】基于 Flink 的流式数据实时去重

【Flink】基于 Flink 的流式数据实时去重

AINLP

14+阅读 · 2020年9月29日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python程序员

18+阅读 · 2019年3月28日

使用 Canal 实现数据异构

使用 Canal 实现数据异构

性能与架构

20+阅读 · 2019年3月4日

Fast-OCNet: 更快更好的OCNet.

Fast-OCNet: 更快更好的OCNet.

极市平台

21+阅读 · 2019年2月10日

【大数据】StreamSets：一个大数据采集工具

【大数据】StreamSets：一个大数据采集工具

产业智能官

40+阅读 · 2018年12月5日

最新｜深度离散哈希算法，可用于图像检索！

最新｜深度离散哈希算法，可用于图像检索！

全球人工智能

14+阅读 · 2017年12月15日

tensorflow系列笔记：流程，概念和代码解析

tensorflow系列笔记：流程，概念和代码解析

北京思腾合力科技有限公司

30+阅读 · 2017年11月11日

自然语言处理工具包spaCy介绍

自然语言处理工具包spaCy介绍

AINLP

18+阅读 · 2016年11月14日

基于略图挖掘的在不同时空域的网络流式数据实时处理

国家自然科学基金

1+阅读 · 2015年12月31日

分层异构网络面向视频流的绿色节能通信研究

国家自然科学基金

6+阅读 · 2015年12月31日

智能电网环境下地理分布式互联网数据中心的能量成本降低方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

多标记文本数据流分类方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

面向网络社会的工作流关键技术研究

国家自然科学基金

3+阅读 · 2015年12月31日

数据流发布中的隐私保护理论和方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

海量数据流实时分发技术研究

国家自然科学基金

3+阅读 · 2015年12月31日

移动云计算中数据流应用的动态计算切分技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

云环境中支持混合并行模式的科学工作流的执行优化

国家自然科学基金

0+阅读 · 2014年12月31日

面向大数据计算的高吞吐量众核处理器关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

A Semantic Quantum Circuit Cache for Scalable and Distributed Quantum-Classical Workflows

Arxiv

0+阅读 · 4月29日

Stannic: Systolic STochAstic ONliNe SchedulIng AcCelerator

Arxiv

0+阅读 · 4月17日

Scheduling Coflows in Multi-Core OCS Networks with Performance Guarantee

Arxiv

0+阅读 · 4月9日

LegoDiffusion: Micro-Serving Text-to-Image Diffusion Workflows

Arxiv

0+阅读 · 4月9日

Fast Cross-Operator Optimization of Attention Dataflow

Arxiv

0+阅读 · 4月3日

SwiftQueue: Optimizing Low-Latency Applications with Swift Packet Queuing

Arxiv

0+阅读 · 3月24日

Stannic: Systolic STochAstic ONliNe SchedulIng AcCelerator

Arxiv

0+阅读 · 3月21日

Low-Latency Stateful Stream Processing through Timely and Accurate Prefetching

Arxiv

0+阅读 · 3月20日

PAT: Accelerating LLM Decoding via Prefix-Aware Attention with Resource Efficient Multi-Tile Kernel

Arxiv

0+阅读 · 3月16日

Optimal Short Video Ordering and Transmission Scheduling for Reducing Video Delivery Cost in Peer-to-Peer CDNs

Arxiv

0+阅读 · 3月4日

VIP会员

文章信息

相关主题

最新内容

综述 | 从问答到任务完成：Agent系统与Harness设计

综述 | 从问答到任务完成：Agent系统与Harness设计

专知会员服务

0+阅读 · 今天16:54

Agentic RL：框架、实践与长程智能体训练

Agentic RL：框架、实践与长程智能体训练

专知会员服务

0+阅读 · 今天16:52

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

专知会员服务

6+阅读 · 今天8:00

重新思考无人机时代的生存能力

重新思考无人机时代的生存能力

专知会员服务

5+阅读 · 今天7:44

装甲突击旅：现代战争思考、战斗与组织

装甲突击旅：现代战争思考、战斗与组织

专知会员服务

4+阅读 · 今天7:28

在人工智能加速决策环境中拓展OODA循环

在人工智能加速决策环境中拓展OODA循环

专知会员服务

4+阅读 · 今天7:18

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

专知会员服务

5+阅读 · 今天7:07

军事欺骗：供作战战术指挥官使用的工具

军事欺骗：供作战战术指挥官使用的工具

专知会员服务

4+阅读 · 今天7:03

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

4+阅读 · 6月23日

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

6+阅读 · 6月23日

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

10+阅读 · 6月23日

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

4+阅读 · 6月23日

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

5+阅读 · 6月23日

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

8+阅读 · 6月23日

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

7+阅读 · 6月23日

相关VIP内容

【博士论文】优化智能体工作流以提升信息获取效率

【博士论文】优化智能体工作流以提升信息获取效率

专知会员服务

19+阅读 · 2025年7月7日

【ICML2022】DepthShrinker:一种新的压缩范式，用于提高紧凑神经网络的实际硬件效率

【ICML2022】DepthShrinker:一种新的压缩范式，用于提高紧凑神经网络的实际硬件效率

专知会员服务

11+阅读 · 2022年6月5日

【Manning新书】掌握流式处理系统-实时事件处理，Grokking Streaming Systems Real-time event processing

【Manning新书】掌握流式处理系统-实时事件处理，Grokking Streaming Systems Real-time event processing

专知会员服务

47+阅读 · 2022年3月22日

【Yoshua Bengio】生成式流网络，Generative Flow Networks

【Yoshua Bengio】生成式流网络，Generative Flow Networks

专知会员服务

32+阅读 · 2022年3月19日

【博士论文】集群系统中的网络流调度

【博士论文】集群系统中的网络流调度

专知会员服务

47+阅读 · 2021年12月7日

最新《流处理系统演化》综述论文，34页pdf

最新《流处理系统演化》综述论文，34页pdf

专知会员服务

21+阅读 · 2020年8月4日

【实用书】流数据处理，Streaming Data，219页pdf

【实用书】流数据处理，Streaming Data，219页pdf

专知会员服务

78+阅读 · 2020年4月24日

腾讯信息流内容理解技术实践，A User-Centered Concept Mining System for Query and Document Understanding at Tencent

腾讯信息流内容理解技术实践，A User-Centered Concept Mining System for Query and Document Understanding at Tencent

专知会员服务

41+阅读 · 2019年12月15日

【O'Reilly TensorFlow Conference 2019】HARP：高效的GPU共享系统（HARP: An efficient and elastic GPU-sharing system），Alibaba | Pengfei Fan，Lingling Jin

【O'Reilly TensorFlow Conference 2019】HARP：高效的GPU共享系统（HARP: An efficient and elastic GPU-sharing system），Alibaba | Pengfei Fan，Lingling Jin

专知会员服务

10+阅读 · 2019年11月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

Agentic RL：框架、实践与长程智能体训练

重新思考无人机时代的生存能力

综述 | 从问答到任务完成：Agent系统与Harness设计

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

相关资讯

流程/过程挖掘（Process Mining）最新综述

流程/过程挖掘（Process Mining）最新综述

PaperWeekly

23+阅读 · 2022年9月19日

【Flink】基于 Flink 的流式数据实时去重

【Flink】基于 Flink 的流式数据实时去重

AINLP

14+阅读 · 2020年9月29日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python程序员

18+阅读 · 2019年3月28日

使用 Canal 实现数据异构

使用 Canal 实现数据异构

性能与架构

20+阅读 · 2019年3月4日

Fast-OCNet: 更快更好的OCNet.

Fast-OCNet: 更快更好的OCNet.

极市平台

21+阅读 · 2019年2月10日

【大数据】StreamSets：一个大数据采集工具

【大数据】StreamSets：一个大数据采集工具

产业智能官

40+阅读 · 2018年12月5日

最新｜深度离散哈希算法，可用于图像检索！

最新｜深度离散哈希算法，可用于图像检索！

全球人工智能

14+阅读 · 2017年12月15日

tensorflow系列笔记：流程，概念和代码解析

tensorflow系列笔记：流程，概念和代码解析

北京思腾合力科技有限公司

30+阅读 · 2017年11月11日

自然语言处理工具包spaCy介绍

自然语言处理工具包spaCy介绍

AINLP

18+阅读 · 2016年11月14日

相关论文

A Semantic Quantum Circuit Cache for Scalable and Distributed Quantum-Classical Workflows

Arxiv

0+阅读 · 4月29日

Stannic: Systolic STochAstic ONliNe SchedulIng AcCelerator

Arxiv

0+阅读 · 4月17日

Scheduling Coflows in Multi-Core OCS Networks with Performance Guarantee

Arxiv

0+阅读 · 4月9日

LegoDiffusion: Micro-Serving Text-to-Image Diffusion Workflows

Arxiv

0+阅读 · 4月9日

Fast Cross-Operator Optimization of Attention Dataflow

Arxiv

0+阅读 · 4月3日

SwiftQueue: Optimizing Low-Latency Applications with Swift Packet Queuing

Arxiv

0+阅读 · 3月24日

Stannic: Systolic STochAstic ONliNe SchedulIng AcCelerator

Arxiv

0+阅读 · 3月21日

Low-Latency Stateful Stream Processing through Timely and Accurate Prefetching

Arxiv

0+阅读 · 3月20日

PAT: Accelerating LLM Decoding via Prefix-Aware Attention with Resource Efficient Multi-Tile Kernel

Arxiv

0+阅读 · 3月16日

Optimal Short Video Ordering and Transmission Scheduling for Reducing Video Delivery Cost in Peer-to-Peer CDNs

Arxiv

0+阅读 · 3月4日

相关基金

基于略图挖掘的在不同时空域的网络流式数据实时处理

国家自然科学基金

1+阅读 · 2015年12月31日

分层异构网络面向视频流的绿色节能通信研究

国家自然科学基金

6+阅读 · 2015年12月31日

智能电网环境下地理分布式互联网数据中心的能量成本降低方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

多标记文本数据流分类方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

面向网络社会的工作流关键技术研究

国家自然科学基金

3+阅读 · 2015年12月31日

数据流发布中的隐私保护理论和方法研究

国家自然科学基金

7+阅读 · 2015年12月31日

海量数据流实时分发技术研究

国家自然科学基金

3+阅读 · 2015年12月31日

移动云计算中数据流应用的动态计算切分技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

云环境中支持混合并行模式的科学工作流的执行优化

国家自然科学基金

0+阅读 · 2014年12月31日

面向大数据计算的高吞吐量众核处理器关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员