PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications

Compound AI applications, which compose calls to ML models using a general-purpose programming language like Python, are widely used for a variety of user-facing tasks, from software engineering to enterprise automation, making their end-to-end latency a critical bottleneck. In contrast to traditional applications, execution time is dominated by the external components, which cannot be handled by traditional language optimization systems, like optimizing compilers. To address this problem, we develop PopPy, a system that can uncover parallelization opportunities in Python applications that invoke these heavy external components, including those used in compound AI applications. PopPy supports a very expressive fragment of Python and requires minimal developer input to uncover parallelism. It combines an ahead-of-time compiler with a runtime, addressing three key challenges in extracting parallelism from Python applications: language complexity, dynamic dispatch, and variable mutation. On a set of real-world compound AI applications, PopPy achieves up to $6.4\times$ speedups in end-to-end execution time compared to standard Python execution while preserving the sequential program semantics.

翻译：摘要：复合AI应用通过通用编程语言（如Python）编排对机器学习模型的调用，广泛用于从软件工程到企业自动化的各类面向用户任务，这使得其端到端延迟成为关键瓶颈。与传统应用不同，其执行时间主要由外部组件决定，而传统语言优化系统（如优化编译器）无法处理这些外部组件。为解决此问题，我们开发了PopPy系统，该系统能够发现调用重外部组件的Python应用（包括复合AI应用）中的并行化机会。PopPy支持Python中极具表现力的子集，仅需最少开发人员输入即可发现并行性。它结合了提前编译器和运行时，解决了从Python应用中提取并行性的三个关键挑战：语言复杂性、动态调度与变量修改。在一组真实世界复合AI应用上，与标准Python执行相比，PopPy在保持顺序程序语义的同时，实现了高达$6.4\times$的端到端执行时间加速。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【新书】《学习AI辅助的Python编程（第2版）》

专知会员服务

69+阅读 · 2024年10月22日

【新书】机器学习在金融领域：掌握使用Python驱动的机器学习的金融策略

专知会员服务

38+阅读 · 2024年6月11日

【MIT博士论文】人工智能系统的组合泛化，194页pdf

专知会员服务

61+阅读 · 2023年11月15日

【干货书】深度强化学习Python实战:算法的简洁实现，简化数学，以及TensorFlow和PyTorch的使用，447页pdf

专知会员服务

85+阅读 · 2022年8月2日