ML-Powered Index Tuning: An Overview of Recent Progress and Open Challenges

The scale and complexity of workloads in modern cloud services have brought into sharper focus a critical challenge in automated index tuning -- the need to recommend high-quality indexes while maintaining index tuning scalability. This challenge is further compounded by the requirement for automated index implementations to introduce minimal query performance regressions in production deployments, representing a significant barrier to achieving scalability and full automation. This paper directs attention to these challenges within automated index tuning and explores ways in which machine learning (ML) techniques provide new opportunities in their mitigation. In particular, we reflect on recent efforts in developing ML techniques for workload selection, candidate index filtering, speeding up index configuration search, reducing the amount of query optimizer calls, and lowering the chances of performance regressions. We highlight the key takeaways from these efforts and underline the gaps that need to be closed for their effective functioning within the traditional index tuning framework. Additionally, we present a preliminary cross-platform design aimed at democratizing index tuning across multiple SQL-like systems -- an imperative in today's continuously expanding data system landscape. We believe our findings will help provide context and impetus to the research and development efforts in automated index tuning.

翻译：现代云服务工作负载的规模与复杂性，使得自动化索引调优中的一个关键挑战愈发凸显——即在保持索引调优可扩展性的同时，推荐高质量索引的需求。自动化索引实现需在生产部署中引入最小的查询性能回退（query performance regression），这一要求进一步加剧了上述挑战，成为实现可扩展性与完全自动化的重大障碍。本文聚焦于自动化索引调优中的这些挑战，并探讨机器学习（ML）技术为其缓解提供的新机遇。具体而言，我们回顾了近年来在以下方面的研究努力：利用ML技术进行工作负载选择、候选索引过滤、加速索引配置搜索、减少查询优化器调用次数、以及降低性能回退风险。我们总结了这些研究的关键启示，并指出其在传统索引调优框架内有效运行时仍需弥合的差距。此外，我们提出了一项跨平台初步设计方案，旨在推动索引调优在多种类SQL系统中的普及——这在当今不断扩展的数据系统格局中至关重要。我们相信，本文的发现将为自动化索引调优的研究与开发工作提供背景与动力。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日