Over the last several years, the computation landscape for conducting data analytics has completely changed. While in the past, a lot of the activities have been undertaken in isolation by companies, and research institutions, today's infrastructure constitutes a wealth of services offered by a variety of providers that offer opportunities for reuse, and interactions while leveraging service collaboration, and service cooperation. This document focuses on expanding analytics services to develop a framework for reusable hybrid multi-service data analytics. It includes (a) a short technology review that explicitly targets the intersection of hybrid multi-provider analytics services, (b) a small motivation based on use cases we looked at, (c) enhancing the concepts of services to showcase how hybrid, as well as multi-provider services can be integrated and reused via the proposed framework, (d) address analytics service composition, and (e) integrate container technologies to achieve state-of-the-art analytics service deployment
翻译:过去几年间,开展数据分析的计算格局已彻底改变。过去,大部分活动由企业和研究机构独立开展,而如今的基础设施构成了由各类服务提供商提供的丰富服务,这些服务在利用服务协作与合作的同时,为服务复用与交互提供了机遇。本文档聚焦于扩展分析服务,以构建一套可复用的混合多云服务数据分析框架。具体包括:(a) 针对混合多提供商分析服务交叉领域的技术综述;(b) 基于我们所研究用例的简要动机阐述;(c) 提升服务概念以展示如何通过所提框架集成与复用混合及多提供商服务;(d) 探讨分析服务组合问题;(e) 融入容器技术以实现前沿的分析服务部署。