Towards Persistent Memory based Stateful Serverless Computing for Big Data Applications

The Function-as-a-service (FaaS) computing model has recently seen significant growth especially for highly scalable, event-driven applications. The easy-to-deploy and cost-efficient fine-grained billing of FaaS is highly attractive to big data applications. However, the stateless nature of serverless platforms poses major challenges when supporting stateful I/O intensive workloads such as a lack of native support for stateful execution, state sharing, and inter-function communication. In this paper, we explore the feasibility of performing stateful big data analytics on serverless platforms and improving I/O throughput of functions by using modern storage technologies such as Intel Optane DC Persistent Memory (PMEM). To this end, we propose Marvel, an end-to-end architecture built on top of the popular serverless platform, Apache OpenWhisk and Apache Hadoop. Marvel makes two main contributions: (1) enable stateful function execution on OpenWhisk by maintaining state information in an in-memory caching layer; and (2) provide access to PMEM backed HDFS storage for faster I/O performance. Our evaluation shows that Marvel reduces the overall execution time of big data applications by up to 86.6% compared to current MapReduce implementations on AWS Lambda.

翻译：函数即服务（FaaS）计算模式近年来取得了显著增长，尤其适用于高度可扩展的事件驱动型应用。FaaS易于部署且具有成本效益的细粒度计费方式对大数据应用极具吸引力。然而，无服务器平台的固有"无状态"特性在支持有状态I/O密集型工作负载时面临重大挑战，例如缺乏对有状态执行、状态共享及函数间通信的原生支持。本文探究了在无服务器平台上执行有状态大数据分析的可行性，并通过采用英特尔傲腾DC持久内存（PMEM）等现代存储技术来提升函数的I/O吞吐量。为此，我们提出了Marvel——一种基于流行无服务器平台Apache OpenWhisk与Apache Hadoop构建的端到端架构。Marvel包含两项主要贡献：（1）通过将状态信息维护在内存缓存层中，实现在OpenWhisk上的有状态函数执行；（2）提供对PMEM支持的HDFS存储的访问以实现更快的I/O性能。评估表明，与AWS Lambda上现有的MapReduce实现相比，Marvel可将大数据应用的整体执行时间降低多达86.6%。

相关内容

大数据

关注 270

从各种各样类型的数据中，快速获得有价值信息的能力，就是大数据技术。明白这一点至关重要，也正是这一点促使该技术具备走向众多企业的潜力。大数据的4个“V”，或者说特点有四个层面：第一，数据体量巨大。从TB级别，跃升到PB级别；第二，数据类型繁多。前文提到的网络日志、视频、图片、地理位置信息等等。第三，价值密度低。以视频为例，连续不间断监控过程中，可能有用的数据仅仅有一两秒。第四，处理速度快。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日