Towards Persistent Memory based Stateful Serverless Computing for Big Data Applications

The Function-as-a-service (FaaS) computing model has recently seen significant growth especially for highly scalable, event-driven applications. The easy-to-deploy and cost-efficient fine-grained billing of FaaS is highly attractive to big data applications. However, the stateless nature of serverless platforms poses major challenges when supporting stateful I/O intensive workloads such as a lack of native support for stateful execution, state sharing, and inter-function communication. In this paper, we explore the feasibility of performing stateful big data analytics on serverless platforms and improving I/O throughput of functions by using modern storage technologies such as Intel Optane DC Persistent Memory (PMEM). To this end, we propose Marvel, an end-to-end architecture built on top of the popular serverless platform, Apache OpenWhisk and Apache Hadoop. Marvel makes two main contributions: (1) enable stateful function execution on OpenWhisk by maintaining state information in an in-memory caching layer; and (2) provide access to PMEM backed HDFS storage for faster I/O performance. Our evaluation shows that Marvel reduces the overall execution time of big data applications by up to 86.6% compared to current MapReduce implementations on AWS Lambda.

翻译：函数即服务（FaaS）计算模型近期在高度可扩展的事件驱动型应用领域呈现出显著增长态势。其易于部署、成本低廉的细粒度计费模式对大数据应用极具吸引力。然而，无服务器平台的"无状态"特性在支持有状态的I/O密集型工作负载时面临重大挑战，主要表现为缺乏对有状态执行、状态共享以及函数间通信的原生支持。本文探究了在无服务器平台上执行有状态大数据分析、并通过现代存储技术（如英特尔傲腾数据中心持久化内存PMEM）提升函数I/O吞吐量的可行性。为此，我们提出了Marvel架构——一种基于主流无服务器平台Apache OpenWhisk与Apache Hadoop构建的端到端解决方案。Marvel的核心贡献包括：（1）通过将状态信息维持在内存缓存层，在OpenWhisk上实现有状态函数执行；（2）提供基于PMEM的HDFS存储访问，实现更快的I/O性能。实验评估表明，与当前AWS Lambda上的MapReduce实现相比，Marvel将大数据应用的总体执行时间最高降低86.6%。

相关内容

大数据

关注 270

从各种各样类型的数据中，快速获得有价值信息的能力，就是大数据技术。明白这一点至关重要，也正是这一点促使该技术具备走向众多企业的潜力。大数据的4个“V”，或者说特点有四个层面：第一，数据体量巨大。从TB级别，跃升到PB级别；第二，数据类型繁多。前文提到的网络日志、视频、图片、地理位置信息等等。第三，价值密度低。以视频为例，连续不间断监控过程中，可能有用的数据仅仅有一两秒。第四，处理速度快。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日