Advanced Deep Regression Models for Forecasting Time Series Oil Production

Global oil demand is rapidly increasing and is expected to reach 106.3 million barrels per day by 2040. Thus, it is vital for hydrocarbon extraction industries to forecast their production to optimize their operations and avoid losses. Big companies have realized that exploiting the power of deep learning (DL) and the massive amount of data from various oil wells for this purpose can save a lot of operational costs and reduce unwanted environmental impacts. In this direction, researchers have proposed models using conventional machine learning (ML) techniques for oil production forecasting. However, these techniques are inappropriate for this problem as they can not capture historical patterns found in time series data, resulting in inaccurate predictions. This research aims to overcome these issues by developing advanced data-driven regression models using sequential convolutions and long short-term memory (LSTM) units. Exhaustive analyses are conducted to select the optimal sequence length, model hyperparameters, and cross-well dataset formation to build highly generalized robust models. A comprehensive experimental study on Volve oilfield data validates the proposed models. It reveals that the LSTM-based sequence learning model can predict oil production better than the 1-D convolutional neural network (CNN) with mean absolute error (MAE) and R2 score of 111.16 and 0.98, respectively. It is also found that the LSTM-based model performs better than all the existing state-of-the-art solutions and achieves a 37% improvement compared to a standard linear regression, which is considered the baseline model in this work.

翻译：全球石油需求正在迅速增长，预计到2040年将达到每天1.063亿桶。因此，碳氢化合物开采行业必须对其产量进行预测，以优化运营并避免损失。大型企业已经意识到，利用深度学习（DL）的强大能力以及来自不同油井的大量数据进行预测，可以节省大量运营成本并减少不良环境影响。为此，研究人员提出了使用传统机器学习（ML）技术的油产量预测模型。然而，这些技术不适用于此问题，因为它们无法捕捉时间序列数据中的历史模式，导致预测不准确。本研究旨在通过开发基于序列卷积和长短期记忆（LSTM）单元的高级数据驱动回归模型来克服这些问题。通过详尽的分析来选择最佳序列长度、模型超参数以及跨井数据集构建方法，从而建立高度泛化的稳健模型。在Volve油田数据上进行的全面实验研究验证了所提出的模型。结果表明，基于LSTM的序列学习模型在油产量预测方面优于一维卷积神经网络（CNN），其平均绝对误差（MAE）和R²分数分别为111.16和0.98。研究还发现，基于LSTM的模型性能优于所有现有的最先进解决方案，相较于作为本工作基线模型的标准线性回归，实现了37%的提升。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日