Time Series Predictions in Unmonitored Sites: A Survey of Machine Learning Techniques in Water Resources

Prediction of dynamic environmental variables in unmonitored sites remains a long-standing challenge for water resources science. The majority of the world's freshwater resources have inadequate monitoring of critical environmental variables needed for management. Yet, the need to have widespread predictions of hydrological variables such as river flow and water quality has become increasingly urgent due to climate and land use change over the past decades, and their associated impacts on water resources. Modern machine learning methods increasingly outperform their process-based and empirical model counterparts for hydrologic time series prediction with their ability to extract information from large, diverse data sets. We review relevant state-of-the art applications of machine learning for streamflow, water quality, and other water resources prediction and discuss opportunities to improve the use of machine learning with emerging methods for incorporating watershed characteristics into deep learning models, transfer learning, and incorporating process knowledge into machine learning models. The analysis here suggests most prior efforts have been focused on deep learning learning frameworks built on many sites for predictions at daily time scales in the United States, but that comparisons between different classes of machine learning methods are few and inadequate. We identify several open questions for time series predictions in unmonitored sites that include incorporating dynamic inputs and site characteristics, mechanistic understanding and spatial context, and explainable AI techniques in modern machine learning frameworks.

翻译：无监测站点动态环境变量的预测仍是水资源科学中长期存在的挑战。全球大部分淡水资源对管理所需的关键环境变量缺乏充分监测。然而，由于过去数十年气候与土地利用变化及其对水资源的相关影响，对河流流量、水质等水文变量进行广泛预测的需求日益迫切。现代机器学习方法凭借其从大规模多样化数据集中提取信息的能力，在水文时间序列预测中日益超越基于过程的模型与经验模型。本文综述了机器学习在径流、水质及其他水资源预测中的前沿应用，探讨了通过新兴技术提升机器学习效能的机遇，包括将流域特征融入深度学习模型、迁移学习以及将过程知识整合至机器学习模型。分析表明，现有研究主要集中于基于多站点日尺度预测的深度学习框架（以美国为主），但不同类别机器学习方法间的比较研究仍显不足且不够充分。我们提出了无监测站点时间序列预测中若干待解问题，包括动态输入与站点特征的融合、机理理解与空间背景的结合，以及现代机器学习框架中可解释人工智能技术的应用。

相关内容

Machine Learning

关注 2249

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日