TimeTuner: Diagnosing Time Representations for Time-Series Forecasting with Counterfactual Explanations

Deep learning (DL) approaches are being increasingly used for time-series forecasting, with many efforts devoted to designing complex DL models. Recent studies have shown that the DL success is often attributed to effective data representations, fostering the fields of feature engineering and representation learning. However, automated approaches for feature learning are typically limited with respect to incorporating prior knowledge, identifying interactions among variables, and choosing evaluation metrics to ensure that the models are reliable. To improve on these limitations, this paper contributes a novel visual analytics framework, namely TimeTuner, designed to help analysts understand how model behaviors are associated with localized correlations, stationarity, and granularity of time-series representations. The system mainly consists of the following two-stage technique: We first leverage counterfactual explanations to connect the relationships among time-series representations, multivariate features and model predictions. Next, we design multiple coordinated views including a partition-based correlation matrix and juxtaposed bivariate stripes, and provide a set of interactions that allow users to step into the transformation selection process, navigate through the feature space, and reason the model performance. We instantiate TimeTuner with two transformation methods of smoothing and sampling, and demonstrate its applicability on real-world time-series forecasting of univariate sunspots and multivariate air pollutants. Feedback from domain experts indicates that our system can help characterize time-series representations and guide the feature engineering processes.

翻译：深度学习（DL）方法正越来越多地被应用于时间序列预测，大量研究致力于设计复杂的深度学习模型。近期研究表明，深度学习的成功常归因于有效的数据表示，这促进了特征工程和表示学习领域的发展。然而，自动化的特征学习方法在融入先验知识、识别变量间的交互作用以及选择评估指标以确保模型可靠性等方面通常存在局限性。为改进这些不足，本文提出一种新颖的可视分析框架TimeTuner，旨在帮助分析人员理解模型行为如何与时间序列表示的局部相关性、平稳性及粒度相关联。该系统主要由以下两阶段技术构成：首先，利用反事实解释将时间序列表示、多变量特征与模型预测之间的关联进行联结；其次，设计了包括基于分区的相关矩阵和并置双变量条纹在内的多协调视图，并提供一组交互方式，使用户能够参与变换选择过程、在特征空间中导航并推理模型性能。我们通过平滑和采样两种变换方法实例化TimeTuner，并在单变量太阳黑子与多变量空气污染物这两种真实世界时间序列预测任务中展示其适用性。领域专家的反馈表明，我们的系统能够帮助表征时间序列表示并指导特征工程流程。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日