TimeTuner: Diagnosing Time Representations for Time-Series Forecasting with Counterfactual Explanations

Deep learning (DL) approaches are being increasingly used for time-series forecasting, with many efforts devoted to designing complex DL models. Recent studies have shown that the DL success is often attributed to effective data representations, fostering the fields of feature engineering and representation learning. However, automated approaches for feature learning are typically limited with respect to incorporating prior knowledge, identifying interactions among variables, and choosing evaluation metrics to ensure that the models are reliable. To improve on these limitations, this paper contributes a novel visual analytics framework, namely TimeTuner, designed to help analysts understand how model behaviors are associated with localized correlations, stationarity, and granularity of time-series representations. The system mainly consists of the following two-stage technique: We first leverage counterfactual explanations to connect the relationships among time-series representations, multivariate features and model predictions. Next, we design multiple coordinated views including a partition-based correlation matrix and juxtaposed bivariate stripes, and provide a set of interactions that allow users to step into the transformation selection process, navigate through the feature space, and reason the model performance. We instantiate TimeTuner with two transformation methods of smoothing and sampling, and demonstrate its applicability on real-world time-series forecasting of univariate sunspots and multivariate air pollutants. Feedback from domain experts indicates that our system can help characterize time-series representations and guide the feature engineering processes.

翻译：深度学习（DL）方法正越来越多地应用于时间序列预测，大量研究致力于设计复杂的深度学习模型。近期研究表明，深度学习的成功往往归因于有效的数据表示，这推动了特征工程和表示学习领域的发展。然而，自动化的特征学习方法在融入先验知识、识别变量间交互以及选择评估指标以确保模型可靠性方面通常存在局限性。为改进这些不足，本文提出一种名为TimeTuner的新型可视化分析框架，旨在帮助分析师理解模型行为如何与时间序列表示的局部相关性、平稳性和粒度相关联。该系统主要包含以下两阶段技术：首先，我们利用反事实解释来连接时间序列表示、多变量特征与模型预测之间的关系。其次，我们设计了多种协同视图（包括基于分区的相关性矩阵和并置双变量条纹图），并提供一系列交互操作，使用户能够介入变换选择过程、探索特征空间并推理模型性能。我们通过平滑和采样两种变换方法实例化TimeTuner，并在单变量太阳黑子及多变量空气污染物等真实世界时间序列预测中验证其适用性。领域专家反馈表明，本系统有助于刻画时间序列表示特征并指导特征工程流程。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日