TIC-TAC: A Framework To Learn And Evaluate Your Covariance

We study the problem of unsupervised heteroscedastic covariance estimation, where the goal is to learn the multivariate target distribution $\mathcal{N}(y, \Sigma_y | x )$ given an observation $x$. This problem is particularly challenging because $\Sigma_{y}$ varies across samples (heteroscedastic) and no annotation for the covariance is available (unsupervised). Typically, state-of-the-art methods predict the mean $f_{\theta}(x)$ and covariance $\textrm{Cov}(f_{\theta}(x))$ of the target distribution through two neural networks trained using the negative log-likelihood. This raises two questions: (1) Does the predicted covariance truly capture the randomness of the predicted mean? (2) In the absence of ground-truth annotation, how can we quantify the performance of covariance estimation? We address (1) by deriving TIC: Taylor Induced Covariance, which captures the randomness of the multivariate $f_{\theta}(x)$ by incorporating its gradient and curvature around $x$ through the second-order Taylor polynomial. We tackle (2) by introducing TAC: Task Agnostic Correlations, a metric that leverages conditioning of the normal distribution to evaluate the covariance. We verify the effectiveness of TIC through multiple experiments spanning synthetic (univariate, multivariate) and real-world datasets (UCI Regression, LSP, and MPII Human Pose Estimation). Our experiments show that TIC outperforms state-of-the-art methods in accurately learning the covariance, as quantified through TAC.
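To make the idea behind question (2) concrete, the sketch below illustrates how conditioning a normal distribution can score a covariance without covariance labels. It is not the authors' implementation: the abstract does not spell out the TAC protocol, so the sketch assumes that one target dimension is observed at a time, that the remaining means are updated with the standard identity $\mu_{u|i} = \mu_u + \Sigma_{ui} \Sigma_{ii}^{-1} (y_i - \mu_i)$, and that the score is the mean absolute error of the updated means. The function names `condition_on_dimension` and `conditioning_error` and the toy numbers are hypothetical.

```python
def condition_on_dimension(mean, cov, y_true, i):
    """Update the predicted mean after observing the true value of dimension i.

    Applies the Gaussian conditioning identity
        mu_{u|i} = mu_u + Sigma_{u,i} / Sigma_{i,i} * (y_i - mu_i)
    to every unobserved dimension u != i.
    """
    residual = y_true[i] - mean[i]
    updated = list(mean)
    for u in range(len(mean)):
        if u == i:
            updated[u] = y_true[i]  # the observed dimension is known exactly
        else:
            updated[u] = mean[u] + cov[u][i] / cov[i][i] * residual
    return updated


def conditioning_error(mean, cov, y_true):
    """Assumed TAC-style score (hypothetical): mean absolute error of the
    unobserved dimensions, averaged over the choice of observed dimension."""
    n = len(mean)
    total = 0.0
    for i in range(n):
        updated = condition_on_dimension(mean, cov, y_true, i)
        total += sum(abs(updated[u] - y_true[u]) for u in range(n) if u != i)
    return total / (n * (n - 1))


# Toy example: both target dimensions err in the same direction.
mean = [0.0, 0.0]
y_true = [1.0, 1.0]
good_cov = [[1.0, 0.9], [0.9, 1.0]]  # models the positive correlation
bad_cov = [[1.0, 0.0], [0.0, 1.0]]   # ignores the correlation

print(conditioning_error(mean, good_cov, y_true))  # small error
print(conditioning_error(mean, bad_cov, y_true))   # larger error
```

Under these assumptions, a covariance that captures the true correlations moves the unobserved means toward their ground truth, so it receives a lower error than a covariance that ignores them, and only the target annotations $y$ are needed.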

