On the connection between least squares, regularization, and classical shadows

Classical shadows (CS) offer a resource-efficient means to estimate quantum observables, circumventing the need for exhaustive state tomography. Here, we clarify and explore the connection between CS techniques and least squares (LS) and regularized least squares (RLS) methods commonly used in machine learning and data analysis. By formal identification of LS and RLS ``shadows'' completely analogous to those in CS -- namely, point estimators calculated from the empirical frequencies of single measurements -- we show that both RLS and CS can be viewed as regularizers for the underdetermined regime, replacing the pseudoinverse with invertible alternatives. Through numerical simulations, we evaluate RLS and CS from three distinct angles: the tradeoff in bias and variance, mismatch between the expected and actual measurement distributions, and the interplay between the number of measurements and number of shots per measurement. Compared to CS, RLS attains lower variance at the expense of bias, is robust to distribution mismatch, and is more sensitive to the number of shots for a fixed number of state copies -- differences that can be understood from the distinct approaches taken to regularization. Conceptually, our integration of LS, RLS, and CS under a unifying ``shadow'' umbrella aids in advancing the overall picture of CS techniques, while practically our results highlight the tradeoffs intrinsic to these measurement approaches, illuminating the circumstances under which either RLS or CS would be preferred, such as unverified randomness for the former or unbiased estimation for the latter.

翻译：经典阴影（CS）提供了一种资源高效的方式来估计量子观测量，免除了穷尽态层析的需求。本文阐明并探讨了CS技术与机器学习及数据分析中常用的最小二乘法（LS）和正则化最小二乘法（RLS）之间的联系。通过形式化识别LS和RLS的“阴影”——即完全类似于CS中根据单次测量经验频率计算得到的点估计量——我们证明RLS和CS均可视为欠定情形下的正则化器，将伪逆替换为可逆替代方案。通过数值模拟，我们从三个不同角度评估RLS和CS：偏差与方差的权衡、期望测量分布与实际测量分布的失配、以及测量次数与每次测量采样次数之间的相互作用。与CS相比，RLS以偏差为代价获得更低的方差，对分布失配具有鲁棒性，并且在固定状态副本数下对单次测量采样次数更为敏感——这些差异可通过正则化方法的不同路径来理解。在概念上，我们将LS、RLS和CS统一到“阴影”框架下，有助于推进对CS技术整体图景的认识；在实践中，我们的结果凸显了这些测量方法固有的权衡关系，阐明RLS或CS各自适用的情况，例如前者适用于未经验证的随机性，后者则适用于无偏估计。

相关内容

计算机科学

关注 56

计算机科学（Computer Science, CS）是系统性研究信息与计算的理论基础以及它们在计算机系统中如何实现与应用的实用技术的学科。它通常被形容为对那些创造、描述以及转换信息的算法处理的系统研究。计算机科学包含很多分支领域；其中一些，比如计算机图形学强调特定结果的计算，而另外一些，比如计算复杂性理论是学习计算问题的性质。还有一些领域专注于挑战怎样实现计算。比如程序设计语言理论学习描述计算的方法，而程序设计是应用特定的程序设计语言解决特定的计算问题，人机交互则是专注于挑战怎样使计算机和计算变得有用、可用，以及随时随地为人所用。 现代计算机科学( Computer Science)包含理论计算机科学和应用计算机科学两大分支。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日