Covering Number of Real Algebraic Varieties and Beyond: Improved Bounds and Applications

Covering numbers are a powerful tool used in the development of approximation algorithms, randomized dimension reduction methods, smoothed complexity analysis, and others. In this paper we prove upper bounds on the covering number of numerous sets in Euclidean space, namely real algebraic varieties, images of polynomial maps and semialgebraic sets in terms of the number of variables and degrees of the polynomials involved. The bounds remarkably improve the best known general bound by Yomdin-Comte, and our proof is much more straightforward. In particular, our result gives new bounds on the volume of the tubular neighborhood of the image of a polynomial map and a semialgebraic set, where results for varieties by Lotz and Basu-Lerario are not directly applicable. We illustrate the power of the result on three computational applications. Firstly, we derive a near-optimal bound on the covering number of low rank CP tensors, quantifying their approximation properties and filling in an important missing piece of theory for tensor dimension reduction and reconstruction. Secondly, we prove a bound on the required dimension for the randomized sketching of polynomial optimization problems, which controls how much computation can be saved through randomization without sacrificing solution quality. Finally, we deduce generalization error bounds for deep neural networks with rational or ReLU activation functions, improving or matching the best known results in the machine learning literature while helping to quantify the impact of architecture choice on generalization error.

翻译：覆盖数是近似算法、随机降维方法、平滑复杂度分析等领域发展中的有力工具。本文证明了欧氏空间中若干集合（如实代数簇、多项式映射像及半代数集）覆盖数的上界，这些界由所涉变量个数及多项式次数决定。我们的界显著改进了Yomdin-Comte提出的最佳通用界，且证明过程更为直接。特别地，我们的结果为多项式映射像和半代数集的管状邻域体积给出了新界，而Lotz与Basu-Lerario关于代数簇的结果在此类情形中并不直接适用。我们通过三个计算应用展示了该结果的威力：首先，导出了低秩CP张量覆盖数的近最优界，量化了其近似性质，填补了张量降维与重建理论中一个重要的缺失环节；其次，证明了多项式优化问题随机草图技术所需维度的界，揭示了在保证解质量的前提下随机化可节省的计算量；最后，推导了具有有理或ReLU激活函数的深度神经网络的泛化误差界，改进或匹配了机器学习文献中的最佳已知结果，同时有助于量化网络架构选择对泛化误差的影响。

相关内容

泛化误差

关注 107

学习方法的泛化能力（Generalization Error）是由该方法学习到的模型对未知数据的预测能力，是学习方法本质上重要的性质。现实中采用最多的办法是通过测试泛化误差来评价学习方法的泛化能力。泛化误差界刻画了学习算法的经验风险与期望风险之间偏差和收敛速度。一个机器学习的泛化误差（Generalization Error），是一个描述学生机器在从样品数据中学习之后，离教师机器之间的差距的函数。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日