Quantized Fourier and Polynomial Features for more Expressive Tensor Network Models

from arxiv, 9 pages, 4 figures. Reviewed version after peer-review. To be published in the proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS)

In the context of kernel machines, polynomial and Fourier features are commonly used to provide a nonlinear extension to linear models by mapping the data to a higher-dimensional space. Unless one considers the dual formulation of the learning problem, which renders exact large-scale learning unfeasible, the exponential increase of model parameters in the dimensionality of the data caused by their tensor-product structure prohibits to tackle high-dimensional problems. One of the possible approaches to circumvent this exponential scaling is to exploit the tensor structure present in the features by constraining the model weights to be an underparametrized tensor network. In this paper we quantize, i.e. further tensorize, polynomial and Fourier features. Based on this feature quantization we propose to quantize the associated model weights, yielding quantized models. We show that, for the same number of model parameters, the resulting quantized models have a higher bound on the VC-dimension as opposed to their non-quantized counterparts, at no additional computational cost while learning from identical features. We verify experimentally how this additional tensorization regularizes the learning problem by prioritizing the most salient features in the data and how it provides models with increased generalization capabilities. We finally benchmark our approach on large regression task, achieving state-of-the-art results on a laptop computer.

翻译：在核方法的背景下，多项式特征与傅里叶特征常被用于通过将数据映射到高维空间，为线性模型提供非线性扩展。然而，若考虑学习问题的对偶形式（这将导致大规模精确学习不可行），则由于张量乘积结构导致的模型参数随数据维度呈指数增长，阻碍了高维问题的处理。规避这种指数扩展的可行方法之一，是利用特征中存在的张量结构，将模型权重约束为欠参数化的张量网络。本文对多项式特征与傅里叶特征进行量化，即进一步实现张量化。基于这种特征量化，我们提出对相应模型权重进行量化，从而得到量化模型。研究表明：在相同模型参数数量下，相较于非量化对应模型，所得量化模型的VC维上界更高，且在学习相同特征时无需额外计算成本。我们通过实验验证了这种额外张量化如何通过优先保留数据中最显著的特征来规整化学习问题，以及如何提升模型的泛化能力。最终，我们在大规模回归任务上对方法进行基准测试，在笔记本电脑上达到了当前最优结果。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日