On the Power of Interactive Proofs for Learning

We continue the study of doubly-efficient proof systems for verifying agnostic PAC learning, for which we obtain the following results. - We construct an interactive protocol for learning the $t$ largest Fourier characters of a given function $f \colon \{0,1\}^n \to \{0,1\}$ up to an arbitrarily small error, wherein the verifier uses $\mathsf{poly}(t)$ random examples. This improves upon the Interactive Goldreich-Levin protocol of Goldwasser, Rothblum, Shafer, and Yehudayoff (ITCS 2021) whose sample complexity is $\mathsf{poly}(t,n)$. - For agnostically learning the class $\mathsf{AC}^0[2]$ under the uniform distribution, we build on the work of Carmosino, Impagliazzo, Kabanets, and Kolokolova (APPROX/RANDOM 2017) and design an interactive protocol, where given a function $f \colon \{0,1\}^n \to \{0,1\}$, the verifier learns the closest hypothesis up to $\mathsf{polylog}(n)$ multiplicative factor, using quasi-polynomially many random examples. In contrast, this class has been notoriously resistant even for constructing realisable learners (without a prover) using random examples. - For agnostically learning $k$-juntas under the uniform distribution, we obtain an interactive protocol, where the verifier uses $O(2^k)$ random examples to a given function $f \colon \{0,1\}^n \to \{0,1\}$. Crucially, the sample complexity of the verifier is independent of $n$. We also show that if we do not insist on doubly-efficient proof systems, then the model becomes trivial. Specifically, we show a protocol for an arbitrary class $\mathcal{C}$ of Boolean functions in the distribution-free setting, where the verifier uses $O(1)$ labeled examples to learn $f$.

翻译：我们继续研究用于验证agnostic PAC学习的双高效证明系统，并获得了以下结果： - 我们构建了一个交互式协议，用于学习给定函数$f \colon \{0,1\}^n \to \{0,1\}$的前$t$个最大傅里叶特征，误差可任意小，其中验证者使用$\mathsf{poly}(t)$个随机样本。这改进了Goldwasser、Rothblum、Shafer和Yehudayoff（ITCS 2021）的交互式Goldreich-Levin协议，其样本复杂度为$\mathsf{poly}(t,n)$。 - 对于均匀分布下agnostically学习$\mathsf{AC}^0[2]$类的问题，我们基于Carmosino、Impagliazzo、Kabanets和Kolokolova（APPROX/RANDOM 2017）的工作，设计了一个交互式协议。在该协议中，给定函数$f \colon \{0,1\}^n \to \{0,1\}$，验证者使用拟多项式数量的随机样本，学习到最接近假设的$\mathsf{polylog}(n)$乘法因子。相比之下，该类在仅使用随机样本构建可实现学习器（无需证明者）时一直极具挑战性。 - 对于均匀分布下agnostically学习$k$-junta的问题，我们获得了一个交互式协议，其中验证者使用$O(2^k)$个随机样本学习给定函数$f \colon \{0,1\}^n \to \{0,1\}$。关键在于，验证者的样本复杂度与$n$无关。我们还证明，如果不坚持双高效证明系统，则该模型变得平凡。具体而言，我们展示了在分布无关设定下任意布尔函数类$\mathcal{C}$的一个协议，其中验证者仅需$O(1)$个带标签样本即可学习$f$。