Motivated by the challenge of nonstationarity in sequential decision making, we study Online Convex Optimization (OCO) under the coupling of two problem structures: the domain is unbounded, and the comparator sequence $u_1,\ldots,u_T$ is arbitrarily time-varying. As no algorithm can guarantee low regret simultaneously against all comparator sequences, handling this setting requires moving from minimax optimality to comparator adaptivity. That is, sensible regret bounds should depend on certain complexity measures of the comparator relative to one's prior knowledge. This paper achieves a new type of these adaptive regret bounds via a sparse coding framework. The complexity of the comparator is measured by its energy and its sparsity on a user-specified dictionary, which offers considerable versatility. Equipped with a wavelet dictionary for example, our framework improves the state-of-the-art bound (Jacobsen & Cutkosky, 2022) by adapting to both ($i$) the magnitude of the comparator average $||\bar u||=||\sum_{t=1}^Tu_t/T||$, rather than the maximum $\max_t||u_t||$; and ($ii$) the comparator variability $\sum_{t=1}^T||u_t-\bar u||$, rather than the uncentered sum $\sum_{t=1}^T||u_t||$. Furthermore, our analysis is simpler due to decoupling function approximation from regret minimization.
翻译:受序列决策中非平稳性挑战的驱动,我们研究了在线凸优化(OCO)在两种问题结构耦合下的情形:域无界,且比较器序列 $u_1,\ldots,u_T$ 可任意时变。由于没有任何算法能保证对所有比较器序列同时实现低遗憾,处理这一设置需要从极小化极大最优性转向比较器适应性。也就是说,合理的遗憾上界应依赖于比较器相对于先验知识的某些复杂度度量。本文通过稀疏编码框架实现了一种新型的自适应遗憾界。比较器的复杂度由其能量以及在用户指定字典上的稀疏性衡量,这提供了相当大的灵活性。例如,配备小波字典后,我们的框架通过自适应于(i)比较器均值 $||\bar u||=||\sum_{t=1}^Tu_t/T||$ 的大小(而非最大值 $\max_t||u_t||$)以及(ii)比较器变异性 $\sum_{t=1}^T||u_t-\bar u||$(而非非中心化总和 $\sum_{t=1}^T||u_t||$),改进了现有最优界(Jacobsen & Cutkosky, 2022)。此外,由于将函数逼近与遗憾最小化解耦,我们的分析更为简洁。