The best-arm identification (BAI) problem is one of the most fundamental problems in interactive machine learning, which has two flavors: the fixed-budget setting (FB) and the fixed-confidence setting (FC). For $K$-armed bandits with the unique best arm, the optimal sample complexities for both settings have been settled down, and they match up to logarithmic factors. This prompts an interesting research question about the generic, potentially structured BAI problems: Is FB harder than FC or the other way around? In this paper, we show that FB is no harder than FC up to logarithmic factors. We do this constructively: we propose a novel algorithm called FC2FB (fixed confidence to fixed budget), which is a meta algorithm that takes in an FC algorithm $\mathcal{A}$ and turn it into an FB algorithm. We prove that this FC2FB enjoys a sample complexity that matches, up to logarithmic factors, that of the sample complexity of $\mathcal{A}$. This means that the optimal FC sample complexity is an upper bound of the optimal FB sample complexity up to logarithmic factors. Our result not only reveals a fundamental relationship between FB and FC, but also has a significant implication: FC2FB, combined with existing state-of-the-art FC algorithms, leads to improved sample complexity for a number of FB problems.
翻译:最优臂识别(BAI)问题是交互式机器学习中最基础的问题之一,其包含两种设定:固定预算(FB)与固定置信度(FC)。对于具有唯一最优臂的$K$臂赌博机,两种设定的最优样本复杂度均已确定,且它们在至多对数因子内相匹配。这引出了一个关于通用的、可能具有结构性的BAI问题的有趣研究课题:FB是否比FC更难,抑或相反?本文证明,在至多对数因子内,FB并不比FC更难。我们通过构造性方法证明这一点:提出了一种名为FC2FB(固定置信度转固定预算)的新算法,它是一种元算法,能够接收一个FC算法$\mathcal{A}$并将其转化为一个FB算法。我们证明,该FC2FB算法享有的样本复杂度与$\mathcal{A}$的样本复杂度在至多对数因子内相匹配。这意味着最优的FC样本复杂度在至多对数因子内是FB最优样本复杂度的上界。我们的结果不仅揭示了FB与FC之间的基本关系,还具有重要启示:FC2FB结合现有最先进的FC算法,能够为一系列FB问题带来改进的样本复杂度。