In Linear Hashing ($\mathsf{LH}$) with $\beta$ bins on a size-$u$ universe ${\mathcal{U}=\{0,1,\ldots, u-1\}}$, items $\{x_1,x_2,\ldots, x_n\}\subset \mathcal{U}$ are placed in bins by the hash function $$x_i\mapsto ((ax_i+b) \bmod p) \bmod \beta$$ for some prime $p\in [u,2u]$ and randomly chosen integers $a,b \in [1,p]$. The "maxload" of $\mathsf{LH}$ is the number of items assigned to the fullest bin. The expected maxload for a worst-case set of items is a natural measure of how well $\mathsf{LH}$ distributes items among the bins. Fix $\beta=n$. Despite the simplicity of $\mathsf{LH}$, bounding its worst-case maxload is extremely challenging. It is well known that on random inputs $\mathsf{LH}$ achieves maxload $\Omega\left(\frac{\log n}{\log\log n}\right)$; this is currently the best lower bound for the expected maxload of $\mathsf{LH}$. Recently, Knudsen established an upper bound of $\widetilde{O}(n^{1 / 3})$. The question "Is the worst-case expected maxload of $\mathsf{LH}$ $n^{o(1)}$?" is one of the most basic open problems in discrete math. In this paper, we propose a set of intermediate open questions to help researchers make progress on this problem. We establish the relationships between these intermediate open questions and make some partial progress on them.
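To make the definitions concrete, the following is a minimal Python sketch of the hash function and the maxload it induces. It is an illustration under stated assumptions rather than code from the paper: the helper names (`smallest_prime_at_least`, `lh_bin`, `maxload`, `estimate_expected_maxload`) are hypothetical, the prime is chosen as the smallest prime at least $u$ (which lies in $[u,2u]$ by Bertrand's postulate), and the expectation is only estimated by sampling $a,b$ for one fixed item set, which says nothing about the worst case.

```python
import random
from collections import Counter


def is_prime(m):
    """Trial-division primality test; adequate for small illustrative universes."""
    if m < 2:
        return False
    i = 2
    while i * i <= m:
        if m % i == 0:
            return False
        i += 1
    return True


def smallest_prime_at_least(u):
    """Return the smallest prime p >= u (assumed choice; it lies in [u, 2u])."""
    p = max(u, 2)
    while not is_prime(p):
        p += 1
    return p


def lh_bin(x, a, b, p, beta):
    """Bin assigned to item x by x -> ((a*x + b) mod p) mod beta."""
    return ((a * x + b) % p) % beta


def maxload(items, a, b, p, beta):
    """Number of items in the fullest bin for fixed parameters a, b."""
    counts = Counter(lh_bin(x, a, b, p, beta) for x in items)
    return max(counts.values())


def estimate_expected_maxload(items, u, trials=200, seed=0):
    """Monte Carlo estimate of the expected maxload with beta = n bins.

    a and b are drawn uniformly from [1, p], as in the abstract.
    items must be a non-empty collection of distinct integers in [0, u).
    """
    rng = random.Random(seed)
    p = smallest_prime_at_least(u)
    beta = len(items)  # the abstract fixes beta = n
    total = 0
    for _ in range(trials):
        a = rng.randint(1, p)
        b = rng.randint(1, p)
        total += maxload(items, a, b, p, beta)
    return total / trials
```

For example, `estimate_expected_maxload(list(range(0, 1000, 10)), u=1000)` estimates the expected maxload for one particular arithmetic progression of 100 items. Note that the range $[1,p]$ includes $a=p$, for which $ax_i \equiv 0 \pmod p$ and every item lands in the same bin.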