In this note, we provide a refined analysis of Mitra's algorithm \cite{mitra2008clustering} for classifying general discrete mixture distribution models. Built upon spectral clustering \cite{mcsherry2001spectral}, this algorithm offers compelling conditions for probability distributions. We enhance this analysis by tailoring the model to bipartite stochastic block models, resulting in more refined conditions. Compared to those derived in \cite{mitra2008clustering}, our improved separation conditions are obtained.
翻译:本文对Mitra算法\cite{mitra2008clustering}在分类一般离散混合分布模型时的性能进行了精细化分析。该算法建立在谱聚类\cite{mcsherry2001spectral}基础之上,为概率分布提供了具有说服力的分类条件。我们通过将模型特化为二分随机块模型来改进这一分析,从而得到更精细的判定条件。相较于\cite{mitra2008clustering}中推导的条件,我们获得了更优的分离性条件。