This paper considers an algorithm for cell classification based on a deep probabilistic architecture, the tree-structured sum-product network (t-SPN). The t-SPN is constructed so that the unnormalized probability is represented as conditional probabilities of a subset of the most similar cell classes. The constructed architecture is learned by maximizing the margin, defined as the difference in conditional probability between the true label and the most competitive false label. To improve generalization, L2 regularization (REG) is combined with the maximum-margin (MM) criterion during learning. To highlight cell features, the paper investigates two generic high-pass filters: ideal high-pass filtering and Laplacian of Gaussian (LOG) filtering. On both the HEp-2 and Feulgen benchmark datasets, the t-SPN learned with the max-margin criterion and regularization achieved higher accuracy than other state-of-the-art algorithms, including convolutional neural network (CNN) based algorithms. The ideal high-pass filter was more effective on the HEp-2 dataset, which is based on immunofluorescence staining, while the LOG filter was more effective on the Feulgen dataset, which is based on Feulgen staining.
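The margin described above can be illustrated with a short sketch. It assumes the conditional probabilities of each class are already available as a vector. The function names, the averaging over samples, and the penalty weight `lam` are hypothetical choices made for illustration; the abstract does not specify how the MM and REG terms are combined.

```python
import numpy as np


def margin(cond_probs, true_label):
    """Conditional probability of the true label minus that of the
    most competitive (highest-scoring) false label."""
    probs = np.asarray(cond_probs, dtype=float)
    competitors = np.delete(probs, true_label)  # drop the true class
    return float(probs[true_label] - competitors.max())


def regularized_objective(cond_probs_batch, labels, weights, lam=0.01):
    """Assumed combination: mean margin minus an L2 penalty on the
    weights. Larger values are better (the objective is maximized)."""
    margins = [margin(p, y) for p, y in zip(cond_probs_batch, labels)]
    l2_penalty = lam * float(np.sum(np.square(weights)))
    return float(np.mean(margins)) - l2_penalty
```

A positive margin means the true class outscores every competitor; a negative margin means at least one false label is preferred.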
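The two high-pass filters named in the abstract can be sketched in the frequency domain as follows. This is a minimal illustration, not the paper's preprocessing pipeline: the function names, the `cutoff` (in cycles per pixel) and `sigma` (in pixels) parameters, and the periodic boundary handling implied by the FFT are all assumptions.

```python
import numpy as np


def _squared_frequency_grid(shape):
    """Squared radial frequency (cycles/pixel)^2 for each FFT bin."""
    fy = np.fft.fftfreq(shape[0])[:, None]
    fx = np.fft.fftfreq(shape[1])[None, :]
    return fx ** 2 + fy ** 2


def ideal_high_pass(image, cutoff):
    """Zero every frequency at or below `cutoff`; keep the rest unchanged."""
    img = np.asarray(image, dtype=float)
    mask = np.sqrt(_squared_frequency_grid(img.shape)) > cutoff
    return np.real(np.fft.ifft2(np.fft.fft2(img) * mask))


def log_filter(image, sigma):
    """Laplacian of Gaussian applied as a frequency-domain multiplier:
    Gaussian smoothing exp(-2*pi^2*sigma^2*f^2) times the Laplacian
    response -(2*pi)^2 * f^2 (continuous-domain approximation)."""
    img = np.asarray(image, dtype=float)
    f2 = _squared_frequency_grid(img.shape)
    transfer = -(2 * np.pi) ** 2 * f2 * np.exp(-2 * np.pi ** 2 * sigma ** 2 * f2)
    return np.real(np.fft.ifft2(np.fft.fft2(img) * transfer))
```

Both filters suppress the constant (zero-frequency) component. The ideal filter has a hard cutoff, while the LOG response rolls off smoothly at high frequencies because of the Gaussian term. SciPy's `scipy.ndimage.gaussian_laplace` provides a spatial-domain alternative for the LOG filter.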