Enhancing octree-based context models for point cloud geometry compression with attention-based child node number prediction

In point cloud geometry compression, most octreebased context models use the cross-entropy between the onehot encoding of node occupancy and the probability distribution predicted by the context model as the loss. This approach converts the problem of predicting the number (a regression problem) and the position (a classification problem) of occupied child nodes into a 255-dimensional classification problem. As a result, it fails to accurately measure the difference between the one-hot encoding and the predicted probability distribution. We first analyze why the cross-entropy loss function fails to accurately measure the difference between the one-hot encoding and the predicted probability distribution. Then, we propose an attention-based child node number prediction (ACNP) module to enhance the context models. The proposed module can predict the number of occupied child nodes and map it into an 8- dimensional vector to assist the context model in predicting the probability distribution of the occupancy of the current node for efficient entropy coding. Experimental results demonstrate that the proposed module enhances the coding efficiency of octree-based context models.

翻译：在点云几何压缩中，大多数基于八叉树的上下文模型使用节点占位独热编码与上下文模型预测的概率分布之间的交叉熵作为损失函数。该方法将预测占用子节点数量（回归问题）和位置（分类问题）的问题转化为一个255维的分类问题。因此，它无法准确度量独热编码与预测概率分布之间的差异。我们首先分析了交叉熵损失函数为何无法准确度量独热编码与预测概率分布之间的差异。随后，我们提出了一种基于注意力的子节点数量预测模块来增强上下文模型。该模块能够预测占用子节点的数量，并将其映射为一个8维向量，以辅助上下文模型预测当前节点占位的概率分布，从而实现高效熵编码。实验结果表明，所提模块有效提升了基于八叉树的上下文模型的编码效率。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

WWW 2024 | GraphTranslator: 将图模型对齐大语言模型

专知会员服务

27+阅读 · 2024年3月25日

Nat. Mach. Intel. | 基于广义模板的图形神经网络用于准确的有机反应性预测

专知会员服务

11+阅读 · 2022年9月18日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日