HPC: Hierarchical Progressive Coding Framework for Volumetric Video

Volumetric video based on Neural Radiance Field (NeRF) holds vast potential for various 3D applications, but its substantial data volume poses significant challenges for compression and transmission. Current NeRF compression lacks the flexibility to adjust video quality and bitrate within a single model for various network and device capacities. To address these issues, we propose HPC, a novel hierarchical progressive volumetric video coding framework achieving variable bitrate using a single model. Specifically, HPC introduces a hierarchical representation with a multi-resolution residual radiance field to reduce temporal redundancy in long-duration sequences while simultaneously generating various levels of detail. Then, we propose an end-to-end progressive learning approach with a multi-rate-distortion loss function to jointly optimize both hierarchical representation and compression. Our HPC trained only once can realize multiple compression levels, while the current methods need to train multiple fixed-bitrate models for different rate-distortion (RD) tradeoffs. Extensive experiments demonstrate that HPC achieves flexible quality levels with variable bitrate by a single model and exhibits competitive RD performance, even outperforming fixed-bitrate models across various datasets.

翻译：基于神经辐射场（NeRF）的体视频在各类三维应用中具有巨大潜力，但其庞大的数据量对压缩与传输提出了重大挑战。现有的NeRF压缩方法缺乏灵活性，难以在单一模型中根据不同的网络与设备能力调整视频质量与码率。为解决这些问题，我们提出了HPC，一种新颖的分层渐进式体视频编码框架，能够使用单一模型实现可变码率。具体而言，HPC引入了一种分层表示结构，通过多分辨率残差辐射场来减少长时序列的时间冗余，同时生成多种细节层次。随后，我们提出一种端到端的渐进式学习方法，结合多码率-失真损失函数，对分层表示与压缩过程进行联合优化。我们的HPC仅需训练一次即可实现多个压缩等级，而现有方法需要为不同的码率-失真（RD）权衡训练多个固定码率模型。大量实验表明，HPC通过单一模型实现了可变码率下的灵活质量等级，并展现出具有竞争力的RD性能，甚至在多个数据集上优于固定码率模型。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日