Recent years have witnessed the rapid development of immersive multimedia, which bridges the gap between the real world and virtual space. Volumetric video, an emerging 3D video paradigm that empowers extended reality, stands out by offering an unprecedented immersive and interactive viewing experience. Despite this potential, research on 3D volumetric video is still in its infancy and depends on sufficient and complete datasets for further exploration. However, most existing volumetric video datasets contain only a single object and lack details about the scene and about how the object interacts with it. In this paper, we focus on the point cloud, currently the most widely used data format, and release, for the first time, a full-scene volumetric video dataset that includes multiple people and their daily activities interacting with the external environment. We provide a comprehensive description and analysis of the dataset and discuss its potential uses. The dataset and additional tools can be accessed at https://cuhksz-inml.github.io/full_scene_volumetric_video_dataset/.