This work represents a large step into modern ways of fast 3D reconstruction based on RGB camera images. Utilizing a Microsoft HoloLens 2 as a multisensor platform that includes an RGB camera and an inertial measurement unit for SLAM-based camera-pose determination, we train a Neural Radiance Field (NeRF) as a neural scene representation in real-time with the acquired data from the HoloLens. The HoloLens is connected via Wifi to a high-performance PC that is responsible for the training and 3D reconstruction. After the data stream ends, the training is stopped and the 3D reconstruction is initiated, which extracts a point cloud of the scene. With our specialized inference algorithm, five million scene points can be extracted within 1 second. In addition, the point cloud also includes radiometry per point. Our method of 3D reconstruction outperforms grid point sampling with NeRFs by multiple orders of magnitude and can be regarded as a complete real-time 3D reconstruction method in a mobile mapping setup.
翻译:本研究代表了基于RGB相机图像实现快速三维重建的现代方法迈出的重要一步。利用微软HoloLens 2作为多传感器平台(其包含RGB相机和惯性测量单元,用于基于SLAM的相机姿态确定),我们通过从HoloLens获取的数据,实时训练神经辐射场(NeRF)作为神经场景表示。HoloLens通过WiFi连接至高算力PC,该PC负责训练与三维重建。数据流结束后,训练终止并启动三维重建,从中提取场景点云。借助我们专用的推理算法,可在1秒内提取五百万个场景点。此外,点云还包含每个点的辐射度量信息。我们的三维重建方法在性能上以多个数量级超越基于NeRF的网格点采样,可被视为移动测绘配置中完整的实时三维重建方法。