FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets

Visibility underwater is challenging, and degrades as the distance between the subject and camera increases, making vision tasks in the forward-looking direction more difficult. We have collected underwater forward-looking stereo-vision and visual-inertial image sets in the Mediterranean and Red Sea. To our knowledge there are no other public datasets in the underwater environment acquired with this camera-sensor orientation published with ground-truth. These datasets are critical for the development of several underwater applications, including obstacle avoidance, visual odometry, 3D tracking, Simultaneous Localization and Mapping (SLAM) and depth estimation. The stereo datasets include synchronized stereo images in dynamic underwater environments with objects of known-size. The visual-inertial datasets contain monocular images and IMU measurements, aligned with millisecond resolution timestamps and objects of known size which were placed in the scene. Both sensor configurations allow for scale estimation, with the calibrated baseline in the stereo setup and the IMU in the visual-inertial setup. Ground truth depth maps were created offline for both dataset types using photogrammetry. The ground truth is validated with multiple known measurements placed throughout the imaged environment. There are 5 stereo and 8 visual-inertial datasets in total, each containing thousands of images, with a range of different underwater visibility and ambient light conditions, natural and man-made structures and dynamic camera motions. The forward-looking orientation of the camera makes these datasets unique and ideal for testing underwater obstacle-avoidance algorithms and for navigation close to the seafloor in dynamic environments. With our datasets, we hope to encourage the advancement of autonomous functionality for underwater vehicles in dynamic and/or shallow water environments.

翻译：水下能见度极具挑战性，且随目标与相机距离增大而恶化，这使得前向视角下的视觉任务更加困难。我们在地中海和红海收集了水下前向立体视觉和视觉-惯性图像集。据我们所知，目前尚无其他公开的采用此相机-传感器方向采集的含真值水下环境数据集。这些数据集对于开发多种水下应用至关重要，包括障碍物规避、视觉里程计、三维追踪、同步定位与地图构建（SLAM）以及深度估计。立体数据集包含动态水下环境中带已知尺寸物体的同步立体图像。视觉-惯性数据集包含单目图像和IMU测量值，数据间对齐精度达毫秒级时间戳，场景中放置了已知尺寸的物体。两种传感器配置均可实现尺度估计——立体设置利用标定基线，视觉-惯性设置利用IMU。两类数据集的真值深度图均通过摄影测量法离线生成，并通过遍布成像环境的多个已知测量结果进行验证。数据集总计包含5个立体数据集和8个视觉-惯性数据集，每个数据集包含数千张图像，涵盖不同水下能见度、环境光照条件、自然与人造结构以及动态相机运动模式。相机的前向拍摄方向使这些数据集具有独特性，特别适用于测试水下障碍物规避算法以及在动态环境中近海底导航。我们希望通过这些数据集，推动水下机器人在动态和/或浅水环境中的自主功能发展。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日