面向任务相关植物部位高效搜索与检测的语义感知最优视点规划 (Semantics-Aware Next-best-view Planning for Efficient Search and Detection of Task-relevant Plant Parts)

Searching and detecting the task-relevant parts of plants is important to automate harvesting and de-leafing of tomato plants using robots. This is challenging due to high levels of occlusion in tomato plants. Active vision is a promising approach in which the robot strategically plans its camera viewpoints to overcome occlusion and improve perception accuracy. However, current active-vision algorithms cannot differentiate between relevant and irrelevant plant parts and spend time on perceiving irrelevant plant parts. This work proposed a semantics-aware active-vision strategy that uses semantic information to identify the relevant plant parts and prioritise them during view planning. The proposed strategy was evaluated on the task of searching and detecting the relevant plant parts using simulation and real-world experiments. In simulation experiments, the semantics-aware strategy proposed could search and detect 81.8% of the relevant plant parts using nine viewpoints. It was significantly faster and detected more plant parts than predefined, random, and volumetric active-vision strategies that do not use semantic information. The strategy proposed was also robust to uncertainty in plant and plant-part positions, plant complexity, and different viewpoint-sampling strategies. In real-world experiments, the semantics-aware strategy could search and detect 82.7% of the relevant plant parts using seven viewpoints, under complex greenhouse conditions with natural variation and occlusion, natural illumination, sensor noise, and uncertainty in camera poses. The results of this work clearly indicate the advantage of using semantics-aware active vision for targeted perception of plant parts and its applicability in the real world. It can significantly improve the efficiency of automated harvesting and de-leafing in tomato crop production.

翻译：在番茄植株自动化采收与去叶作业中，搜索并检测任务相关的植物部位至关重要。由于番茄植株存在高度遮挡，该任务极具挑战性。主动视觉是一种前景广阔的方法，机器人通过策略性地规划相机视点以克服遮挡并提升感知精度。然而，现有主动视觉算法无法区分相关与无关植物部位，导致大量时间耗费在对无关部位的感知上。本研究提出一种语义感知的主动视觉策略，该策略利用语义信息识别相关植物部位并在视点规划中予以优先考虑。通过仿真与真实环境实验，对所提策略在搜索与检测相关植物部位任务中的性能进行了评估。仿真实验中，所提出的语义感知策略仅需九个视点即可搜索并检测到81.8%的相关植物部位。相较于未使用语义信息的预定义、随机及体素化主动视觉策略，该策略速度显著更快且检测到的植物部位更多。该策略对植株及部位位置不确定性、植株复杂度以及不同视点采样策略均表现出良好鲁棒性。在真实环境实验中，面对温室复杂条件下存在的自然形态变异、遮挡、自然光照、传感器噪声及相机位姿不确定性，语义感知策略仅用七个视点即可搜索并检测到82.7%的相关植物部位。本研究结果清晰表明，语义感知主动视觉在植物部位目标感知方面具有显著优势，且具备实际应用可行性。该技术可显著提升番茄作物生产中自动化采收与去叶作业的效率。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日