CountingFruit: Real-Time 3D Fruit Counting with Language-Guided Semantic Gaussian Splatting - 专知 (Zhuanzhi) Papers


June 1, 2025

CountingFruit: Real-Time 3D Fruit Counting with Language-Guided Semantic Gaussian Splatting


Fengze Li, Yangle Liu, Jieming Ma, Hai-Ning Liang, Yaochun Shen, Huangxiang Li, Zhijing Wu

Accurate fruit counting in real-world agricultural environments is a longstanding challenge due to visual occlusions, semantic ambiguity, and the high computational demands of 3D reconstruction. Existing methods based on neural radiance fields suffer from low inference speed, limited generalization, and a lack of support for open-set semantic control. This paper presents FruitLangGS, a real-time 3D fruit counting framework that addresses these limitations through spatial reconstruction, semantic embedding, and language-guided instance estimation. FruitLangGS first reconstructs orchard-scale scenes using an adaptive Gaussian splatting pipeline with radius-aware pruning and tile-based rasterization for efficient rendering. To enable semantic control, each Gaussian encodes a compressed CLIP-aligned language embedding, forming a compact and queryable 3D representation. At inference time, prompt-based semantic filtering is applied directly in 3D space, without relying on image-space segmentation or view-level fusion. The selected Gaussians are then converted into dense point clouds via distribution-aware sampling and clustered to estimate fruit counts. Experimental results on real orchard data demonstrate that FruitLangGS achieves higher rendering speed, greater semantic flexibility, and better counting accuracy than prior approaches, offering a new perspective for language-driven, real-time neural rendering across open-world scenarios.
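The three inference steps named in the abstract (prompt-based filtering in 3D, sampling selected Gaussians into a point cloud, and clustering to get a count) can be illustrated with a toy sketch. This is not the authors' implementation. The `Gaussian` class, the function names, the cosine threshold, the axis-aligned sampling that ignores rotation, and the simple radius-linkage clustering are all assumptions made for illustration, and the two-dimensional embeddings are stand-ins for real compressed CLIP-aligned features.

```python
import math
import random
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Gaussian:
    """Hypothetical minimal Gaussian primitive (not the paper's data layout)."""
    mean: Tuple[float, float, float]    # center in world space
    scale: Tuple[float, float, float]   # per-axis std dev; rotation is ignored here
    embedding: Tuple[float, ...]        # stand-in for a compressed language feature


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def filter_by_prompt(gaussians, prompt_embedding, threshold=0.8):
    """Keep Gaussians whose embedding is close to the prompt embedding.
    The threshold value is an assumed, hand-picked number."""
    return [g for g in gaussians
            if cosine(g.embedding, prompt_embedding) >= threshold]


def sample_points(gaussians, per_gaussian=20, rng=None):
    """Draw points from each Gaussian's (axis-aligned) distribution."""
    rng = rng or random.Random(0)
    points = []
    for g in gaussians:
        for _ in range(per_gaussian):
            points.append(tuple(rng.gauss(m, s)
                                for m, s in zip(g.mean, g.scale)))
    return points


def count_clusters(points, radius=0.05, min_points=10):
    """Link points closer than `radius` (union-find) and count the groups
    that contain at least `min_points` points. O(n^2), for small inputs only."""
    parent = list(range(len(points)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if math.dist(points[i], points[j]) <= radius:
                parent[find(i)] = find(j)

    sizes = {}
    for i in range(len(points)):
        root = find(i)
        sizes[root] = sizes.get(root, 0) + 1
    return sum(1 for n in sizes.values() if n >= min_points)


# Toy scene: two "fruit" Gaussians one unit apart and one "leaf" Gaussian.
scene = [
    Gaussian((0.0, 0.0, 0.0), (0.01, 0.01, 0.01), (0.9, 0.1)),
    Gaussian((1.0, 0.0, 0.0), (0.01, 0.01, 0.01), (0.9, 0.1)),
    Gaussian((0.5, 0.5, 0.0), (0.01, 0.01, 0.01), (0.1, 0.9)),
]
fruit_prompt = (1.0, 0.0)  # made-up embedding standing in for a text prompt

selected = filter_by_prompt(scene, fruit_prompt)
cloud = sample_points(selected)
print(count_clusters(cloud))
```

In this toy scene the leaf Gaussian is rejected by the similarity filter and the two remaining blobs are far apart relative to the linking radius, so the sketch reports two fruits. A real pipeline would sample from full anisotropic covariances and would likely use a spatially indexed clustering method rather than the quadratic loop shown here.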

