KAN 我们能流吗？通过 KAN 与 RWKV 的 3D 流匹配推进机器人操作 (KAN We Flow? Advancing Robotic Manipulation with 3D Flow Matching via KAN & RWKV)

Diffusion-based visuomotor policies excel at modeling action distributions but are inference-inefficient, since recursively denoising from noise to policy requires many steps and heavy UNet backbones, which hinders deployment on resource-constrained robots. Flow matching alleviates the sampling burden by learning a one-step vector field, yet prior implementations still inherit large UNet-style architectures. In this work, we present KAN-We-Flow, a flow-matching policy that draws on recent advances in Receptance Weighted Key Value (RWKV) and Kolmogorov-Arnold Networks (KAN) from vision to build a lightweight and highly expressive backbone for 3D manipulation. Concretely, we introduce an RWKV-KAN block: an RWKV first performs efficient time/channel mixing to propagate task context, and a subsequent GroupKAN layer applies learnable spline-based, groupwise functional mappings to perform feature-wise nonlinear calibration of the action mapping on RWKV outputs. Moreover, we introduce an Action Consistency Regularization (ACR), a lightweight auxiliary loss that enforces alignment between predicted action trajectories and expert demonstrations via Euler extrapolation, providing additional supervision to stabilize training and improve policy precision. Without resorting to large UNets, our design reduces parameters by 86.8\%, maintains fast runtime, and achieves state-of-the-art success rates on Adroit, Meta-World, and DexArt benchmarks. Our project page can be viewed in \href{https://zhihaochen-2003.github.io/KAN-We-Flow.github.io/}{\textcolor{red}{link}}

翻译：基于扩散的视觉运动策略在建模动作分布方面表现出色，但其推理效率低下，因为从噪声到策略的递归去噪需要许多步骤和沉重的 UNet 骨干网络，这阻碍了其在资源受限的机器人上的部署。流匹配通过学习一个单步向量场减轻了采样负担，但先前的实现仍然继承了大型 UNet 风格的架构。在这项工作中，我们提出了 KAN-We-Flow，这是一种流匹配策略，它借鉴了视觉领域中接收加权键值（RWKV）和柯尔莫哥洛夫-阿诺德网络（KAN）的最新进展，为 3D 操作构建了一个轻量级且高度表达性的骨干网络。具体来说，我们引入了一个 RWKV-KAN 模块：RWKV 首先执行高效的时序/通道混合以传播任务上下文，随后一个 GroupKAN 层应用基于可学习样条的、分组函数映射，对 RWKV 输出上的动作映射进行特征级非线性校准。此外，我们引入了动作一致性正则化（ACR），这是一种轻量级的辅助损失，通过欧拉外推法强制预测的动作轨迹与专家演示之间的一致性，为稳定训练和提高策略精度提供了额外的监督。在不依赖大型 UNet 的情况下，我们的设计将参数减少了 86.8%，保持了快速的运行时间，并在 Adroit、Meta-World 和 DexArt 基准测试中实现了最先进的成功率。我们的项目页面可在 \href{https://zhihaochen-2003.github.io/KAN-We-Flow.github.io/}{\textcolor{red}{链接}} 查看。