Physical computing systems provide a promising route toward hardware-native machine learning, but their computational capabilities remain difficult to characterize in a principled, task-independent, and data-efficient way. We extend the Information Processing Capacity (IPC) framework to stationary physical computing systems and establish several fundamental results: individual capacities are bounded between zero and one, their sum over a complete basis is bounded by the number of readouts, and noise strictly reduces this bound. We address the finite-sample estimation of IPC and derive the asymptotic form of the systematic positive bias affecting naive estimators. Building on these results, we introduce data-efficient estimation methods based on Richardson extrapolation and Sobol quasi-random sampling. We validate the framework experimentally using a photonic computing system based on picosecond laser pulses propagating through a nonlinear optical fibre. By varying the laser power and fibre length, we observe systematic shifts of the IPC distribution toward higher-order nonlinear capacities induced by the Kerr effect. Finally, we demonstrate that the total IPC strongly correlates with performance on benchmark machine-learning tasks and provides a reliable estimate of the effective dimensionality of the system. These results establish IPC as a practical bridge between the intrinsic dynamics of physical computing systems and their machine-learning performance.
翻译:物理计算系统为硬件原生机器学习提供了一条有前景的路径,但其计算能力在原则性、任务无关且数据高效的刻画方面仍然困难。我们将在固定物理计算系统中扩展信息处理容量框架,并建立若干基础性结论:各容量介于零与一之间,其在完备基上的总和受读出处数量约束,噪声严格降低该总和。针对有限样本下信息处理容量的估计问题,我们推导了朴素估计器系统正向偏差的渐近形式。基于这些结果,引入基于理查森外推和索博尔拟随机采样的数据高效估计方法。通过基于皮秒激光脉冲在非线性光纤中传播的光子计算系统进行实验验证,通过调节激光功率和光纤长度,观察到克尔效应诱导的信息处理容量分布向高阶非线性容量系统性转移。最后证明总信息处理容量与基准机器学习任务性能强相关,并能可靠估计系统的有效维度。这些成果确立了信息处理容量作为连接物理计算系统内在动力学与机器学习性能的实用桥梁。