Spiking neural networks (SNNs) are recurrent models that can leverage sparsity in input time series to efficiently carry out tasks such as classification. Additional efficiency gains can be obtained if decisions are taken as early as possible as a function of the complexity of the input time series. The decision on when to stop inference and produce a decision must rely on an estimate of the current accuracy of the decision. Prior work demonstrated the use of conformal prediction (CP) as a principled way to quantify uncertainty and support adaptive-latency decisions in SNNs. In this paper, we propose to enhance the uncertainty quantification capabilities of SNNs by implementing ensemble models for the purpose of improving the reliability of stopping decisions. Intuitively, an ensemble of multiple models can decide when to stop more reliably by selecting times at which most models agree that the current accuracy level is sufficient. The proposed method relies on different forms of information pooling from ensemble models, and offers theoretical reliability guarantees. We specifically show that variational inference-based ensembles with p-variable pooling significantly reduce the average latency of state-of-the-art methods, while maintaining reliability guarantees.
翻译:脉冲神经网络(SNN)是一种能够利用输入时间序列稀疏性高效完成分类等任务的循环模型。若根据输入时间序列的复杂度尽早做出决策,还可进一步提升效率。何时停止推理并生成决策这一判断,必须依赖于对当前决策准确度的估计。已有研究展示了使用共形预测(CP)作为量化不确定性并支持SNN中自适应延迟决策的原则性方法。本文提出通过实现集成模型来增强SNN的不确定性量化能力,旨在提升停止决策的可靠性。直观而言,多模型集成可通过选择多数模型认为当前准确度水平已足够的时间点,更可靠地决定何时停止。所提方法基于集成模型的不同信息池化形式,并提供了理论可靠性保证。我们特别证明,采用p变量池化的变分推理集成能够在维持可靠性保证的同时,显著降低现有技术的平均延迟。