NeuralPrefix：一种零样本传感数据插补插件 (NeuralPrefix: A Zero-shot Sensory Data Imputation Plugin)

Real-world sensing challenges such as sensor failures, communication issues, and power constraints lead to data intermittency. An issue that is known to undermine the traditional classification task that assumes a continuous data stream. Previous works addressed this issue by designing bespoke solutions (i.e. task-specific and/or modality-specific imputation). These approaches, while effective for their intended purposes, had limitations in their applicability across different tasks and sensor modalities. This raises an important question: Can we build a task-agnostic imputation pipeline that is transferable to new sensors without requiring additional training? In this work, we formalise the concept of zero-shot imputation and propose a novel approach that enables the adaptation of pre-trained models to handle data intermittency. This framework, named NeuralPrefix, is a generative neural component that precedes a task model during inference, filling in gaps caused by data intermittency. NeuralPrefix is built as a continuous dynamical system, where its internal state can be estimated at any point in time by solving an Ordinary Differential Equation (ODE). This approach allows for a more versatile and adaptable imputation method, overcoming the limitations of task-specific and modality-specific solutions. We conduct a comprehensive evaluation of NeuralPrefix on multiple sensory datasets, demonstrating its effectiveness across various domains. When tested on intermittent data with a high 50% missing data rate, NeuralPreifx accurately recovers all the missing samples, achieving SSIM score between 0.93-0.96. Zero-shot evaluations show that NeuralPrefix generalises well to unseen datasets, even when the measurements come from a different modality.

翻译：现实世界中的传感挑战（如传感器故障、通信问题和功率限制）会导致数据间歇性缺失。这一问题已知会破坏传统分类任务（其假设数据流是连续的）。先前的研究通过设计定制化解决方案（即任务特定和/或模态特定的插补方法）来解决此问题。这些方法虽然在其预期用途上有效，但在跨不同任务和传感器模态的适用性方面存在局限性。这引发了一个重要问题：能否构建一个任务无关的插补流程，使其能够迁移到新传感器而无需额外训练？在本工作中，我们形式化了零样本插补的概念，并提出了一种新颖的方法，使预训练模型能够适应处理数据间歇性问题。该框架名为NeuralPrefix，是一个生成式神经组件，在推理阶段置于任务模型之前，用于填补由数据间歇性造成的空缺。NeuralPrefix被构建为一个连续动力系统，其内部状态可通过求解常微分方程（ODE）在任何时间点进行估计。这种方法实现了一种更通用、适应性更强的插补方法，克服了任务特定和模态特定解决方案的局限性。我们在多个传感数据集上对NeuralPrefix进行了全面评估，证明了其在各领域的有效性。在缺失率高达50%的间歇性数据测试中，NeuralPrefix准确恢复了所有缺失样本，其结构相似性指数（SSIM）得分介于0.93至0.96之间。零样本评估表明，NeuralPrefix能够很好地泛化到未见过的数据集，即使测量数据来自不同的模态。