Synthetic time series generated by diffusion models enable sharing privacy-sensitive datasets, such as patients' functional MRI records. Key criteria for synthetic data include high data utility and traceability to verify the data source. Recent watermarking methods embed in homogeneous latent spaces, but state-of-the-art time series generators operate in data space, making latent-based watermarking incompatible. This creates the challenge of watermarking directly in data space while handling feature heterogeneity and temporal dependencies. We propose TimeWak, the first watermarking algorithm for multivariate time series diffusion models. To handle temporal dependence and spatial heterogeneity, TimeWak embeds a temporal chained-hashing watermark directly within the temporal-feature data space. The other unique feature is the $\epsilon$-exact inversion, which addresses the non-uniform reconstruction error distribution across features from inverting the diffusion process to detect watermarks. We derive the error bound of inverting multivariate time series while preserving robust watermark detectability. We extensively evaluate TimeWak on its impact on synthetic data quality, watermark detectability, and robustness under various post-editing attacks, against five datasets and baselines of different temporal lengths. Our results show that TimeWak achieves improvements of 61.96% in context-FID score, and 8.44% in correlational scores against the strongest state-of-the-art baseline, while remaining consistently detectable.
翻译:扩散模型生成的合成时间序列数据使得共享隐私敏感数据集成为可能,例如患者的功能性磁共振成像记录。合成数据的关键标准包括高数据效用性和可追溯性,以验证数据来源。现有的水印方法通常在均匀的潜在空间中嵌入水印,但最先进的时间序列生成器在数据空间中运行,使得基于潜在空间的水印方法不兼容。这带来了直接在数据空间中嵌入水印的挑战,同时需要处理特征异质性和时序依赖性。我们提出了TimeWak,这是首个面向多元时间序列扩散模型的水印算法。为处理时序依赖性和空间异质性,TimeWak直接在时序-特征数据空间中嵌入时序链式哈希水印。其另一独特特性是$\epsilon$-精确反演,该技术通过反演扩散过程来检测水印,解决了特征间重构误差分布不均匀的问题。我们推导了在保持鲁棒水印可检测性的前提下,反演多元时间序列的误差边界。我们在五个不同时序长度的数据集上,针对合成数据质量、水印可检测性及多种后编辑攻击下的鲁棒性,对TimeWak进行了全面评估,并与现有基线方法进行比较。实验结果表明,相较于当前最强的基线方法,TimeWak在上下文FID分数上提升了61.96%,在相关性分数上提升了8.44%,同时始终保持稳定的可检测性。