Flow matching has emerged as a powerful framework for generative modeling, with recent empirical successes highlighting the effectiveness of signal-space prediction ($x$-prediction). In this work, we investigate the transfer of this paradigm to binary manifolds, a fundamental setting for generative modeling of discrete data. While $x$-prediction remains effective, we identify a latent structural mismatch that arises when it is coupled with velocity-based objectives ($v$-loss), leading to a time-dependent singular weighting that amplifies gradient sensitivity to approximation errors. Motivated by this observation, we formalize prediction-loss alignment as a necessary condition for flow matching training. We prove that re-aligning the objective to the signal space ($x$-loss) eliminates the singular weighting, yielding uniformly bounded gradients and enabling robust training under uniform timestep sampling without reliance on heuristic schedules. Finally, with alignment secured, we examine design choices specific to binary data, revealing a topology-dependent distinction between probabilistic objectives (e.g., cross-entropy) and geometric losses (e.g., mean squared error). Together, these results provide theoretical foundations and practical guidelines for robust flow matching on binary -- and related discrete -- domains, positioning signal-space alignment as a key principle for robust diffusion learning.
翻译:流匹配已成为生成建模的强大框架,近期实证成功凸显了信号空间预测($x$-预测)的有效性。本文研究将该范式迁移至二元流形——离散数据生成建模的基础场景。尽管$x$-预测保持有效,我们识别出当其与基于速度的目标函数($v$-损失)结合时产生的潜在结构失配,导致时变奇异加权现象,从而放大近似误差对梯度的敏感性。基于此观察,我们形式化地将预测-损失对齐定义为流匹配训练的必要条件。证明将目标函数重新对齐至信号空间($x$-损失)可消除奇异加权,产生一致有界梯度,并实现在均匀时间步采样下无需依赖启发式调度策略的鲁棒训练。最后,在对齐得到保障的基础上,我们考察二元数据特有的设计选择,揭示概率型目标函数(如交叉熵)与几何型损失函数(如均方误差)之间依赖于拓扑结构的区别。这些结果共同为二元及相关的离散域上的鲁棒流匹配提供了理论基础与实践指南,将信号空间对齐定位为鲁棒扩散学习的关键原则。