We introduce a theoretical framework that connects multi-chart autoencoders in manifold learning with the classical theory of vector bundles and characteristic classes. Rather than viewing autoencoders as producing a single global Euclidean embedding, we treat a collection of locally trained encoder-decoder pairs as a learned atlas on a manifold. We show that any reconstruction-consistent autoencoder atlas canonically defines transition maps satisfying the cocycle condition, and that linearising these transition maps yields a vector bundle coinciding with the tangent bundle when the latent dimension matches the intrinsic dimension of the manifold. This construction provides direct access to differential-topological invariants of the data. In particular, we show that the first Stiefel-Whitney class can be computed from the signs of the Jacobians of learned transition maps, yielding an algorithmic criterion for detecting orientability. We also show that non-trivial characteristic classes provide obstructions to single-chart representations, and that the minimum number of autoencoder charts is determined by the good cover structure of the manifold. Finally, we apply our methodology to low-dimensional orientable and non-orientable manifolds, as well as to a non-orientable high-dimensional image dataset.
翻译:我们提出一个理论框架,将流形学习中的多图自编码器与向量丛及示性类的经典理论联系起来。不同于将自编码器视为生成单一全局欧几里得嵌入,我们将一组局部训练的编码器-解码器对视为流形上学习到的图谱。我们证明,任何满足重构一致性的自编码器图谱都能规范地定义满足上循环条件的转移映射,且当潜在维度与流形本征维度一致时,这些转移映射的线性化会得到与切丛一致的向量丛。该构造可直接获取数据的微分拓扑不变量。特别地,我们证明第一斯蒂弗尔-惠特尼类可通过学习到的转移映射雅可比矩阵的符号计算得出,从而为检测可定向性提供算法判据。我们还证明非平凡示性类会构成单图表示的障碍,且自编码器图谱的最小数量由流形的良好覆盖结构决定。最后,我们将该方法应用于低维可定向与不可定向流形,以及一个不可定向的高维图像数据集。