This paper presents the geometric aspect of the autoencoder framework, which, despite its importance, has been relatively less recognized. Given a set of high-dimensional data points that approximately lie on some lower-dimensional manifold, an autoencoder learns the \textit{manifold} and its \textit{coordinate chart}, simultaneously. This geometric perspective naturally raises inquiries like "Does a finite set of data points correspond to a single manifold?" or "Is there only one coordinate chart that can represent the manifold?". The responses to these questions are negative, implying that there are multiple solution autoencoders given a dataset. Consequently, they sometimes produce incorrect manifolds with severely distorted latent space representations. In this paper, we introduce recent geometric approaches that address these issues.
翻译:本文阐述了自动编码器框架的几何视角,尽管该视角至关重要,却相对未获充分重视。给定一组近似位于某个低维流形上的高维数据点,自动编码器能同时学习该流形及其坐标图。这一几何视角自然地引发了诸如“有限的数据点集是否对应单个流形?”或“是否存在唯一能表示该流形的坐标图?”等问题。对这些问题的答案是否定的,这意味着给定数据集存在多个解自动编码器。因此,它们有时会产生具有严重扭曲潜在空间表示的错误流形。本文介绍了近期解决这些问题的几何方法。