Uniform manifold approximation and projection (UMAP) is a nonlinear dimensionality reduction technique.

The UMAP algorithm is competitive with t-SNE for visualization quality, and arguably preserves more of the global structure with superior run time performance.

What's new in UMAP?

Graph Construction

The first phase of UMAP can be thought of as the construction of a weighted k-neighbour graph.

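For each x_i, with k nearest neighbors x_{i_1}, ..., x_{i_k} under the metric d, we will define \rho_i and \sigma_i. Let

\rho_i = \min \{ d(x_i, x_{i_j}) \mid 1 \le j \le k, \; d(x_i, x_{i_j}) > 0 \}

and set \sigma_i to be the value such that

\sum_{j=1}^{k} \exp\left( \frac{-\max(0, \; d(x_i, x_{i_j}) - \rho_i)}{\sigma_i} \right) = \log_2(k)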
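To make the roles of \rho_i and \sigma_i concrete, here is a minimal sketch for a single point. It assumes the definitions from the UMAP paper: \rho_i is the smallest positive distance to any of the k nearest neighbors, and \sigma_i is the scale at which the sum of exp(-max(0, d - \rho_i) / \sigma_i) over those neighbors equals log_2(k). The function name `smooth_knn_params`, the bisection search, and the tolerance are illustrative choices, not the reference implementation.

```python
import math

def smooth_knn_params(knn_dists, n_iter=64, tol=1e-5):
    """Return (rho, sigma) for one point from its k nearest-neighbor distances."""
    k = len(knn_dists)
    target = math.log2(k)

    # rho: distance to the closest neighbor that is not a duplicate of the point
    positive = [d for d in knn_dists if d > 0.0]
    rho = min(positive) if positive else 0.0

    def total(sigma):
        # Left-hand side of the sigma equation; it grows as sigma grows
        return sum(math.exp(-max(0.0, d - rho) / sigma) for d in knn_dists)

    lo, hi = 0.0, math.inf
    sigma = 1.0
    for _ in range(n_iter):
        value = total(sigma)
        if abs(value - target) < tol:
            break
        if value > target:
            # Sum too large, so sigma is too large: shrink the upper bound
            hi = sigma
            sigma = (lo + hi) / 2.0
        else:
            # Sum too small: raise the lower bound, doubling until bracketed
            lo = sigma
            sigma = sigma * 2.0 if hi == math.inf else (lo + hi) / 2.0
    return rho, sigma

# Hypothetical distances from one point to its 4 nearest neighbors
dists = [0.5, 1.0, 1.5, 2.0]
rho, sigma = smooth_knn_params(dists)
print("rho =", rho, "sigma =", round(sigma, 4))
```

Subtracting \rho_i means the nearest neighbor always receives weight 1, and \sigma_i then rescales the remaining distances so that every point has the same effective number of neighbors.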

Graph Layout

In practice, UMAP uses a force-directed graph layout algorithm in low-dimensional space.
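As a rough illustration of what "force-directed" means here, the toy sketch below applies one attractive and one repulsive update to a point in the embedding. It assumes the low-dimensional similarity curve \Phi(d) = 1 / (1 + a d^{2b}) and uses the gradients of log \Phi (attraction along a graph edge) and log(1 - \Phi) (repulsion from a sampled non-neighbor). The values of `A` and `B`, the learning rate, the clipping limit, and the function names are all assumptions made for the example.

```python
import math

A, B = 1.577, 0.895  # illustrative curve parameters (assumed values)

def clip(value, limit=4.0):
    """Bound a gradient component so that a single step cannot explode."""
    return max(-limit, min(limit, value))

def squared_distance(y_i, y_j):
    return sum((p - q) ** 2 for p, q in zip(y_i, y_j))

def attract(y_i, y_j, lr=0.1):
    """Move y_i toward y_j, as happens along an edge of the neighbor graph."""
    d2 = squared_distance(y_i, y_j)
    if d2 == 0.0:
        return list(y_i)  # already coincident, nothing to pull
    coeff = -2.0 * A * B * d2 ** (B - 1.0) / (1.0 + A * d2 ** B)
    return [p + lr * clip(coeff * (p - q)) for p, q in zip(y_i, y_j)]

def repel(y_i, y_j, lr=0.1, eps=1e-3):
    """Push y_i away from y_j, as happens for a sampled non-neighbor."""
    d2 = squared_distance(y_i, y_j)
    coeff = 2.0 * B / ((eps + d2) * (1.0 + A * d2 ** B))
    return [p + lr * clip(coeff * (p - q)) for p, q in zip(y_i, y_j)]

y_i, y_j = [0.0, 0.0], [1.0, 0.0]
print("start distance:        ", math.dist(y_i, y_j))
print("after attractive step: ", math.dist(attract(y_i, y_j), y_j))
print("after repulsive step:  ", math.dist(repel(y_i, y_j), y_j))
```

Repeating such steps over many sampled edges and non-neighbors, with a decaying learning rate, is the general shape of the optimization; the actual sampling schedule is not covered by this sketch.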

UMAP Steps


Differences Between UMAP and t-SNE


Number of Neighbors


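The larger min-dist is (that is, the higher the corresponding curve), the farther apart nearby points are placed in the projection, so similar points are spread more sparsely.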
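The effect of min-dist on the curve can be checked numerically. The sketch below evaluates a piecewise target curve: similarity 1 up to min-dist, then exponential decay controlled by a spread parameter. This mirrors, as far as I understand it, the shape that the `min_dist` and `spread` parameters of the umap-learn package define before a smooth curve is fitted to it. The function name `target_similarity` and the sample distances are made up for the example.

```python
import math

def target_similarity(distance, min_dist, spread=1.0):
    """Piecewise target curve: flat at 1 inside min_dist, exponential decay beyond."""
    if distance <= min_dist:
        return 1.0
    return math.exp(-(distance - min_dist) / spread)

# Compare curve heights at the same embedding distances for several min_dist values
for min_dist in (0.1, 0.5, 0.8):
    row = [round(target_similarity(d, min_dist), 3) for d in (0.25, 0.5, 1.0)]
    print(f"min_dist={min_dist}: similarities at d=0.25, 0.5, 1.0 -> {row}")
```

At any fixed distance the curve is at least as high for a larger min-dist, so a given level of similarity is already satisfied at a larger embedding distance. Similar points therefore do not need to be packed as tightly, which gives the sparser layout described above.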


Hyper-parameters

References