Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities

Rhodes, Jake S.

Statistics > Machine Learning

arXiv:2307.01077 (stat)

[Submitted on 3 Jul 2023]

Title:Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities

Authors:Jake S. Rhodes

View PDF

Abstract:Manifold learning approaches seek the intrinsic, low-dimensional data structure within a high-dimensional space. Mainstream manifold learning algorithms, such as Isomap, UMAP, $t$-SNE, Diffusion Map, and Laplacian Eigenmaps do not use data labels and are thus considered unsupervised. Existing supervised extensions of these methods are limited to classification problems and fall short of uncovering meaningful embeddings due to their construction using order non-preserving, class-conditional distances. In this paper, we show the weaknesses of class-conditional manifold learning quantitatively and visually and propose an alternate choice of kernel for supervised dimensionality reduction using a data-geometry-preserving variant of random forest proximities as an initialization for manifold learning methods. We show that local structure preservation using these proximities is near universal across manifold learning approaches and global structure is properly maintained using diffusion-based algorithms.

Comments:	10 pages
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2307.01077 [stat.ML]
	(or arXiv:2307.01077v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2307.01077

Submission history

From: Jake Rhodes [view email]
[v1] Mon, 3 Jul 2023 14:55:11 UTC (1,231 KB)

Statistics > Machine Learning

Title:Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators