Given a set of high-dimensional data, run_umap.m produces a lower-dimensional representation of the data for purposes of data visualization and exploration. See the comments at the top of the file run_umap.m for documentation and many examples of how to use this code.
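A minimal call might look like the sketch below. The file name is hypothetical, and the 'n_components' argument is shown as an illustration; consult the comments in run_umap.m for the authoritative argument list.

```matlab
% Illustrative usage sketch; see the comments in run_umap.m for the
% authoritative documentation. 'myData.csv' is a hypothetical file whose
% rows are observations and whose columns are measurement dimensions.
reduction = run_umap('myData.csv');

% A numeric matrix works as input too; here we ask for a 3D reduction
% (the 'n_components' argument name is assumed from the UMAP convention).
data = randn(5000, 10);   % stand-in for real high-dimensional data
reduction3D = run_umap(data, 'n_components', 3);
```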
This MATLAB implementation follows a very similar structure to the Python implementation, and many of the function descriptions are nearly identical.
Here are some major differences in this MATLAB implementation:
1) The MATLAB function eigs.m does not appear to be as fast as the function "eigsh" in the Python package SciPy. For large data sets, we initialize a low-dimensional transform by binning the data with an algorithm known as probability binning. If the user downloads and installs the function lobpcg.m, made available by Andrew Knyazev at https://www.mathworks.com/matlabcentral/fileexchange/48-locally-optimal-block-preconditioned-conjugate-gradient, it can be used to find exact eigenvectors for medium-sized data sets. We also offer the option of downloading our slightly altered version of lobpcg.m, which produces equivalent results.
2) We have built in the optional ability to detect clusters in the low-dimensional output of UMAP. The clustering method we invoke is either DBM (described at https://www.hindawi.com/journals/abi/2009/686759/) for 2D reductions or DBSCAN (built in to MATLAB R2019a and later) for a reduction of any dimensionality. This produces cluster ID output and visualizations, as explained in the code examples.
3) We have also built in visual and computational tools for comparing data groups. Data groups (AKA subsets) can be defined either by running clustering (described above) on the data islands formed by UMAP’s reduction or by external classification labels provided for every row of the high-dimensional input data given to UMAP. Our UMAP implementation uses external labels for supervised reductions and supervised template reductions, as well as for comparing any reduction’s data islands directly to the external classification. We use a change quantification metric (QFMatch), which detects similarity in both mass and distance (described at https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5818510/), as well as an F-score for measuring overlap when the groups are different classifications of the same data (described at https://en.wikipedia.org/wiki/F-score).
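The optional cluster detection described above can be requested at reduction time. In this sketch, the 'cluster_output' argument name and its 'graphic' value are assumptions based on the code examples referenced above; the third output holding one cluster identifier per input row is likewise assumed, so confirm both against the comments in run_umap.m.

```matlab
% Hedged sketch of cluster detection on the reduced output; the argument
% name 'cluster_output' and the third return value are assumptions.
[reduction, ~, clusterIds] = run_umap('myData.csv', ...
    'cluster_output', 'graphic');
% clusterIds would then hold one cluster identifier per input row,
% computed by DBM (for 2D reductions) or DBSCAN (for any dimensionality).
```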
For visualizing data groups, we provide a dendrogram (described as a QF-tree at https://www.nature.com/articles/s42003-019-0467-6) and sortable tables that show each data group’s similarity, overlap, false positive %, and false negative %. In version 2.2 we added the “UMAP dimension explorer” (UDE). UDE is a sortable table that shows characteristics of a data group’s unreduced data in each input dimension. These characteristics include the Kullback-Leibler divergence (KLD); the distribution as a density bar (colored with MATLAB’s jet colormap); and the median, mean, SD, and MAD. UDE supports data groups drawn with a MATLAB region-of-interest (ROI) tool on the UMAP output plot.
Without the aid of any compression, this MATLAB UMAP implementation tends to be faster than the current Python implementation (version 0.5.2 of umap-learn). All UMAP reductions are accelerated with C++ MEX implementations. Due to File Exchange requirements, we supply only the C++ source code for the MEX modules; users must download or build the MEX binary files separately (the option to download or build them is offered when "run_umap" is called). As examples 13 to 15 show, you can test the speed difference between the implementations on your own computer by setting the 'python' argument to true.
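A speed comparison along the lines of examples 13 to 15 can be sketched as follows. The 'python' argument is the one described above; the input file name is hypothetical, and running with 'python' set to true assumes a working umap-learn installation.

```matlab
% Rough timing sketch using the 'python' argument described above.
% 'myData.csv' is a hypothetical input file; the Python path assumes
% umap-learn is installed and reachable from MATLAB.
tic; run_umap('myData.csv');                  t_matlab = toc;
tic; run_umap('myData.csv', 'python', true);  t_python = toc;
fprintf('MATLAB: %.1f s, Python: %.1f s\n', t_matlab, t_python);
```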
Additionally, users of supervised templates may request the post-reduction services of supervisor matching, QF-tree, and QFMatch. The function run_umap.m returns the results of these services via its new 4th output argument, extras. The properties of extras are documented in the file umap/UMAP_extra_results.m.
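Capturing the 4th output might look like the sketch below. The names of the first three outputs, the 'template_file' argument, and the template file name are illustrative assumptions; only the extras output and the 'qf_tree' argument are documented above, so verify the rest against run_umap.m and umap/UMAP_extra_results.m.

```matlab
% Hedged sketch of a supervised template reduction that requests
% post-reduction services. The 'template_file' argument name and the
% first three output names are assumptions; extras is documented in
% umap/UMAP_extra_results.m.
[reduction, umap, clusterIds, extras] = run_umap('myData.csv', ...
    'template_file', 'myTemplate.mat', 'qf_tree', true);
```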
The major improvements in our version 3.01 release are:
- Significant acceleration of both dimension reduction and the matching of resultant classifications. This is done by compressing the input data into probability bins. We invented probability binning some two decades ago as an early attempt at an open cover akin to UMAP’s fuzzy simplicial complexes; hence the compression operation specializes in retaining high-dimensional characteristics while significantly reducing size. In our testing with flow cytometry data sets, we see negligible loss of classification accuracy for up to 40 dimensions when running QFMatch on clusters from UMAP reductions with prior trusted classifications. However, we do notice some loss of global structure in the UMAP plots. See the fast_approximation argument comments in the run_umap.m file. To understand probability bins, see https://onlinelibrary.wiley.com/doi/full/10.1002/1097-0320(20010901)45:1%3C37::AID-CYTO1142%3E3.0.CO%3B2-E
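The probability-bin compression above is enabled with the fast_approximation argument mentioned in that paragraph. A minimal sketch (hypothetical input file; see run_umap.m for the argument's documented behavior):

```matlab
% Enable probability-bin compression before reducing; faster, at the
% cost of some global structure in the resulting plot (per the notes
% above). 'myData.csv' is a hypothetical input file.
reduction = run_umap('myData.csv', 'fast_approximation', true);
```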
- A PredictionAdjudicator (PA) feature that helps determine how well one classification’s subsets predict another’s. PA reorganizes the predicting classifier’s subsets into predicting subsets: true positive, false positive, and false negative subsets. PA determines whether the false positives or the false negatives have more QFMatch-based similarity to the predicted subset. PA guides UMAP dimension explorers into showing the measurement distributions and Kullback-Leibler divergence of predicting subsets stacked together with the predicted subset. Selections in the PA table are highlighted in the UMAP and EPP plots.
- A complementary independent classifier that generates labels both for supervising UMAP and for classification comparison research. This classifier is named “exhaustive projection pursuit” (EPP). EPP takes a more conservative approach to grappling with the curse of dimensionality than that taken by UMAP’s algebraic topology. For more background on the curse of dimensionality, see https://www.nature.com/articles/nri.2017.150 . EPP scans all dimension pairs for the best split and then repeats on each split part until no further splits are found. Its key benefit for the flow cytometry community is that its decisions are more familiar and reviewable to the biologist than decisions made by most classifiers. EPP shows its decisions in a tree that resembles the biologists’ age-old manual gating trees that have been driving immunology research for decades! EPP is described at https://onedrive.live.com/?authkey=%21ALyGEpe8AqP2sMQ&cid=FFEEA79AC523CD46&id=FFEEA79AC523CD46%21209192&parId=FFEEA79AC523CD46%21204865&o=OneUp
Optional toolbox dependencies:
-The Bioinformatics Toolbox is required to change the 'qf_tree' argument.
-The Curve Fitting Toolbox is required to change the 'min_dist' argument.
This implementation is a work in progress. It has been looked over by Leland McInnes, who considers it "a fairly faithful direct translation of the original Python code". We hope to continue improving it in the future.
Provided by the Herzenberg Lab at Stanford University.
We appreciate any and all help in finding bugs. Our priority has been determining the suitability of our concepts (UMAP supervised templates and exhaustive projection pursuit) for research publications in flow cytometry.