How to extract the variable IDs after dimensionality reduction via pca?

1 view (last 30 days)
I have a dataset X(voxel*protein), it's size is 3694200*10. I used pca to reduce dimensionality and got a matrix of size 3694200*4. The question is that I need to know which 4 proteins are remained. How can I get the original column IDs(0-9) of the 4 proteins?
My code: [COEEF,SCORE,LATENT] = pca(X); cumVar = cumsum(LATENT)./sum(LATENT); reducedData = SCORE(:,1:4);

Accepted Answer

the cyclist
the cyclist on 8 Jul 2014
PCA doesn't select a subset of variable. (Specifically, in your case, PCA does not select 4 out of your 10 variables).
Instead, PCA is identifying linear combinations of your original variables that explain the overall variation. If a small number of these linear combinations capture most of the variation, then it makes sense to limit to those combinations. The first output variable tells you the coefficients of those linear combinations.

More Answers (0)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!