Statistically comparison between n sets of data which have non Gaussian distribution

6 views (last 30 days)

Shaya s on 13 Oct 2017

0
Link

Direct link to this question

https://se.mathworks.com/matlabcentral/answers/361207-statistically-comparison-between-n-sets-of-data-which-have-non-gaussian-distribution

Commented: Star Strider on 20 Oct 2017

Hello, I have three groups of data resulted from a feature extraction. I want to see if the feature I selected has the comparison significance or not. Actually, I want to see if I can use this feature, or this doesn't have any discriminating value, so i n this case to ignore it. For this, I should do the statistics to understand weather there is any relation between groups for this feature or not. The resulted histogram of distributions of each class are as follow:

According to my search these distributions aren't Gaussian and so I use Mann-Whitney test to see if there is a correlation between the groups. There are two values h and p:

h shows to reject the null hypotheses or not (h=1 rejection, h=0 a failure for rejection)

P-value also shows the evidence against the null hypotheses.

My question is first, did I go correct for using this test? Also how should I interpret these values?

The result for instance for the first two groups were as follow:

        p =
        1.5170e-09
        h =
       logical
        1
      stats = 
      struct with fields:
         zval: -6.0425
       ranksum: 1648

Thank you for reading my post.

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

Star Strider on 13 Oct 2017

2
Link

Direct link to this answer

https://se.mathworks.com/matlabcentral/answers/361207-statistically-comparison-between-n-sets-of-data-which-have-non-gaussian-distribution#answer_285695

The Mann-Whitney (the ranksum function) is for two-sample comparisons only. If you want to do multiple comparisons, the kruskalwallis (link) or friedman (link) (and multcompare (link)) functions might be more appropriate.

You need to decide, based on your knowledge of your data.

As with everything in statistics, this is not trivial. For a reference, I suggest: Hollander, et al., Nonparametric Statistical Methods, 3e, Section 6.5 (ISBN 978-0-470-38737-5).

8 Comments
Show 6 older commentsHide 6 older comments

Shaya s on 20 Oct 2017

Yes I understand, thank you.

Star Strider on 20 Oct 2017

My pleasure.

Products

Statistics and Machine Learning Toolbox

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Statistically comparison between n sets of data which have non Gaussian distribution

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

8 Comments
Show 6 older commentsHide 6 older comments

More Answers (0)

See Also

Categories

Tags

Products

Community Treasure Hunt

Statistically comparison between n sets of data which have non Gaussian distribution

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

8 Comments Show 6 older commentsHide 6 older comments

More Answers (0)

See Also

Categories

Tags

Products

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

8 Comments
Show 6 older commentsHide 6 older comments