Problem 55225. Simpson's Paradox - Calculate correlation coefficients for groups of data
Simpson's Paradox is a statistical phenomenon where groups of data can have a characteristic while the whole data set together has the opposite characteristic. In the example below, both groups have a negative correlation between x and y, but collectively there is a positive correlation.
Write a function that takes three vectors as input: x, y, and g. The vector g will contain only the values 1 and 2. The function should return three outputs. These outputs are the Pearson correlation coefficients for three different groupings of the data, which are: (1) for all x and y, (2) x and y corresponding to elements where g has the value 1, (3) x and y for which g is 2.
[c,c1,c2] = groupcorr(x,y,g)
c =
0.8800
c1 =
-0.6800
c2 =
-0.4396
Solution Stats
Problem Comments
Solution Comments
Show commentsProblem Recent Solvers296
Suggested Problems
-
Replace NaNs with the number that appears to its left in the row.
2975 Solvers
-
Project Euler: Problem 6, Natural numbers, squares and sums.
2139 Solvers
-
340 Solvers
-
Replace multiples of 5 with NaN
440 Solvers
-
1203 Solvers
More from this Author22
Problem Tags
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!