Inf resulted when calculating mean
35 views (last 30 days)
Show older comments
King To Leung
on 30 Jul 2022
Commented: Walter Roberson
on 31 Jul 2022
I am calculating the mean pe each year from 1993-2021. I have rounded my data set, drdata, to the 2 decimal place. However, the stockpe resulted in Inf for every year. I don't understand why this happens. I have already rounded it.
%drdata column 2 is date
%drdata column 4 is the pe ratio data
[year,~,~] = datevec(728110)
%storing the years in y (months and dates are not necessary)
%dates are stored in the 2nd column
[y,~,~] = datevec(drdata(:,2));
%calculating mean stock return for each year, return is in the 7th column
unique(y)
mask = y >= 1993 & y <= 2021;
nnz(mask)
nnz(isnan(drdata(mask,4)))
for k=1993:2021
stockpe(k-1992) = round(mean(drdata(y==k,4)),2);
%1st element will correspond to 1993, 2nd - 1994 and so on
end
4 Comments
Walter Roberson
on 31 Jul 2022
temp = drdata(y==1993,4);
size(temp)
min(temp), max(temp)
Please show the output of these debugging commands
Accepted Answer
Walter Roberson
on 31 Jul 2022
You have inf in your data. mean() of data is sum() of the data divided by the number of elements of the data. sum() that includes inf is going to be inf (unless the data includes nan or -inf) and inf divided by a finite number is inf.
If the inf represent missing data, delete those entries before processing the mean()
Also consider using grpstats() or splitapply()
6 Comments
Walter Roberson
on 31 Jul 2022
I would suggest
mask = ~isfinite(drdata(:,4));
drdata(mask,4) = nan;
You do not need to loop.
Or you could
drdata = standardizemissing(drdata, inf);
More Answers (0)
See Also
Categories
Find more on Structures in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!