How do I compare two data sets of unequal length?
Show older comments
I have two sets of data, taken on different days, from the same sensor. The temperature was swept from 26C to -30C to 80C and back to 26C. The sensor was read periodically during the temperature sweep. The data sets consist of a temperature column, and another column representing the sensor readings. I would like to take a difference between the two sets of sensor readings, generating another data set having a column of temperatures, and a column of differences between the two original sets of sensor readings. If each data set had exactly the same vector of temperatures, I could just subtract one vector of sensor readings from the other. However, the temperature vectors do not contain exactly the same temperatures, and they don't even have the same number of elements. I would like to interpolate one set of temperatures and sensor readings to match the temperatures of the other, so I have two data sets of the same size, at the same temperatures. One complicating factor is that, due to sensor hysteresis with respect to temperature, the sensor readings are different on the downward temperature ramps from those on the upward temperature ramp. Therefore I can't sort the data on temperature, because that would mix the upward and downward ramps. If I could sort the data on temperature, I could use timeseries objects, with temperature in place of time. However, that won't work in this case.
8 Comments
Adam Danz
on 8 Aug 2018
A sample of your data would be helpful to visualize the problem.
DH
on 8 Aug 2018
Adam Danz
on 8 Aug 2018
What is 'Sensory Data'?
Also, how are you going to pair the two temperature vectors? Are you pairing them by time-of-day? If so, where's the time data?
DH
on 8 Aug 2018
I think I understand your problem now (sensorY was a typo). I'll think about it. In the meantime, here are @DH's data in case anyone else is thinking about this. The red and blue are the two data sets and you can see the lag in temperature and the sensor between the two data sets.

Adam Danz
on 8 Aug 2018
...this is a tough one. You can't use interp1() because the first input is required to be monotonic without duplicates which your data isn't. Even if you sort by temperature and store the sorted index values, you still have duplicates. What is the final goal here? I know you want to measure the difference between the sensors at the same temperature. But you have duplicate measures within (nearly) the same temperature. For example, your temperature data passes through 0 twice. Can you use the average of those 2 sensor measures for the temp=0 data point?
DH
on 9 Aug 2018
DH
on 9 Aug 2018
Accepted Answer
More Answers (2)
Yuvaraj Venkataswamy
on 8 Aug 2018
0 votes
2 Comments
Adam Danz
on 8 Aug 2018
Yeah, this method won't work.
Yuvaraj Venkataswamy
on 8 Aug 2018
if true
id = ismember(dataset1', dataset2', 'rows');
X = 1:size(dataset1, 2);
Y = X(id);
end
1 Comment
Adam Danz
on 8 Aug 2018
This method won't work on your current data, either.
Categories
Find more on Resampling Techniques in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!