Find NaNs at the end of an Excel file

Question

Hi,

New user, first question, so bear with me, but I haven't been able to find anything on this. I need to find the names of files that contain NaN values at the end of a comma-delimited file. xlsread seems to truncate them automatically. The files vary in length. What is the best way to do this?

baseInputFolder = 'C:\Users\me\Desktop\Test\';
filename = strcat(baseInputFolder,'find_NaNs.xlsx');
inputFiles = dir(fullfile(baseInputFolder,'**\*.csv'));
nextRow = 1;
for k = 1:length(inputFiles)
    baseFileName = inputFiles(k).name;
    fullFileName = fullfile(inputFiles(k).folder, baseFileName);
    fprintf('Reading file %d of %d named %s\n',k,length(inputFiles),baseFileName);
    if any(isnan(xlsread(fullFileName,1)), 'all')
        range1=sprintf('%s%d','A',nextRow);
        writematrix(fullFileName,filename,'Sheet','Sheet1','Range',range1);
        nextRow = nextRow+1;
    end
end

Answer 1

Star Strider on 5 Mar 2021

See if the contains function (introduced in R2016b) will do what you want.

Star Strider on 5 Mar 2021

I’m not certain what you’re doing, or if this is any significant improvement:

nanfiles = {};                                      % File Names That Meet Both Conditions
nextRow = 1;
for k = 1:length(inputFiles)
    baseFileName = inputFiles(k).name;
    fullFileName = fullfile(inputFiles(k).folder, baseFileName);
    fprintf('Reading file %d of %d named %s\n',k,length(inputFiles),baseFileName);
    D1 = readmatrix(fullFileName);
    [r,~] = find(isnan(D1));                        % Row Indices Of ‘NaN’ Values (Column-Major Order, So Not Sorted)
    nrnan = nnz(isnan(D1));                         % Number Of ‘NaN’ Values In File
    nanconsec = nnz(diff(unique(r))>1)==0;          % If Rows Containing ‘NaN’ Values Are Consecutive = ‘true’
    if nrnan > 0 && nanconsec
        nanfiles{nextRow} = fullFileName;           % If Both Conditions Are ‘true’, Store ‘filename’
        nextRow = nextRow+1;
    end
end

That assumes that you are only checking for NaN values in rows at the end of each file, not intermediate NaN values (if any exist anywhere else). The code does not store a file name if the rows with NaN values are not consecutive: if the NaN values are only at the end of the file, the name is stored; if they also occur in other places in the file, that name is not stored. (It might also be necessary to test that the NaN values fill each row completely. I have no idea if this is necessary.)

This stores the file names in the loop; the file name cell array would then be written to a text file after the loop completes.
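
A minimal sketch of the two loose ends mentioned above: testing that the NaN values fill whole rows at the very end of the data, and writing the collected names to a text file after the loop. It is an illustration only, not a drop-in replacement for the loop. The sample matrices, the names a.csv, b.csv and c.csv, and the output file nan_files.txt are all made up, and the condition differs from the one in the loop: a name is stored whenever the file ends in at least one all-NaN row.

```matlab
% Hypothetical stand-ins for the matrices that readmatrix would return
names = {'a.csv', 'b.csv', 'c.csv'};
data  = {[1 2; 3 4; NaN NaN; NaN NaN], ...   % two all-NaN rows at the end
         [1 2; NaN NaN; 5 6], ...            % all-NaN row in the middle only
         [1 2; 3 4]};                        % no NaN values

nanfiles = {};
for k = 1:numel(data)
    D1 = data{k};
    nanRows     = all(isnan(D1), 2);             % true for rows that are entirely NaN
    lastDataRow = find(~nanRows, 1, 'last');     % last row with at least one non-NaN value
    if isempty(lastDataRow)
        lastDataRow = 0;                         % every row is all NaN
    end
    numTrailing = size(D1, 1) - lastDataRow;     % all-NaN rows after the last data row
    if numTrailing > 0
        nanfiles{end+1} = names{k};              % store names of files ending in NaN rows
    end
end

% Write one name per line after the loop (hypothetical output location)
outFile = fullfile(tempdir, 'nan_files.txt');
writecell(nanfiles(:), outFile);
```

In the real loop, D1 would come from readmatrix(fullFileName) and names{k} would be fullFileName. writecell was introduced in R2019a, so older releases would need fprintf to a file opened with fopen instead.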

Star Strider on 5 Mar 2021

As always, my pleasure!

Walter Roberson on 5 Mar 2021

Yes, when you use xlsread(), the first output, num, automatically has leading and trailing rows and columns of NaN removed. This is because, when you are talking about numeric values, text shows up as NaN (not a number, after all), and xlsread() wants to trim out header lines, trailer lines, and text columns.

It is also because, if you ask Excel to read a range of values and the range exceeds the size actually in the file, Excel returns NaN. So xlsread() cannot tell the difference between NaNs supplied because the file "ended" and NaNs that were part of the data. Indeed, unless there is a template in the file or formatting has been specifically applied to a particular range, Excel itself cannot really tell where the end of the data is. It is all ambiguous in spreadsheets: if you wrote something to row 10000 and then deleted the content, is the spreadsheet now "really" 10000 rows, or is it "really" the size implied by the last non-empty data?
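
By contrast with the trimming described above, readmatrix (as used in the loop earlier in this thread) should return trailing NaN rows of a text file as they are. One way to check this on your own release is a small round trip through a temporary file. The data below is made up, and the sketch assumes R2019a or later for writematrix and readmatrix.

```matlab
% Made-up data with two all-NaN rows at the end
M = [1 2; 3 4; NaN NaN; NaN NaN];

tmpFile = [tempname '.csv'];            % temporary file, deleted below
writematrix(M, tmpFile);                % NaN is written as the text "NaN"
D = readmatrix(tmpFile);                % read the same file back
delete(tmpFile);

fprintf('Rows written: %d, rows read back: %d\n', size(M,1), size(D,1));
trailingKept = all(isnan(D(end, :)));   % true if the last row read back is all NaN
```

If the row counts match and trailingKept is true, the trailing NaN rows survived the read and can be tested for directly.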
