Reading unformatted text and converting to formatted

Hi,
I have some files that contain raw text that is unformatted. These files have lots of lines and all of these lines are along the lines of
RFCH[0][0]:1, RFCH[0][1]:2, RFCH[0][2]:3,
etc. What I need to do is assign the value to the right of the colon to a corresponding matrix location, so the first 1 would go to [0,0] ([1,1] since Matlab probably wouldn't recognize [0,0]).
I know I can open the file using
fid=fopen(projectdir, 'r')
T=textscan(fid, '%s')
fclose(fid)
I am not really sure how go about this. Any help would be great.

4 Comments

@Ibro Tutic: please do NOT remove your question like this. We are volunteers who give our time to help you and everyone else who might be reading this forum. When you delete your question you make our answers meaningless and pointless, and you make our information useless for everyone else as well. You are telling us that our volunteered time is worthless, that our efforts are not important, and that other people shall not learn from our efforts.
You have just made the answer below worthless, even though several people have spent their own time volunteering their time to help both you and other browsers of this forum.
Notice I linked the other question. If somebody was interested in the solution and what it is referencing, they can go to that question to get ALL the relevant information. This question was worded extremely badly on my part and attempting to fix it by commenting and editing the original question would confuse everybody looking at it.
(Answers Dev) Restored edit

Sign in to comment.

 Accepted Answer

You can read the whole file using fileread, identify the digits using regexp, convert the values to linear indices using sub2ind, and then simple linear indexing is all that is required to allocate the values from the file to the final matrix:
str = fileread('temp.txt');
% identify digits:
rgx = '[A-Z]+\[(\d+)\]\[(\d+)\]: *(\d+)';
C = regexp(str,rgx,'tokens');
% convert digits to numeric:
M = cellfun(@str2double,vertcat(C{:}));
M(:,1:2) = 1+M(:,1:2);
% convert to linear indices:
out = nan(max(M(:,1)),max(M(:,2)));
idx = sub2ind(size(out),M(:,1),M(:,2));
% allocate values:
out(idx) = M(:,3)
Because you did not provide any sample datafile I had to create my own, which contains only this string:
RFCH[0][0]:1, RFCH[0][1]:2, RFCH[0][2]:3, RFCH[2][2]:4
The code's output using my sample data file is this:
out =
1 2 3
NaN NaN NaN
NaN NaN 4

4 Comments

Running your script gives:
Error using cellfun
Input #2 expected to be a cell array, was double instead.
Error in Untitled3 (line 12)
M = cellfun(@str2double,vertcat(C{:}));
The some sample data is:
RainflowcycleCounterHistogram[0][0]:0 RainflowcycleCounterHistogram[0][1]:0
And so on. I apologize for not giving this out initially as I assumed that people wouldn't just do it for me and I wanted to learn it.
Your original question did not have the space character after the colons, and my computer is not very good at reading your mind. I have now altered my answer to include your new information. If you have any other useful information about the strings, now is a good time to tell us. Even better is to simply upload the file itself so we can try our code on it. You can upload a file by clicking the paperclip button, and then pressing both the Choose file and Attach file buttons.
"The letters don't matter at all..." actually they do. As you can see dpb's answer uses exactly the literal string 'RFCH'. Every piece of information is important.
The file can't be uploaded due to NDA's and what not. And I figured that I could figure out what portion of the code looked at the letters and edit it in myself, but the method you use was something I did not think of and am not familiar with, so it didn't work out the way I intended.
I am getting the same error as above, it seems the matrix C is an empty 0x0 matrix, which I assume is causing the error. The space after the colon was an error on my part, there is not supposed to be a space. I modified your script to take this into account and I am still receiving the same error.

Sign in to comment.

More Answers (1)

RFCH[0][0]:1, RFCH[0][1]:2, RFCH[0][2]:3,
fmt='RFCH[%d][%d]:%d';
fid=fopen('yorfile');
c=cell2mat(textscan(fid,fmt,'delimiter',',','collectoutput',1));
fid=fclose(fid);
X(c(:,1)+1,c(:,2)+1)=c(:,3);

1 Comment

Subscripted assignment dimension mismatch.
Error in temp (line 5)
X(c(:,1)+1,c(:,2)+1) = c(:,3);

Sign in to comment.

Categories

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!