How to index on a loop for first occurrence?
Show older comments
Hi,
I have a set of data which I would like to loop through for each row.
I would like for every row, the first occurrence of a one to cause all other subsequent values in that row to be replaced by a zero.
Does anybody have any advice on how I could go about doing this?
Thanks
Answers (2)
Andrei Bobrov
on 1 Jun 2015
Edited: Andrei Bobrov
on 1 Jun 2015
a = [1 0 1 0 0 0 1
1 0 0 0 0 0 0
0 0 0 0 1 0 0
0 0 0 0 0 0 0
0 0 0 1 1 0 0
0 0 0 0 0 0 0
0 0 0 1 0 0 0
0 1 0 0 0 0 0
0 0 0 0 1 0 0
0 0 0 0 0 0 0
0 0 0 0 0 0 0
0 0 0 0 1 0 0
0 0 0 1 0 0 1
0 0 0 1 0 0 0
0 0 0 0 0 0 0
0 0 0 0 0 0 1
1 0 0 0 0 0 0
0 0 1 0 0 0 0
0 0 0 0 0 0 0
0 0 0 0 0 0 0
1 0 0 0 0 1 0
0 0 0 0 0 0 0
0 0 0 0 0 0 1
0 0 0 0 0 0 0
0 0 1 0 0 1 0
0 0 1 0 0 0 0
0 0 0 0 0 1 0
0 0 0 0 1 0 0]
out = cumsum(a==1,2)==1 & a==1;
10 Comments
Chameleon17
on 1 Jun 2015
Edited: Walter Roberson
on 1 Jun 2015
Chameleon17
on 1 Jun 2015
Adam
on 1 Jun 2015
It works fine for me.
You may want to wrap it in a cast to double depending what you want to do with the output as it returns a logical array, but for many purposes a logical array will serve you better anyway.
Andrei Bobrov
on 1 Jun 2015
small example
>> a = [0 1 0 0
1 1 0 0
0 1 0 1
1 0 0 1];
>> out = cumsum(a==1,2)==1 & a==1
out =
0 1 0 0
1 0 0 0
0 1 0 0
1 0 0 0
>>
Here all right?
Chameleon17
on 1 Jun 2015
should help understand the use of & as an element-wise logical operator. Note that this is different to && which expects scalar operands to AND together into a result.
gives similar information on the == operator's usage.
It did take me quite a while trying to find these in the help because an operator is too short to use as a search term, but they are all under the Language Fundamentals section of the main help.
Chameleon17
on 2 Jun 2015
Chameleon17
on 9 Sep 2015
Edited: Walter Roberson
on 9 Sep 2015
Walter Roberson
on 9 Sep 2015
Edited: Walter Roberson
on 9 Sep 2015
The ==1 examines each element independently. It does not process row by row.
The & will result in 1 if both values are true, and 0 otherwise. It is a binary operator. The cumsum(a==1,2)==1 part is going to create a logical matrix the same size as "a" and the a==1 part is going to create a logical matrix the same size as "a", and then the & is going to compare corresponding elements one at a time, returning true only if both are set. Another way of writing the code would be
L1 = cumsum(a==1,2)==1;
L2 = a==1;
out = false(size(L1));
for K = 1 : size(L1,2)
for J = 1 : size(L1,1)
out(J,K) = L1(J,K) & L2(J,K);
end
end
In the case where "a" contains only 0 and 1, then the code could be shorter:
out = cumsum(a,2)==1 & a;
or
out = (cumsum(a,2)==1) .* a;
If "a" can contain values other than 0 and 1, then none of the posted versions in this sub-thread are correct. I would need to think more about a good solution for that case.
Chameleon17
on 9 Sep 2015
Adam
on 1 Jun 2015
idx = find( x == 1, 1 );
x( (idx + 1):end ) = 0;
will do this for an array x. I will leave it up to you to make the trivial change to do it in a loop.
7 Comments
Chameleon17
on 1 Jun 2015
Chameleon17
on 1 Jun 2015
Yes, it definitely won't work without the loop, but your original question suggested you wanted to use a loop to do this so something like this (untested on command line so could be a syntax error. nRows and x are just for pseudo code for you to replace with real data)
for n = 1:nRows
idx = find( x(n,:) == 1, 1 );
x( n,(idx + 1):end ) = 0;
end
Andrei's solution appears to work fine for me for not using a for loop and does actually look similar to a solution I gave to a similar question a while ago so it is a solution I like.
In general though, if speed is not an issue then the question of whether to use a loop or some other alternative that doesn't use a loop is a matter of preference where readability is often the most important factor.
If, when you come back to your own code a week later, you can no longer understand what it is doing because it uses unintuitive methods then it probably isn't worth it.
Obviously if the for loop is slow then finding an alternative is important.
Chameleon17
on 1 Jun 2015
Edited: Walter Roberson
on 9 Sep 2015
Chameleon17
on 1 Jun 2015
The final 1 after the comma is the 2nd argument to the find function which tells it to just find the first occurrence rather than all occurrences (or some other number than 1 if explicitly defined).
If you don't add that argument you will get all indices on the row corresponding to ones.
For example:
a = [0 0 1 1 0 0 1]
a =
0 0 1 1 0 0 1
>> find( a == 1 )
ans =
3 4 7
>> find( a == 1, 1 )
ans =
3
When I have something like this I don't fully understand I find it invaluable to just create a quick example on the command line so I can see for myself what is going on. It is one of the big advantages of Matalb over, for example, C++ where you need to do a lot more work to just test to see what simple syntax alternatives give you.
Chameleon17
on 2 Jun 2015
Categories
Find more on Annotations in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!