creating a Dummy variable from a string vector
5 views (last 30 days)
Show older comments
Hi, i want to create a dummy variable of a string columnn cointaining information over investors. I have identified the names of the investors that i demme to have experience and would like to create a dummy = 1 if the cell contain the name of said investor, and 0 if not. The dataset is quite large so cant do this manually, not really sure how to proceed.
2 Comments
Walter Roberson
on 13 May 2022
When you say "contains" do you mean that you want to compare the entries for exact equality? Or should "sam" be considered to be contained in "assam" or "samuel"? Should the comparison be case sensitive? Should "Billie-Joe" be considered the same as "Billie Joe"?
Answers (2)
Steven Lord
on 13 May 2022
Use the string manipulation and/or set membership functions. Let's start with a random set of names:
rng default
D = ["Doc"; "Grumpy"; "Happy"; "Sleepy"; "Bashful"; "Sneezy"; "Dopey"];
randomDwarfs = D(randi(numel(D), [20 1]))
We can use matches to look for an exact match; startsWith, endsWith, or contains to look for text inside a name; or ismember to ask for members of a group of names.
onlyGrumpy = matches(randomDwarfs, "Grumpy");
startsWithD = startsWith(randomDwarfs, "D"); % Doc, Dopey
bashfulOrSleepy = ismember(randomDwarfs, ["Bashful"; "Sleepy"]);
results = table(randomDwarfs, onlyGrumpy, startsWithD, bashfulOrSleepy)
I can extract the names from randomDwarfs using those logical vectors.
BorS = randomDwarfs(bashfulOrSleepy)
0 Comments
See Also
Categories
Find more on Logical in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!