Eliminating duplicate columns in a table when appending

Hi,
I am importing data using readtable function. but when i append data using:
mytable=[mytable mytable2];
I receive an error: Duplicate variable name: 'ID' because every file contains the ID of the firm. I need the column ID only once. so how can i skip the ID column in mytable2 when i want to append?

 Accepted Answer

Use the join command. Here's a simple example.
% Set up some sample data
id = [1 2 3]';
firstName = {'Walter','Image','Star'}';
lastName = {'Roberson','Analyst','Strider'}';
tableFirst = table(id,firstName);
tableLast = table(id,lastName);
% Join the tables
tableFirstAndLast = join(tableFirst,tableLast)
FYI, there are many input options that govern how the variables in the two tables are retained when there are duplicates, so I suggest carefully reading the documentation (linked above) if this example doesn't do exactly what you want in your case.
You might also need a different function from that family, e.g. innerjoin.

6 Comments

Moved Danielle's comment here, rather than being an "answer":
for some tables i am receiving an error Error using tabular/join (line 163) The key variables for A and B cannot contain any missing values.
Error in trial (line 7) a2=join(t7,t8);
Can you upload a MAT file with the two input tables? (Or a simplified example that illustrates the problem, and the output you want.)
thanks. Attached the file. I have 45 tables and i want to merge them in one. the number of rows is the same and in the same order. So how can I append vertically all these tables? I was merging two by two using join.it worked for some and didn't work for others. I removed most of the tables in order to upload the file since there is a 5m cap
The proximate cause is that some of the tables contain a row with NaN as the value of IDRSSD. Because one NaN is not equal to another NaN, MATLAB cannot join those rows.
One simple solution might be to replace those values of NaN with some other unique identifier. (It looks like setting IDRSSD=0 would work.)
You could also use the innerjoin() function, as I mentioned earlier, which will only use matching rows.
a2=innerjoin(t7,t8);
Or you could manually avoid that first row for those tables:
a2 = join(t7(2:end,:),t8(2:end,:))
but that seems pretty inelegant.
The approach to take will depend on exactly what output you want, and what is more readable and efficient.
Can I use this function if I need to erge two tables having more that one columns in common?

Sign in to comment.

More Answers (0)

Categories

Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!