Testing/training set

How do I divide my data into testing and training sets? What is the easiest way to check whether a sample belongs to a particular class or set? My data is saved in Excel.

 Accepted Answer

Junaid on 23 Apr 2012

1 vote

It depends on the type of dataset. One well-known technique is cross-validation.
Alternatively, simply use 60% of the dataset for training and 40% for testing, or make the split iterative, as is done in cross-validation.
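Both options above can be sketched in a few lines. This is a minimal illustration in Python using only the standard library, not a definitive implementation; the names `holdout_split` and `kfold_splits` are hypothetical, and it assumes the Excel data has already been loaded so that each sample is identified by its row index. Keeping the test indices in a set also gives a simple way to check which set a sample belongs to.

```python
import random

def holdout_split(n_samples, train_fraction=0.6, seed=0):
    """Return (train_idx, test_idx), a random split of range(n_samples)."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)                      # random order, so the split is unbiased
    n_train = int(round(train_fraction * n_samples))
    return sorted(indices[:n_train]), sorted(indices[n_train:])

def kfold_splits(n_samples, k=5, seed=0):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k roughly equal folds
    for i in range(k):
        test_idx = sorted(folds[i])
        train_idx = sorted(j for f, fold in enumerate(folds) if f != i for j in fold)
        yield train_idx, test_idx

# 60/40 split of 100 samples (the sample count is made up for illustration).
train_idx, test_idx = holdout_split(100)
test_set = set(test_idx)

print(len(train_idx), len(test_idx))
print("sample 7 is in the test set:", 7 in test_set)
```

Fixing the seed makes the split reproducible, which helps when comparing models on the same partition.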

More Answers (1)

Walter Roberson on 23 Apr 2012

0 votes

You should divide randomly.
If there were some feature you could check that distinguished the test set from the training set, then you would not have done a good job of selecting the training set. The training set should be representative of all of the data.
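One rough way to see whether a random split is representative is to compare class proportions in the training set against the full dataset. The sketch below is only an illustration under stated assumptions: the labels `"A"` and `"B"` are made up, the helper `class_fractions` is hypothetical, and similar class proportions are a necessary sign of a representative split, not a sufficient one.

```python
import random
from collections import Counter

def class_fractions(labels):
    """Return the fraction of samples belonging to each class."""
    counts = Counter(labels)
    total = len(labels)
    return {cls: n / total for cls, n in counts.items()}

# Made-up labels: 50 samples of class "A" followed by 50 of class "B".
labels = ["A"] * 50 + ["B"] * 50

# Random 60% training selection.
rng = random.Random(0)
indices = list(range(len(labels)))
rng.shuffle(indices)
train_idx = indices[:60]
train_labels = [labels[i] for i in train_idx]

print("all data:", class_fractions(labels))
print("training:", class_fractions(train_labels))
```

If the two sets of fractions differ a lot, reshuffling or splitting each class separately are common remedies.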

1 Comment

Hey... I have a database of 100 subjects, 50 male and 50 female. Each subject has 6 images. So far I took just 1 image per subject (100 images), extracted 100 feature vectors, and was able to cross-validate the data. But I want to use all the images of each subject.
If I take all 6 images per subject, I get 600 images and can cross-validate. But that would not give a correct result, because images from the same subject would end up in the testing and training sets at the same time.
So my idea is that all 6 feature vectors obtained from a subject should go to either the testing or the training data, never both. But I am unable to implement it. How do I fix this? I need your help.
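One way to get this behaviour is to partition the subjects rather than the images, and then send every image to the fold of its subject. The following is a minimal sketch in Python under stated assumptions: `group_kfold` is a hypothetical helper, and `subject_ids[i]` is assumed to hold the subject number of the i-th feature vector. The same idea carries over to any language: split the 100 subject IDs, not the 600 rows.

```python
import random

def group_kfold(subject_ids, k=5, seed=0):
    """Yield (train_idx, test_idx) so that no subject appears on both sides."""
    rng = random.Random(seed)
    subjects = sorted(set(subject_ids))
    rng.shuffle(subjects)                              # shuffle subjects, not images
    subject_folds = [set(subjects[i::k]) for i in range(k)]
    for held_out in subject_folds:
        test_idx = [i for i, s in enumerate(subject_ids) if s in held_out]
        train_idx = [i for i, s in enumerate(subject_ids) if s not in held_out]
        yield train_idx, test_idx

# 100 subjects with 6 images each, as in the comment above:
# feature vector i belongs to subject subject_ids[i].
subject_ids = [subject for subject in range(100) for _ in range(6)]

splits = list(group_kfold(subject_ids, k=5))
for train_idx, test_idx in splits:
    train_subjects = {subject_ids[i] for i in train_idx}
    test_subjects = {subject_ids[i] for i in test_idx}
    assert train_subjects.isdisjoint(test_subjects)    # no subject leaks across

print(len(splits), len(splits[0][0]), len(splits[0][1]))  # prints: 5 480 120
```

Note that this sketch does not balance male and female subjects within each fold; if that matters, the male and female subject lists could be partitioned separately and the folds merged.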


Asked: 23 Apr 2012
Commented: 22 Aug 2016
