Selection of Neural Network Training Data

Question

Kamuran Turksoy on 4 May 2017

0
Link

Direct link to this question

https://se.mathworks.com/matlabcentral/answers/338795-selection-of-neural-network-training-data

Answered: Greg Heath on 5 May 2017

One can divide his/her data into training, validation and testing and use them to train a neural network model (regression in my case). My question is, what if there are some data points in the training set that impair the model performance? Are there any good ways to find such data points and remove them from the training data set?

I was thinking of using something similar to cross-validation (leave one out) as:

1. Leave a data point from training set

2. Train the model with the rest of the training set

3. If there is improvement in error of the validation (or testing) sets discard the point.

4. Repeat this for all data points until no more improvement is observed.

There are two problems with this method:

1. It will take a long time for large data sets.

2. Random initial weights will add complexity on discarding data points. Constant initial values with a seed value may not be optimum set to begin with.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Greg Heath on 5 May 2017

0
Link

Direct link to this answer

https://se.mathworks.com/matlabcentral/answers/338795-selection-of-neural-network-training-data#answer_265868

Before learning, obtain the mean and standard deviations of the input and target variables. Overlay the plots of the variables on lines of mean +/- m*std for m= 1:4.

Remove or modify outliers.

Hope this helps

Thank you for formally accepting my answer

Greg.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Selection of Neural Network Training Data

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

Selection of Neural Network Training Data

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments