When training an agent using the Reinforcement Learning Toolbox, how can I use a custom stopping criterion?

Question

DHRUV LAAD on 2 Jan 2020

Commented: goc3 on 14 Jul 2020

The current options allow only five predefined choices ("AverageSteps", "AverageReward", "EpisodeReward", "GlobalStepCount", "EpisodeCount"). I want to use a stopping criterion that is different from these. Is there any way to do this?

1 Comment

goc3 on 14 Jul 2020

I was about to ask a similar question... The "accepted" answer below doesn't actually answer the question; instead, it confirms that those are the only available stopping criteria.

It would be great if additional options and/or support for custom stopping criteria were added.

As an example, for a particular application, I would like to stop training once the episode reward plateaus. It is not known beforehand at what value it will plateau, so having to set a constant before training is very limiting for any application that is programmed to be dynamic or to proceed automatically based on training results.
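To make the plateau idea above concrete, here is a minimal sketch of a plateau check written in plain MATLAB. It is not part of the Reinforcement Learning Toolbox: the function name rewardHasPlateaued and the parameters windowLength and tolerance are hypothetical, chosen only for illustration. It compares the mean reward over the most recent window of episodes with the mean over the window before it. How such a check could be hooked into training is not something this thread confirms.

```matlab
function tf = rewardHasPlateaued(episodeRewards, windowLength, tolerance)
% rewardHasPlateaued  Hypothetical helper, not a toolbox function.
%   episodeRewards : vector of per-episode rewards collected so far
%   windowLength   : number of episodes in each comparison window (assumed)
%   tolerance      : largest change in mean reward still counted as flat (assumed)

n = numel(episodeRewards);

% Not enough history yet to fill two windows, so report "no plateau".
if n < 2 * windowLength
    tf = false;
    return
end

% Mean reward over the latest window and over the window just before it.
recentMean   = mean(episodeRewards(n - windowLength + 1 : n));
previousMean = mean(episodeRewards(n - 2*windowLength + 1 : n - windowLength));

% Treat the reward as plateaued when the two means differ by at most tolerance.
tf = abs(recentMean - previousMean) <= tolerance;
end
```

For example, rewardHasPlateaued([1 2 3 4 10 10 10 10 10 10], 3, 0.1) compares the mean of the last three rewards with the mean of the three before them, and both are 10, so the check reports a plateau. For a steadily rising sequence such as 1:10 with the same settings, the two means differ by 3 and the check reports none.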

Answer 1

Rajani Mishra on 6 Jan 2020

trainOpts = rlTrainingOptions(Name,Value) creates an option set for training using specified name-value pairs.

To define a stopping criterion for training an agent, specify the 'StopTrainingCriteria', 'StopTrainingValue', and 'MaxEpisodes' arguments.

StopTrainingCriteria: Specifies the termination condition. It takes one of the choices you mentioned.

StopTrainingValue: Specifies the critical value of the training termination condition. Training terminates when the quantity selected by the StopTrainingCriteria option equals or exceeds this value.

MaxEpisodes: Specifies the maximum number of episodes for training the agent. Training terminates once this number of episodes is reached.
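As an illustration of the three options described above, the following sketch builds an option set that stops training on a built-in criterion. The numeric values are assumptions picked for the example, not recommendations. The variables agent and env in the final comment are also assumed to exist already and are not defined here.

```matlab
% Requires Reinforcement Learning Toolbox.
% All numeric values below are illustrative assumptions.
trainOpts = rlTrainingOptions( ...
    'MaxEpisodes', 500, ...                      % hard cap on the number of episodes
    'MaxStepsPerEpisode', 200, ...               % cap on steps within one episode
    'ScoreAveragingWindowLength', 20, ...        % episodes averaged for AverageReward
    'StopTrainingCriteria', 'AverageReward', ... % one of the predefined choices
    'StopTrainingValue', 195);                   % stop once the average reward reaches 195

% With an existing agent and environment (assumed, not created here):
% trainingStats = train(agent, env, trainOpts);
```

With these settings, training ends when the average reward over the last 20 episodes equals or exceeds 195, or after 500 episodes, whichever comes first. The threshold still has to be fixed before training starts, which is the limitation the question is about.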

For more information please refer to

1 Comment

Tuwe Löfström on 13 Jul 2020

So there is no way to add a custom stopping criterion, similar to how you can define custom reset and step functions?
