Reinforcement Learning Toolbox: Episode Q0 does not change ., DDPG agent

4 views (last 30 days)
Hi :-)
I am training my ddpg agent with matlab.
I saw something weird with my trainig graph at reinforcement learning episode manager.
this is my plot, and as you can see,, Q0 value never follows or go near the average reward.
Does this mean that there is something wrong with my critic network? but I can see the traing procedure is working quite properly I guess,,,
Please help!!

Answers (1)

Berk Agin
Berk Agin on 3 Apr 2022
Hello,
I also see that this problem on my training data. I couldn't understand and wonder the solution. Have a nice day.

Products


Release

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!