question about external action of DDPG

1 view (last 30 days)
Is anyone know the loss function of the Q-network when I set external action=1 during training process?(DDPG)

Accepted Answer

Emmanouil Tzorakoleftherakis
The loss function does not change. What happens is that the experience buffer is populated with the action from the external signal and the respective observations/reward.

More Answers (0)

Products


Release

R2021b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!