How can I access the action output of the actor network in DDPG during training?

Question

Maha Mosalam on 2 Dec 2021

Direct link to this question

https://se.mathworks.com/matlabcentral/answers/1601730-how-i-can-access-the-action-output-of-the-actor-network-in-ddpg-during-training

Answered: Yash on 24 Dec 2024

I want to access the action output of the actor network in DDPG during training, because I want to forcibly replace it with another action, optimized by a separate function, in order to accelerate training and improve the actor's learning efficiency. Is there any way to do this? I would be thankful for any help.


Answer 1

Yash on 24 Dec 2024

Direct link to this answer

https://se.mathworks.com/matlabcentral/answers/1601730-how-i-can-access-the-action-output-of-the-actor-network-in-ddpg-during-training#answer_1556358

You can use the getAction function, which returns the action from an agent, actor, or policy object given environment observations. To change the action during training, write a custom training loop: define a custom loss function that calls getAction and dlgradient directly, then evaluate it with dlfeval, optionally accelerated with dlaccelerate. For examples, see "Train Reinforcement Learning Policy Using Custom Training Loop" and "Custom Training Loop with Simulink Action Noise".
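As a rough illustration of the getAction step, the sketch below queries the actor of a default DDPG agent for one observation and then replaces the result with an externally computed action. The observation and action sizes, the random placeholder observation, and overrideFcn (a stand-in for the separate optimizing function mentioned in the question) are all assumptions made for this example, not part of the original answer. It needs Reinforcement Learning Toolbox and a release that supports default agents.

```matlab
% Assumed specs for illustration: 4-element observation, scalar action in [-1, 1]
obsInfo = rlNumericSpec([4 1]);
actInfo = rlNumericSpec([1 1], 'LowerLimit', -1, 'UpperLimit', 1);

% Default DDPG agent built from the specs, and its actor
agent = rlDDPGAgent(obsInfo, actInfo);
actor = getActor(agent);

% Query the actor; observations go in as a cell array and
% the action comes back as a cell array
obs = rand(4, 1);                       % placeholder observation
actionCell = getAction(actor, {obs});
actorAction = actionCell{1};

% Hypothetical override: swap in your own optimizer here.
% This stand-in halves the actor output and clips it to the action limits.
overrideFcn = @(o, a) max(min(0.5 * a, actInfo.UpperLimit), actInfo.LowerLimit);
appliedAction = overrideFcn(obs, actorAction);
```

In a custom training loop, appliedAction would be the value passed to the environment and stored with the experience, in place of actorAction. Whether forcing the action this way actually speeds up learning depends on the problem.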
