How to get the policy function?

ryunosuke tazawa on 5 Jun 2022
I ran a pendulum simulation with reinforcement learning.
Now I would like to obtain the policy function of the trained controller.
In this case, the policy function is the torque (control output) that the controller produces for a given state (angle and angular velocity).
In addition, I want to tile the states (angle and angular velocity) like a Q-table.
In this case, which should I use, generatePolicyFunction or getAction?
Is this method correct? Or is there another way? Also, how can I save the network when using SAC (Soft Actor-Critic)?
clear all;
close all;
%% load the trained agent
load('k5_simplePendulum.mat','agent');
generatePolicyFunction(agent);
%% tiling the states (angle, angular velocity)
N = 5; % 5 divisions
NN = N*N;
Angle = linspace(-3.14,-4.71,N);
Velocity = linspace(0,-20,N);
State = combvec(Angle,Velocity); % combinations of states, 5x5 tiles
F = zeros(NN,1);  % policy function (torque predicted by the trained agent?)
for i = 1:NN
    F(i) = evaluatePolicy(State(:,i));
end

Answers (1)

Vidhi Agarwal on 4 Nov 2024 at 4:12
To find the policy function of your post-learning controller, you can use the trained agent to evaluate actions for the states you are interested in.
  • generatePolicyFunction: This function generates a standalone policy function from a trained reinforcement learning agent. It can be useful if you want to deploy the policy outside of the reinforcement learning environment or integrate it into a larger system (see the sketch after this list).
  • getAction: This method is used to obtain the action from the agent given a specific state. It is more straightforward for evaluating the policy in a simulation or analysis context.
For your purpose of evaluating the policy function (torque) for specific states (angle and angular velocity), getAction is more appropriate. It allows you to directly query the agent for actions based on the states you specify.
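For reference, here is a minimal sketch of the generatePolicyFunction route, assuming the agent file name from your script and that the observation is the column vector [angle; angular velocity]:
% Generate a standalone policy from the trained agent; this writes an
% evaluatePolicy.m file (plus a policy data file) to the current folder
load('k5_simplePendulum.mat','agent');
generatePolicyFunction(agent);
% The generated function maps one observation to one action (the torque here)
torque = evaluatePolicy([-3.14; 0]);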
For a better understanding of these functions, refer to the generatePolicyFunction and getAction pages in the Reinforcement Learning Toolbox documentation.
If you are using the "Soft Actor-Critic" (SAC) algorithm, the agent consists of both actor and critic networks. You can save the trained agent using the save function in MATLAB; this saves the entire agent, including its policy (actor network) and value function (critic network).
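As a minimal sketch (the file name here is illustrative), you can save the agent and later retrieve its actor network with getActor and getModel:
% Save the whole SAC agent (actor and critic networks included)
save('trainedSACAgent.mat','agent');
% In a later session: reload the agent and extract the actor (policy) network
s = load('trainedSACAgent.mat','agent');
actor = getActor(s.agent);   % actor representation of the SAC agent
actorNet = getModel(actor);  % underlying network (a dlnetwork in recent releases)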
A revised version of your code is given below:
% Load the trained agent
load('k5_simplePendulum.mat', 'agent');
% Define the state space (angle and angular velocity)
N = 5; % Number of divisions
Angle = linspace(-3.14, -4.71, N);
Velocity = linspace(0, -20, N);
[AngleGrid, VelocityGrid] = meshgrid(Angle, Velocity);
State = [AngleGrid(:), VelocityGrid(:)]; % Combination of states
% Preallocate the policy function output
F = zeros(size(State, 1), 1); % Policy function (torque predicted by the trained agent)
% Evaluate the policy for each state
for i = 1:size(State, 1)
    % getAction expects the observation in a cell array; pass the state as a column vector
    action = getAction(agent, {State(i, :)'});
    F(i) = action{1};
end
% Save the trained agent
save('trainedAgent.mat', 'agent');
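If you want to inspect the resulting policy over the tiled state space, one optional follow-up (reusing the variables defined above) is to reshape F back onto the grid and plot it:
% Reshape the evaluated torques back onto the N-by-N grid and plot the
% policy surface over angle and angular velocity
Fgrid = reshape(F, N, N);
surf(AngleGrid, VelocityGrid, Fgrid);
xlabel('Angle (rad)');
ylabel('Angular velocity (rad/s)');
zlabel('Torque');
title('Learned policy over the tiled state space');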
For better understanding of "SAC" algortihm, refer to the following documentation.
Hope that helps!
