This example shows how to assign normalizer objects to the actor and critics of a DDPG agent.
Create DDPG Agent and Extract Actor and Critic
Create specification objects to define observation and action channels. For this example, the agent has two observation channels. The first channel has a vector with three elements and the second channel has a vector with four elements.
The action channel carries a two-dimensional vector.
Create a default DDPG agent.
Extract the approximator objects.
Create Normalizer Objects
Create one normalizer object for each input channel. DDPG agents use a Q-value function critic, which requires both the observation and the action as inputs. The Mean
, StandardDeviation
, Min
, and Max
properties apply to each element of the channel.
Create the normalizer for the first observation channel.
obs1Nrz =
rlNormalizer with properties:
Normalization: "zscore"
Mean: 2
StandardDeviation: 3
Create the normalizer for the second observation channel.
obs2Nrz =
rlNormalizer with properties:
Normalization: "zerocenter"
Mean: 4
Create the normalizer for the action input channel of the Q-value function critic.
actInNrz =
rlNormalizer with properties:
Normalization: "rescale-symmetric"
Min: -2
Max: 2
To check how the normalizer works on an input, use normalize
.
Assign Normalizer Objects to Actor and Critic
To assign new normalizers to the actor and critic, use setNormalizer
.
You can also assign normalizers to selected channels only. For example, to assign normalizers only to the first observation channel (in the order specified by obsInfo
) and the action channel, use an indices vector.
critic = setNormalizer(critic, [obs1Nrz actInNrz], [1 3]);
Display the normalization properties of the actor and critic.
ans = 1×2 string
"zscore" "zerocenter"
ans = 1×3 string
"zscore" "zerocenter" "rescale-symmetric"
Assign Actor and Critic to Agent
To assign the new actor and critic to the agent, use setActor
and setCritic
.
To check that agent works, use getAction
.
Create DQN Agent and Extract Critic
Define a mixed observation space and a discrete action channel.
Create a default DQN agent.
Extract the agent critic.
Extract Normalizer Objects from Actor or Critic
Use getNormalizer
to extract an array of normalizer objects from the agent critic.
crtNrz =
1×2 rlNormalizer array with properties:
Dimension
Normalization
Mean
StandardDeviation
Min
Max
Modify Normalizer Objects Using Dot Notation
Use dot notation to access and change the property of the second normalizer object (associated with the second observation channel).
Assign the modified normalizer to the critic and assign the critic to the agent.
To check that the agent works, use getAction
.