How the RL agent knows that it has to provide 'action' as 'FLOW' because none of my input or observations is flow? What decides the output of action?
1 view (last 30 days)
Vimal Rathod on 27 Feb 2020
The details are given properly in the link which you have sent. In the example, input is a scalar variable which indicates the amount of flow (Which is Action) and observation is a 1x3 vector which describes the rise or fall in water level.
You could refer to the following link to know what are the values there in observation and action.
To know more about rlNumericSpec which describes a datatype having a range of numeric values refer the following link:
Hope this helps!