when using rlCreateEnvTemplate("MyEnvironment") to create a custom template I came across this line in the reset function:
InitialObservation = [T0;Td0;X0;Xd0];
Initial states seem to be reversed, I believe they should be [X0;Xd0;T0;Td0]?
the same seems to apply to the example loaded with openExample('rl/MATLABCartPoleDQNExample'), although I cannot see the reset function, the example gives the same results as the template and when I tried validating these two against the environments created with openExample('rl/CreateMATLABEnvironmentUsingCustomFunctionsExample') they yield different initial states.
but I'm still kinda new to this and I'm afraid to be missing something here. Could you please clarify?