For "Train Reinforcement Learning Agent in Basic Grid World", is it posible to edit the Q-table with specific values before training begins to influence the robot's early decision making? I am attempting to create a path plan from a start to goal point and I want to provide the robot with some priori knowledge. I have run grid A* and want the Q-table to know this route before training.
I haven't been able to figure this out, so I tried to work around it by giving the robot positive rewards for going along the A* route. But the training ends up just exactly following the A*. Therefore, I want to just give the Q-table some initial values higher than 0 when starting so it probabilistically follows this route to the goal.