Using RL, how can I train multiple agents so that each agent navigates from its initial position to its goal position while avoiding collisions?

Let's assume there is a set of agents distributed in 3-D Cartesian space. A trajectory should be generated for each agent such that, if the agent follows its trajectory while heading to its goal waypoint, no collision occurs with the other agents. Any guidance on solving such a task would be highly appreciated.

Answers (1)

Emmanouil Tzorakoleftherakis
Edited: Emmanouil Tzorakoleftherakis on 5 Mar 2021
It's possible that the scenario you described can be solved by training a single agent and then "deploying" that trained agent to all UAVs/UUVs in your fleet. That would make the problem simpler and less expensive to train. For a 2D example, take a look at this.
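As a framework-agnostic illustration of that idea (a minimal Python/NumPy sketch, not Reinforcement Learning Toolbox API; `trained_policy` is a hypothetical stand-in for a policy produced by RL training), one policy is trained once and then every agent in the fleet runs its own copy of it on its own local observation:

```python
import numpy as np

def trained_policy(observation):
    """Stand-in for a trained policy: steer straight toward the goal.
    In practice this would be the network learned by an RL algorithm."""
    to_goal = observation[:2]               # relative goal position
    norm = np.linalg.norm(to_goal)
    return to_goal / norm if norm > 0 else np.zeros(2)

def step_fleet(positions, goals, dt=0.1):
    """Every agent runs the SAME policy, each on its own egocentric input."""
    new_positions = positions.copy()
    for i in range(len(positions)):
        obs = goals[i] - positions[i]       # local observation: vector to goal
        velocity = trained_policy(obs)      # shared policy, per-agent input
        new_positions[i] = positions[i] + dt * velocity
    return new_positions

positions = np.array([[0.0, 0.0], [5.0, 5.0]])
goals     = np.array([[1.0, 0.0], [5.0, 4.0]])
positions = step_fleet(positions, goals)    # each agent moves toward its goal
```

Because the policy only ever sees egocentric inputs, nothing about it is tied to a particular agent, which is what makes the "train once, deploy to all" approach work.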
Emmanouil Tzorakoleftherakis
I think it's a matter of what inputs you provide to the policy and the coordinate system you use (although I was imagining a scenario where each agent has its own sensors). If you only have odometry data from all agents, you could transform it into the distance to each nearby agent (probably including heading/bearing as well) and feed all of that information into the policy.
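That transformation can be sketched in plain Python/NumPy (illustrative only; the function name and the `max_range` cutoff are assumptions, not part of any toolbox): from each agent's odometry-derived pose, compute the distance and bearing to every other nearby agent and concatenate them into the observation vector.

```python
import numpy as np

def relative_observation(ego_pos, ego_heading, others, max_range=10.0):
    """Egocentric observation from odometry: (distance, bearing) to each
    other agent within max_range, bearing measured from the ego heading."""
    features = []
    for p in others:
        delta = np.asarray(p) - np.asarray(ego_pos)
        dist = np.linalg.norm(delta)
        if dist <= max_range:
            bearing = np.arctan2(delta[1], delta[0]) - ego_heading
            bearing = (bearing + np.pi) % (2 * np.pi) - np.pi  # wrap to [-pi, pi]
            features.append((dist, bearing))
    # sort nearest-first so the policy always sees a consistent ordering
    features.sort(key=lambda f: f[0])
    return np.array(features).ravel()

# Ego agent at the origin facing +x; two neighbors in range, one far away.
obs = relative_observation([0.0, 0.0], 0.0,
                           [[3.0, 0.0], [0.0, 4.0], [50.0, 0.0]])
```

Fixing the ordering (and, in practice, padding to a fixed number of neighbors) keeps the observation dimension constant, which most RL agent implementations require.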
Steve Jessy on 6 Mar 2021
The coordinate system in which the agents act is a 2D Cartesian coordinate system. Yes, I can access the distance from an agent to all the other agents in the space. Could you kindly provide an example/code in which the multi-agent system is trained on odometry data?

