PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning - Lifelong

Setting up Code

  • cd into the od_mstar3 folder.
  • python3 setup.py build_ext --inplace
  • Verify the build by returning to the root of the repository, launching python3, and running import cpp_mstar (see the check below).
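
A quick way to confirm the build, assuming you run it from the repository root (the try/except wrapper is purely illustrative, not part of the repo):

```python
# Minimal import check for the compiled ODrM* extension (illustrative only).
# Run from the root of the repository after building od_mstar3 in place.
try:
    import cpp_mstar  # the extension built by setup.py above
    print("cpp_mstar imported successfully")
except ImportError as err:
    print(f"cpp_mstar not found - rebuild od_mstar3 or check your working directory: {err}")
```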

Running Code

  • Pick an appropriate number of meta-agents via the NUM_META_AGENTS and NUM_IL_META_AGENTS variables in parameters.py (an example configuration is shown after this list).
  • The number of RL meta-agents is implicitly defined as the difference between the total and IL meta-agents (NUM_RL_META_AGENTS = NUM_META_AGENTS - NUM_IL_META_AGENTS).
  • Name the training run via training_version in parameters.py.
  • Run python driver.py.
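
As an illustration, a run with 8 meta-agents, 2 of which do imitation learning, might be configured in parameters.py roughly as follows (the variable names come from the steps above; the run name is hypothetical and the real file contains many more settings):

```python
# parameters.py (illustrative excerpt, not the full file)
NUM_META_AGENTS = 8         # total meta-agents spawned by driver.py
NUM_IL_META_AGENTS = 2      # meta-agents dedicated to imitation learning
# RL meta-agents are implicit: NUM_META_AGENTS - NUM_IL_META_AGENTS = 6
training_version = 'primal2_training_run1'   # hypothetical run name for logs/checkpoints
```

With these values saved, python driver.py starts the training run.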

Frequently asked questions

  1. I got pyglet.canvas.xlib.NoSuchDisplayException: Cannot connect to "None" when running on a server.

Launching your program with xvfb-run (e.g. xvfb-run -a python driver.py) solves the problem. You may refer to https://stackoverflow.com/questions/60922076/pyglet-canvas-xlib-nosuchdisplayexception-cannot-connect-to-none-only-happens and related Stack Overflow issues for help.

  2. In the one-shot environment, why does an agent turn black after reaching its goal?

In the one-shot scenario, an agent 'disappears' (i.e., is removed from the environment) once it reaches its goal; for visualization we render it in black. Removing agents that have reached their goals is necessary because the many narrow corridors in the map could otherwise cause unsolvable blockages and collisions. The one-shot scenario is simply a way to test the optimality of the planner. By contrast, agents are never removed, for any reason, in the continuous environment.

Key Files

  • parameters.py - Training parameters.
  • driver.py - Main driver of the program. Holds the global network for A3C (see the sketch after this list).
  • Runner.py - Compute node for training. Maintains a single meta-agent.
  • Worker.py - A single agent in a simulation environment. Majority of episode computation, including gradient calculation, occurs here.
  • Ray_ACNet.py - Defines network architecture.
  • Env_Builder.py - Defines the lower-level structure of the Lifelong MAPF environment for PRIMAL2, including the world and agent classes.
  • PRIMAL2Env.py - Defines the high-level environment class.
  • Map_Generator2.py - Algorithm used to generate worlds, parameterized by world size, obstacle density and wall components.
  • PRIMAL2Observer.py - Defines the decentralized observation of each PRIMAL2 agent.
  • Obsever_Builder.py - The high-level observation class.
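
To make the division of labour among these files concrete, below is a heavily simplified, self-contained sketch of an A3C-style training loop in the spirit of driver.py, Runner.py and Worker.py. All class and method names are illustrative stand-ins, not the repo's actual API:

```python
# Illustrative A3C-style control flow (hypothetical names, not the repo's API).
# driver.py holds the global network; each Runner maintains one meta-agent whose
# Worker computes gradients that are applied back to the global parameters.

class GlobalNetwork:
    """Stand-in for the A3C global network held by driver.py."""
    def __init__(self):
        self.weights = [0.0]

    def apply_gradients(self, grads):
        # Simple gradient step on the shared parameters.
        self.weights = [w - 0.01 * g for w, g in zip(self.weights, grads)]


class Worker:
    """Stand-in for Worker.py: runs episodes and computes gradients locally."""
    def run_episode_and_compute_gradients(self, weights):
        return [0.1 for _ in weights]  # placeholder gradient


class Runner:
    """Stand-in for Runner.py: maintains a single meta-agent (one Worker here)."""
    def __init__(self):
        self.worker = Worker()

    def step(self, weights):
        return self.worker.run_episode_and_compute_gradients(weights)


global_net = GlobalNetwork()
runners = [Runner() for _ in range(4)]   # e.g. NUM_META_AGENTS = 4
for _ in range(3):                       # a few training iterations
    for runner in runners:
        grads = runner.step(global_net.weights)
        global_net.apply_gradients(grads)  # asynchronous in the real code
print(global_net.weights)
```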

Other Links

Authors

Mehul Damani

Zhiyao Luo

Emerson Wenzel

Guillaume Sartoretti
