Flock Multi Agent RL Environment

Multi-agent RL environment based on Boids, implemented with JAX

The environment is based on popular boids model where agents recreate flocking behaviours based on simple interaction rules. The environment implements boids as a multi-agent reinforcement problem where each boid takes individual actions and have a individual localised view of the environment.

This environment has been built around the gymnax API (a JAX version of the popular RL Gym API):

import flock_env
import jax

key = jax.random.PRNGKey(101)
key_reset, key_act, key_step = jax.random.split(key)

# Initialise a flock environment with 10 agents
env = flock_env.SimpleFlockEnv(
    reward_func=flock_env.rewards.exponential_rewards,
    n_agents=10
)
env_params = env.default_params

# Reset the environment and get state and observation
obs, state = env.reset(key_reset, env_params)
# Sample random action for agents
actions = jax.random.uniform(key_act, (10, 2))
# Step the environment
new_obs, new_state, rewards, dones, _ = env.step_env(
    key_step, state, actions, env_params
)

Usage

See examples/ppo_example.ipynb for an example of training a Proximal-Policy-Optimisation based agent with this environment (using my JAX implementation of PPO).

The package can and requirements can be installed using poetry by running

poetry install

⚠️ Generating observations currently compares all pairs of agents so performance scales as $n^2$ with the number of agents. This means performance may not be great past hundreds of agents.

TODO

More complex observation spaces, e.g. ray-casting view model
Objects/obstacles in the environment
More efficient agent observation generation

Previous Version

The previous version of this project built around Numba can be found in /deprecated

Developers

Pre-Commit Hooks

Pre commit hooks can be installed by running

pre-commit install

Pre-commit checks can then be run using

task lint

Tests

Tests can be run with

task test

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
.github		.github
deprecated		deprecated
examples		examples
flock_env		flock_env
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
poetry.lock		poetry.lock
poetry.toml		poetry.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github

.github

deprecated

deprecated

examples

examples

flock_env

flock_env

tests

tests

.gitignore

.gitignore

.pre-commit-config.yaml

.pre-commit-config.yaml

README.md

README.md

poetry.lock

poetry.lock

poetry.toml

poetry.toml

pyproject.toml

pyproject.toml

Repository files navigation

Flock Multi Agent RL Environment

Usage

TODO

Previous Version

Developers

Pre-Commit Hooks

Tests

About

Languages

zombie-einstein/flock_env

Folders and files

Latest commit

History

Repository files navigation

Flock Multi Agent RL Environment

Usage

TODO

Previous Version

Developers

Pre-Commit Hooks

Tests

About

Topics

Resources

Stars

Watchers

Forks

Languages