DSRNN_CrowdNav

This repository contains the codes for our paper titled "Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning" in ICRA 2021. For more details, please refer to the project website and arXiv preprint. For experiment demonstrations, please refer to the youtube video.

Please check out our more recent works in the following links:

Intention Aware Robot Crowd Navigation with Attention-Based Interaction Graph (with Sim2Real)
Occlusion-Aware Crowd Navigation Using People as Sensors
My curated paper list for robot social navigation (It is under active development)

Abstract

Safe and efficient navigation through human crowds is an essential capability for mobile robots. Previous work on robot crowd navigation assumes that the dynamics of all agents are known and well-defined. In addition, the performance of previous methods deteriorates in partially observable environments and environments with dense crowds. To tackle these problems, we propose decentralized structural-Recurrent Neural Network (DS-RNN), a novel network that reasons about spatial and temporal relationships for robot decision making in crowd navigation. We train our network with model-free deep reinforcement learning without any expert supervision. We demonstrate that our model outperforms previous methods in challenging crowd navigation scenarios. We successfully transfer the policy learned in the simulator to a real-world TurtleBot 2i.

Setup

Install Python3.6 (The code may work with other versions of Python, but 3.6 is highly recommended).
Install the required python package using pip or conda. For pip, use the following command:

pip install -r requirements.txt

For conda, please install each package in requirements.txt into your conda environment manually and follow the instructions on the anaconda website.

Install OpenAI Baselines.

git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .

Install Python-RVO2 library.

Getting started

This repository is organized in three parts:

crowd_sim/ folder contains the simulation environment. Details of the simulation framework can be found here.
crowd_nav/ folder contains configurations and non-neural network policies
pytorchBaselines/ contains the code for the DSRNN network and ppo algorithm.

Below are the instructions for training and testing policies.

Change configurations

Environment configurations and training hyperparameters: modify crowd_nav/configs/config.py

For FoV environment (left in the figure below): change the value of robot.FOV in config.py
For Group environment (right in the figure below): set sim.group_human to True in config.py

Run the code

Train a policy.

python train.py

Test policies.
Please modify the test arguments in the begining of test.py.
We provide two trained example weights for each type of robot kinematics:
- Holonomic: data/example_model/checkpoints/27776.pt
- Unicycle: data/example_model_unicycle/checkpoints/55554.pt

python test.py

Plot training curve.

python plot.py

(We only tested our code in Ubuntu 16.04 and 18.04 with Python 3.6.)

Learning curves

Learning curves of DS-RNN in 360 degrees FoV environment with 5 humans.

Citation

If you find the code or the paper useful for your research, please cite our paper:

@inproceedings{liu2020decentralized,
  title={Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning},
  author={Liu, Shuijing and Chang, Peixin and Liang, Weihang and Chakraborty, Neeloy and Driggs-Campbell, Katherine},
  booktitle={IEEE International Conference on Robotics and Automation (ICRA)},
  year={2021},
  pages={3517-3524}
}

Credits

Other contributors:
Peixin Chang
Neeloy Chakraborty

Part of the code is based on the following repositories:

[1] C. Chen, Y. Liu, S. Kreiss, and A. Alahi, “Crowd-robot interaction: Crowd-aware robot navigation with attention-based deep reinforcement learning,” in International Conference on Robotics and Automation (ICRA), 2019, pp. 6015–6022. (Github: https://github.com/vita-epfl/CrowdNav)

[2] I. Kostrikov, “Pytorch implementations of reinforcement learning algorithms,” https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail, 2018.

[3] A. Vemula, K. Muelling, and J. Oh, “Social attention: Modeling attention in human crowds,” in IEEE international Conference on Robotics and Automation (ICRA), 2018, pp. 1–7. (Github: https://github.com/jeanoh/big)

Contact

If you have any questions or find any bugs, please feel free to open an issue or pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
crowd_nav		crowd_nav
crowd_sim		crowd_sim
data		data
figures		figures
pytorchBaselines		pytorchBaselines
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
plot.py		plot.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

License

Shuijing725/CrowdNav_DSRNN

Folders and files

Latest commit

History

Repository files navigation

DSRNN_CrowdNav

Abstract

Setup

Getting started

Change configurations

Run the code

Learning curves

Citation

Credits

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Languages