
Visually Guided Rearrangement Planning

Code for Y. Labbé, S. Zagoruyko, I. Kalevatykh, I. Laptev, J. Carpentier, M. Aubry and J. Sivic, "Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning", IEEE Robotics and Automation Letters, 2020.

[arXiv] [Project Page] [Video]

[Figure: rearrangement planning teaser]

We provide the training and evaluation code, along with a pretrained model, for extracting object positions from RGB images with a UR5 robot.

We used the predicted object positions to perform visually guided rearrangement planning, but they can also be used for various other tasks, such as building a pyramid with multiple cubes:

[Figure: pyramid built with multiple cubes]

The synthetic dataset generation code, the rearrangement planning simulation environment and the MCTS planning code will be added later.

Installation

The code requires PyTorch 1.0.1 and Python 3.6+. After installing PyTorch, run:

pip install -r requirements.txt

Download data

You may use the code from this repository to predict object positions relative to the robot on new images using a pretrained model, to reproduce the results from the paper on the real evaluation dataset, or to train a new model on the synthetic training dataset. We provide the following data:

  • A pretrained model.
  • The real evaluation dataset with 1200 images, captured from two different viewpoints (we used the RGB images from a Kinect 1 and a Kinect 2 camera).
  • The training dataset, which contains 2 million synthetic images (50 GB).

You can download all (or parts of) the data with:

python download.py --model --eval --train

Downloading and unpacking the training data may take a while depending on your connection and machine.

Tests

You can check that the dependencies are installed and the data is downloaded correctly by running the tests for the parts of the code you plan to use:

pytest -v rearrangement/test -m 'model or eval or train' --disable-pytest-warnings

Using the pretrained model

[Figure: predictions]

We provide a pretrained model for the UR5 robot equipped with the 3-Finger Adaptive Robot Gripper. If you have the same configuration, you can use this model to predict the following from one RGB camera pointed at the robot (without any additional calibration):

  • The 2D coordinates of individual objects in the workspace (a 40 cm x 40 cm area in front of the robot).
  • Semantic segmentation of the robot and gripper, and instance segmentation of the objects.
  • A depth map (not used in our experiments).

This information can be plugged into standard planning tools to perform various tasks.
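As a rough illustration, the sketch below shows what a prediction call on a single frame can look like. The checkpoint loading step is only hinted at, and the commented output description simply restates the quantities listed above; the repository's actual loading code and output format are shown in the Visualization notebook mentioned below.

# Minimal inference sketch. The checkpoint path and the commented output
# description are assumptions for illustration; see the Visualization
# notebook for the repository's actual loading code and output format.
import torch
from PIL import Image
import torchvision.transforms.functional as TF

# A real run would load the pretrained checkpoint downloaded with --model
# (e.g. under data/models/state-prediction-71538); loading is not shown here.
model = torch.nn.Identity()  # stand-in so this sketch is self-contained
model.eval()

image = Image.new('RGB', (640, 480))   # replace with a real workspace frame
x = TF.to_tensor(image).unsqueeze(0)   # (1, 3, H, W), values in [0, 1]

with torch.no_grad():
    outputs = model(x)

# With the real model, the outputs correspond to the quantities listed above:
#   - 2D object positions in the 40 cm x 40 cm workspace (robot frame)
#   - robot/gripper semantic masks and per-object instance masks
#   - a dense depth map (unused in the paper's experiments)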

For example, you can use the predictions on one image to extract the position of multiple cubes and build a pyramid:

[Figure: pyramid built from predicted cube positions]
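To make the pyramid example concrete, here is a small, self-contained sketch that turns a set of predicted cube positions into pick-and-place targets for a pyramid. The positions, cube size and base location are made-up example values, and the layout logic is an illustration rather than the planning code used in the paper.

# Toy example: turn predicted cube positions into pyramid placement targets.
# The positions, cube size and base location below are made-up values.
import numpy as np

cube_size = 0.05  # cube edge length in meters (example value)
# Predicted (x, y) cube positions in the robot frame, e.g. from the model above.
positions = np.array([[0.45, -0.10], [0.50, 0.05], [0.42, 0.12],
                      [0.55, -0.05], [0.48, 0.15], [0.52, 0.10]])

def pyramid_targets(n_cubes, base_xy, cube_size):
    """Target (x, y, z) placements: widest layer at the bottom, one fewer cube per layer."""
    targets = []
    layer = int((np.sqrt(8 * n_cubes + 1) - 1) // 2)  # width of the bottom layer
    z = cube_size / 2
    placed = 0
    while layer > 0 and placed < n_cubes:
        for i in range(min(layer, n_cubes - placed)):
            x = base_xy[0] + (i - (layer - 1) / 2) * cube_size
            targets.append((x, base_xy[1], z))
        placed += min(layer, n_cubes - placed)
        layer -= 1
        z += cube_size
    return targets

targets = pyramid_targets(len(positions), base_xy=(0.50, 0.0), cube_size=cube_size)
# Each (predicted position, target) pair becomes one pick-and-place motion.
for start, goal in zip(positions, targets):
    print(f"pick cube at {start} -> place at {goal}")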

We provide the notebook rearrangement/notebooks/Visualization, which makes predictions and visualizes them on frames from the evaluation dataset. You can use it as a starting point for applying the model to your own data.

This notebook also shows how to use AlexNet features for matching patches between a source and a target image.
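If you only want the gist of that matching step, the sketch below matches patches using torchvision's AlexNet features and cosine similarity. The patch size, feature layer and matching criterion are illustrative choices and may differ from what the notebook actually does.

# Sketch: match object patches between a source and a target image using
# AlexNet features and cosine similarity. Patch size, feature layer and the
# matching criterion are illustrative choices (the notebook may differ).
import torch
import torch.nn.functional as F
import torchvision.models as models

alexnet = models.alexnet(pretrained=True).eval()
extractor = alexnet.features  # convolutional part of AlexNet

_MEAN = torch.tensor([0.485, 0.456, 0.406]).view(1, 3, 1, 1)
_STD = torch.tensor([0.229, 0.224, 0.225]).view(1, 3, 1, 1)

def patch_descriptor(patch):
    """Feature descriptor for a (3, H, W) patch tensor with values in [0, 1]."""
    x = patch.unsqueeze(0)
    x = F.interpolate(x, size=(224, 224), mode='bilinear', align_corners=False)
    x = (x - _MEAN) / _STD                  # ImageNet normalization
    with torch.no_grad():
        feats = extractor(x)                # (1, 256, 6, 6)
    return feats.flatten(1)                 # (1, 9216)

# Random tensors stand in for patches cropped around the predicted positions.
source_patches = [torch.rand(3, 64, 64) for _ in range(4)]
target_patches = [torch.rand(3, 64, 64) for _ in range(4)]

src = torch.cat([patch_descriptor(p) for p in source_patches])  # (4, 9216)
tgt = torch.cat([patch_descriptor(p) for p in target_patches])  # (4, 9216)

# Cosine similarity between all source/target pairs, greedy argmax per source patch.
sim = F.normalize(src, dim=1) @ F.normalize(tgt, dim=1).T
print(sim.argmax(dim=1))  # best-matching target patch index for each source patch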

Reproducing the results from the paper

[Figure: real evaluation dataset]

You can reproduce the numbers from the paper, which are reported on the real evaluation dataset, by running:

python -m rearrangement.eval_on_real --run_id state-prediction-71538 --log_dir data/models

This assumes that you have downloaded the data with the flag --eval.

The figures related to the vision section and experiments can be reproduced using the notebook rearrangement/notebooks/Paper Figures.

Training new models

[Figure: synthetic training dataset]

You can train the equivalent of the model that we provide using the following command:

python -m rearrangement.main --ds_root data/datasets/synthetic-shapes-1to6 --save data/models/my-model

This assumes that you have downloaded the data with the flag --train.

You can visualize the training and validation losses in the notebook rearrangement/notebooks/Training logs.

Citation

If you find the code or the model useful, please cite this work:

@ARTICLE{labbe2020,
  author={Y. {Labbe} and S. {Zagoruyko} and I. {Kalevatykh} and I. {Laptev} and J. {Carpentier} and M. {Aubry} and J. {Sivic}},
  journal={IEEE Robotics and Automation Letters},
  title={Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning},
  year={2020}}

