Reimplement VectorNet 🚗

Paper: VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation

Still under construction:

Implement VectorNet: a hierarchical GNN encoder + MLP predictor, without node feature completion.

~~The performance on the test set is 3.255 minADE (K=1) vs. 1.81 in the paper.~~ (A bug was found in GraphDataset: the former implementation included self-loop connections in the graph data, which was wrong, so the preprocessed dataset.pt was wrong as well. The model is still training...)
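The self-loop issue can be illustrated with a minimal sketch. It assumes the nodes of one polyline are fully connected and that edges are stored as a pair of source/target index lists (the 2 x E layout that torch-geometric uses for `edge_index`). The helper name `fully_connected_edges` is hypothetical and is not taken from this repo.

```python
from itertools import permutations


def fully_connected_edges(num_nodes):
    """Return [sources, targets] for a fully connected directed graph
    over num_nodes nodes, excluding self-loops (i -> i)."""
    # permutations(..., 2) yields every ordered pair (i, j) with i != j,
    # so self-loops can never appear in the result.
    pairs = list(permutations(range(num_nodes), 2))
    sources = [i for i, _ in pairs]
    targets = [j for _, j in pairs]
    return [sources, targets]


edge_index = fully_connected_edges(3)
print(edge_index)  # [[0, 0, 1, 1, 2, 2], [1, 2, 0, 2, 0, 1]]
```

A quick sanity check on any edge list built this way is that no source index equals its target index.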

After fixing the self-loop bug in the graph data, I re-trained the network with the same settings, only to find that the ADE on the validation set stayed at about 2.6, which was disappointing. Note that I only used the context (social + lanes) within about 5-10 meters around each agent (I did not have enough compute), so I changed the context radius to 100 meters in the config.py file (in the paper it's 200 * 200, if my memory serves me right). Unfortunately, the machines in the lab are not accessible to me right now, so I couldn't train the network with these new settings. 😢
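To make the effect of the context radius concrete, here is a small sketch of radius-based filtering. It assumes the context is selected by Euclidean distance from the agent position; the function `within_radius` and the sample coordinates are hypothetical, and the actual option name in config.py is not shown here.

```python
import math


def within_radius(center, points, radius):
    """Keep the (x, y) points whose Euclidean distance to center is <= radius."""
    cx, cy = center
    return [(x, y) for x, y in points if math.hypot(x - cx, y - cy) <= radius]


agent_xy = (0.0, 0.0)
lane_points = [(3.0, 4.0), (60.0, 80.0), (120.0, 0.0)]  # made-up coordinates in meters

print(within_radius(agent_xy, lane_points, 10.0))   # [(3.0, 4.0)]
print(within_radius(agent_xy, lane_points, 100.0))  # [(3.0, 4.0), (60.0, 80.0)]
```

With a 10 m radius only the closest point survives, while a 100 m radius also keeps the lane point 100 m away, so the model sees far more of the scene.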

Branch master is in sync with branch large-scale; branch overfit-small is archived.

Environment

Multi-GPU training on Windows Server 2016; CUDA version 10.1; 2 Titan Xp GPUs.

Install the packages listed in requirements.txt:

pip install -r requirements.txt

torch==1.4.0, argoverse-api, numpy==1.18.1, pandas==1.0.0, matplotlib==3.1.1, torch-geometric==1.5.0

Usage

For pre-processed data, pre-trained model, and results *.h5 file: Google Drive

(Remember to run find . -name "*.DS_Store" -type f -delete if you're using macOS)

Install argoverse-api. Download the HD maps into argoverse-api as instructed.
Either download the prepared dataset objects from Google Drive directly, unzip them in the path ., and skip the feature preprocessing step,

or prepare the dataset (batchify ...) from the raw *.csv files:

put all data (folders named train/val/test, or a single folder named sample) in the data folder.

An example folder structure:
```
data - train - *.csv
     \        \ ...
      \
       \- val - *.csv
        \       \ ...
         \
          \- test - *.csv
                   \ ...
```
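Before running the preprocessing, it can help to confirm that the layout above is in place. The following is a minimal sketch using only the standard library; the function name `count_csv_files` is hypothetical and not part of this repo.

```python
from pathlib import Path


def count_csv_files(data_root, splits=("train", "val", "test")):
    """Return {split: number of *.csv files directly inside data_root/split}."""
    root = Path(data_root)
    counts = {}
    for split in splits:
        split_dir = root / split
        # A missing folder is reported as 0 instead of raising an error.
        counts[split] = len(list(split_dir.glob("*.csv"))) if split_dir.is_dir() else 0
    return counts


print(count_csv_files("data"))
```

A count of 0 for a split means that folder is missing or holds no *.csv files at the expected level.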
Modify the config file utils/config.py: set the proper environment paths and arguments.
Feature preprocessing: compute the input features and save the intermediate data (compute_feature_module.py).
```
$ python compute_feature_module.py
```
Use a (200, 200) context size for a single sequence, as in the paper.
Train the model (train.py). To overfit a tiny dataset, set small_dataset = True; GraphDataset in dataset.py is used to batchify the data.
```
$ python train.py
```

Results on val and test

Some prediction results were uploaded to the Argoverse contest; check the leaderboard via the URL.

Submission ID of the repo: @xkhuang

Result on val

| model params | minADE (K=1) | minFDE (K=1) |
| --- | --- | --- |
| results in paper | 1.66 | 3.67 |
| epoch_24.valminade_2.637.200624.xkhuang.pth | 2.637 | |

Result on test

| model params | minADE (K=1) | minFDE (K=1) |
| --- | --- | --- |
| results in paper | 1.81 | 4.01 |
| epoch_24.valminade_2.637.200624.xkhuang.pth | 3.255298 | 6.992046 |
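For reference, the two metrics in the tables can be sketched as follows. With K=1 there is a single predicted trajectory per sequence, so minADE and minFDE reduce to the plain average and final displacement errors. This is a minimal illustration for one sequence with made-up coordinates, not the evaluation code used for the submission; the function name `ade_fde` is hypothetical.

```python
import math


def ade_fde(pred, gt):
    """Average and final displacement error between two trajectories,
    each given as a list of (x, y) positions of equal length."""
    if len(pred) != len(gt) or not pred:
        raise ValueError("trajectories must be non-empty and of equal length")
    # Per-timestep Euclidean distance between prediction and ground truth.
    dists = [math.hypot(px - gx, py - gy) for (px, py), (gx, gy) in zip(pred, gt)]
    return sum(dists) / len(dists), dists[-1]


pred = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
gt = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
print(ade_fde(pred, gt))  # (1.0, 2.0)
```

The reported numbers are these per-sequence errors averaged over all sequences in the split.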

Result and visualization for overfitting tiny dataset

Sample results are shown below:

red lines are the agent's input and ground-truth output
blue points are the predicted future trajectory
light blue lines are other moving objects
grey lines are lanes

Using nearby context (about 5 m around):

Using 200 * 200 context (about 100 m around):

with lanes:

without lanes:
