feat(lidar_transfusion): add lidar_transfusion 3D detection package #6890

amadeuszsz · 2024-04-25T10:18:55Z

Description

The PR adds a new lidar_transfusion package which brings a new 3D detection component based on TransFusion[1].

lidar_transfusion_demo-2024-04-25_15.56.14.mp4

Tests performed

Model metrics:

------------- T4Metric results -------------
04/25 09:28:12 - mmengine - INFO - | class_name | mAP   | AP@0.5m | AP@1.0m | AP@2.0m | AP@4.0m | error@trans_err | error@scale_err | error@orient_err | error@vel_err | error@attr_err |
04/25 09:28:12 - mmengine - INFO - |----------------------------------------------------------------------------------------------------------------------------------------------------|
04/25 09:28:12 - mmengine - INFO - | car        | 0.788 | 0.609   | 0.798   | 0.859   | 0.884   | 0.26            | 0.159           | 0.103            | 0.779         | 1              |
04/25 09:28:12 - mmengine - INFO - | truck      | 0.59  | 0.315   | 0.577   | 0.7     | 0.766   | 0.379           | 0.149           | 0.0516           | 0.77          | 1              |
04/25 09:28:12 - mmengine - INFO - | bus        | 0.554 | 0.347   | 0.532   | 0.628   | 0.709   | 0.363           | 0.137           | 0.0762           | 1.69          | 1              |
04/25 09:28:12 - mmengine - INFO - | bicycle    | 0.236 | 0.192   | 0.241   | 0.251   | 0.261   | 0.283           | 0.248           | 0.27             | 0.839         | 1              |
04/25 09:28:12 - mmengine - INFO - | pedestrian | 0.628 | 0.57    | 0.613   | 0.642   | 0.688   | 0.253           | 0.281           | 0.424            | 0.626         | 1              |
04/25 09:28:12 - mmengine - INFO - | Total mAP: 0.559

ROS 2 node performance with RTX 4090:

                   | pre-process [ms] | inference [ms] | post-process [ms] | total [ms]  |
------------------------------------------------------------------------------------------
lidar_transfusion  |   1.52 ± 0.14    |  2.29 ± 0.37   |    0.20 ± 0.15    | 4.07 ± 0.44 |
lidar_centerpoint  |        -         |       -        |         -         | 5.53 ± 0.55 |

Notes for reviewers

The package is best tested with a rosbag file. If you need data, you can use this rosbag[2] and copy the helper files[3] to the launch directory of the lidar_transfusion package before building. The default path for the ONNX model is ~/autoware_data/lidar_transfusion/transfusion.onnx. The model is awaiting deployment; in the meantime, please use the attached link[4].

To start, run the following commands:

# terminal 1
ros2 launch lidar_transfusion data_pipeline.launch.py 

# terminal 2
ros2 bag play trailer_yaw_ticket_data_jpntaxi7/01d060d9-8d25-45e0-b45e-9fd7201ac27b_2023-05-26-11-30-03_p0900_3.db3 --loop --clock

# terminal 3
rviz2 --ros-args -p use_sim_time:=True 

# terminal 4 (for convenience you can add <param name="use_sim_time" value="true"/> to the node in xml file)
ros2 launch lidar_transfusion lidar_transfusion.launch.xml

Interface changes

Effects on system behavior

Pre-review checklist for the PR author

The PR author must check the checkboxes below when creating the PR.

I've confirmed the contribution guidelines.
The PR follows the pull request guidelines.

In-review checklist for the PR reviewers

The PR reviewers must check the checkboxes below before approval.

The PR follows the pull request guidelines.
The PR has been properly tested.
The PR has been reviewed by the code owners.

Post-review checklist for the PR author

The PR author must check the checkboxes below before merging.

There are no open discussions or they are tracked via tickets.
The PR is ready for merge.

After all checkboxes are checked, anyone who has write access can merge the PR.


knzo25 · 2024-05-02T01:53:28Z

Comments:

When the number of voxels falls outside the expected parameters, the output is 200 NaN objects (200 being the max number of objects). This can happen when the top lidar is missing in the taxi, for example. We should consider lowering the value, and/or aborting the inference + adding an error message.
We need to add transfusion to the perception-related launchers. I did it in my local environment, so I will either pass the changes to the PR's author, commit them myself, or explain the procedure to him.
In my environment, I sporadically get CUDA memory errors. They happen inside the first kernel of the preprocess step.
Not confirmed, but the stream used in the preprocessing may not be the one we are expecting (since replacing the streams with device synchronizations changed the behavior).

knzo25 · 2024-05-02T01:54:32Z

@scepter914
The PR can still be reviewed to some degree, but I think it is better to wait until the errors disappear 🙏

taikitanaka3 · 2024-05-02T02:12:03Z

@knzo25
I think you wanted to mention @scepter914

knzo25 · 2024-05-02T02:18:56Z

Apologies 🙇

amadeuszsz · 2024-05-02T02:36:23Z

Thank you @knzo25 for your review. Let me investigate the kernels first to find the mentioned leak. Regarding the NaN output, as you suggested, we will:

change the optimization profile,
handle cases where the number of voxels after preprocessing is insufficient for the optimization profile.
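
A minimal sketch of the second point, assuming the optimization profile can be summarized by a minimum and a maximum voxel count. The names `VoxelProfile`, `VoxelCheck`, and `check_voxel_count` are hypothetical and are not taken from the package; the policy (skip on too few voxels, clamp on too many) is only one possible choice.

```cpp
#include <cassert>
#include <cstddef>
#include <iostream>

// Hypothetical summary of an optimization profile: the voxel counts
// the inference engine was built to accept.
struct VoxelProfile
{
  std::size_t min_voxels;
  std::size_t max_voxels;
};

enum class VoxelCheck { kOk, kTooFew, kClamped };

// Decide what to do with the voxel count produced by preprocessing.
// Too few voxels: report an error so the caller can skip inference.
// Too many voxels: clamp the count to the profile maximum.
inline VoxelCheck check_voxel_count(const VoxelProfile & profile, std::size_t & num_voxels)
{
  if (num_voxels < profile.min_voxels) {
    std::cerr << "Too few voxels (" << num_voxels << " < " << profile.min_voxels
              << "), skipping inference\n";
    return VoxelCheck::kTooFew;
  }
  if (num_voxels > profile.max_voxels) {
    num_voxels = profile.max_voxels;
    return VoxelCheck::kClamped;
  }
  return VoxelCheck::kOk;
}
```

With a guard of this kind, a frame with a missing top lidar would produce a log message and no detections instead of the maximum number of NaN objects.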

amadeuszsz · 2024-05-02T09:34:52Z

@knzo25
The recent fix solves the issues with CUDA memory and NaNs. Please check whether the issue has disappeared on your machine as well.

scepter914

Thank you for the PR.
Overall, the code is very readable and the documentation is rich, which is great. 👍

I took a quick look at the whole code. Could you add more maintainers? Development will be more robust and smoother if multiple people can maintain the package.
As for the rest of my work, I'll test with some rosbags and approve once I have confirmed that it works.

perception/lidar_transfusion/package.xml

scepter914 · 2024-05-08T06:32:19Z

@amadeuszsz

I apologize for the inconvenience, but could you fix the DCO check?
We need to pass CI, including DCO, in order to merge (this requirement is so inconvenient...).

scepter914

I confirmed that it works with other rosbags.
After Kenzo-san's review and the model upload, we can merge this PR. 👍

knzo25 · 2024-05-23T10:08:46Z

@amadeuszsz
A comment just in case, for future reference. One of the good points of uniform initialization is that empty curly braces {} are guaranteed to assign the correct zero value to primitive types and nullptr to pointers.
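
As a small illustration of this point, the sketch below uses empty braces as default member initializers. The struct and its member names are hypothetical and are not taken from the package.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical struct; the members are illustrative only.
// Empty braces value-initialize each member.
struct DeviceBuffers
{
  float * points_d{};         // nullptr
  std::size_t num_points{};   // 0
  float score_threshold{};    // 0.0f
  bool initialized{};         // false
};

// Returns a value-initialized instance; no member is left indeterminate.
inline DeviceBuffers make_buffers()
{
  return DeviceBuffers{};
}
```

Without the braces, a default-initialized local `DeviceBuffers` would hold indeterminate values in all four members.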

amadeuszsz · 2024-05-31T01:00:55Z

@knzo25

While doing another task that was asked of me, I was looking at the whole perception pipeline and saw that the preprocessing in centerpoint is done on the CPU. I kind of forgot you had told me you were already doing it on the GPU for transfusion, so I ended up doing double work and implemented it myself there as well 🙇

I remembered when I sent the PR. After comparing the implementations, I found that, while quite similar, they have different overall costs, so I wonder if you can change the implementation here so that it has around the same number of operations:

Let K be the total number of frames used for inference and N the number of points in a pointcloud. Assuming that the queue is full, the cost of the preprocessing before the voxelization proper would be as follows:

transfusion:
  enqueue: N * copy(host->host)
  transform and concat:
    move to gpu: K * N * copy(host->device)
    kernel: K * N * transform(device->device)

(proposed) centerpoint:
  enqueue: N * copy(host->device)
  transform and concat:
    kernel: K * N * kernel(device->device)
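
The comparison above can be written out as a small cost model. This is only a sketch that counts per-frame point operations under the stated assumption of a full queue; the struct and function names are hypothetical, and the model ignores the different per-point costs of each kind of copy.

```cpp
#include <cassert>
#include <cstddef>

// Number of points touched per frame by each kind of operation.
struct PreprocessCost
{
  std::size_t host_to_host;    // copies that stay in host memory
  std::size_t host_to_device;  // uploads to the GPU
  std::size_t device_kernel;   // points processed by the transform kernel
};

// Host-side cache: the new cloud is copied on the host, then all K cached
// clouds are uploaded and transformed every frame.
constexpr PreprocessCost transfusion_cost(std::size_t k, std::size_t n)
{
  return PreprocessCost{n, k * n, k * n};
}

// Device-side cache (proposed): only the new cloud is uploaded; the kernel
// still transforms all K cached clouds.
constexpr PreprocessCost proposed_cost(std::size_t k, std::size_t n)
{
  return PreprocessCost{0, n, k * n};
}
```

For K = 2 and N = 100000, the host-side cache uploads 200000 points per frame while the device-side cache uploads 100000; the kernel workload is the same in both cases.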

I modified the code as you suggested. The preprocessing time decreased from 2.15 ± 0.55 to 1.21 ± 0.25 [ms] with two clouds in the cache (the performance differs from the PR's initial benchmark due to a different load). The breaking changes make it harder to keep the host processing (you predicted that earlier 😃), so I just removed it.

After reading your kernels seriously this time, I realized that you went to great lengths to make the operations general. Usually, when writing kernels (CPU or GPU), you want to avoid all branches and loops that are not strictly needed. Could you specialize the kernels so that the fastest implementation is used at runtime? (Maybe keep your generic version for the case where no fast implementation is available.)

Also, since we control the driver, you usually would not need to pay much attention to reverting the endianness.
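
The specialization idea can be sketched on the host side as follows. This is plain C++ rather than a CUDA kernel, and the `Point` and `Layout` types and the assumed fast layout (packed float32 x, y, z, intensity) are hypothetical; the actual layout used by the package is not stated in this thread.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

struct Point
{
  float x, y, z, intensity;
};
static_assert(sizeof(Point) == 4 * sizeof(float), "Point must be tightly packed");

// Hypothetical runtime description of a cloud: byte offsets of float32 fields.
struct Layout
{
  std::size_t point_step;
  std::size_t x_offset, y_offset, z_offset, intensity_offset;
};

// True when the cloud already matches the packed Point struct.
inline bool is_fast_layout(const Layout & l)
{
  return l.point_step == sizeof(Point) && l.x_offset == 0 &&
         l.y_offset == sizeof(float) && l.z_offset == 2 * sizeof(float) &&
         l.intensity_offset == 3 * sizeof(float);
}

// Generic path: every field is located through runtime offsets.
inline Point read_generic(const std::uint8_t * p, const Layout & l)
{
  Point out{};
  std::memcpy(&out.x, p + l.x_offset, sizeof(float));
  std::memcpy(&out.y, p + l.y_offset, sizeof(float));
  std::memcpy(&out.z, p + l.z_offset, sizeof(float));
  std::memcpy(&out.intensity, p + l.intensity_offset, sizeof(float));
  return out;
}

// Fast path: a single fixed-size copy with no per-field offsets.
inline Point read_fast(const std::uint8_t * p)
{
  Point out{};
  std::memcpy(&out, p, sizeof(Point));
  return out;
}

// The layout is inspected once per cloud, not once per point.
inline std::vector<Point> decode(const std::vector<std::uint8_t> & data, const Layout & l)
{
  std::vector<Point> out;
  if (l.point_step == 0) {
    return out;
  }
  const std::size_t n = data.size() / l.point_step;
  out.reserve(n);
  if (is_fast_layout(l)) {
    for (std::size_t i = 0; i < n; ++i) {
      out.push_back(read_fast(data.data() + i * sizeof(Point)));
    }
  } else {
    for (std::size_t i = 0; i < n; ++i) {
      out.push_back(read_generic(data.data() + i * l.point_step, l));
    }
  }
  return out;
}
```

The design choice is to hoist the layout decision out of the per-point loop, so the common case runs without per-field branching while the generic reader remains as a fallback.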

Changed. The package is now not compatible with other point cloud formats. However, there is validation at initialization, and we throw an exception with appropriate logs.
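
A validation of this kind could look roughly like the sketch below. The `Field` struct only resembles a point field description and is not the real message type; `validate_fields` and the error messages are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical description of one point field (name, byte offset, size).
struct Field
{
  std::string name;
  std::size_t offset;
  std::size_t size_bytes;
};

// Throws std::runtime_error with a descriptive message when the incoming
// layout differs from the one the specialized kernels expect.
inline void validate_fields(const std::vector<Field> & actual, const std::vector<Field> & expected)
{
  if (actual.size() != expected.size()) {
    throw std::runtime_error(
      "Unexpected number of fields: got " + std::to_string(actual.size()) + ", expected " +
      std::to_string(expected.size()));
  }
  for (std::size_t i = 0; i < expected.size(); ++i) {
    const Field & a = actual[i];
    const Field & e = expected[i];
    if (a.name != e.name || a.offset != e.offset || a.size_bytes != e.size_bytes) {
      throw std::runtime_error(
        "Field " + std::to_string(i) + " mismatch: got '" + a.name + "' at offset " +
        std::to_string(a.offset) + ", expected '" + e.name + "' at offset " +
        std::to_string(e.offset));
    }
  }
}
```

Running the check once on the first received cloud keeps the per-frame path free of layout branches.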

98e31a9
This seems to be the last requested modification. Since it was a big change, apart from the code review, please test it again at runtime.

perception/lidar_transfusion/lib/preprocess/pointcloud_densification.cpp

knzo25

@amadeuszsz
Thank you for handling all the comments. Before merging the PR, though, can you revert the default model to centerpoint?
(I checked the runtime and there are no issues.)

knzo25

LGTM !
🚀

knzo25 · 2024-06-03T03:53:18Z

@YoshiRi
Due to having touched the launchers, we also need a review from one of the maintainers of that package.
Since it is only a few lines, could you make a short review of that part?

YoshiRi

LGTM for launcher part

knzo25 · 2024-06-04T08:47:33Z

perception/lidar_transfusion/README.md

+
+| Name                             | Type         | Default Value | Description                                                                                        |
+| -------------------------------- | ------------ | ------------- | -------------------------------------------------------------------------------------------------- |
+| `class_names`                    | list[string] | -             | Class names for 3D object detection.                                                               |


@amadeuszsz
Sorry for the late comment, but since you added the schema file, could you change the README.md so it uses the schema instead?

Done 👍🏻

feat(lidar_transfusion): add lidar_transfusion 3D detection package

13121ab

Signed-off-by: amadeuszsz <amadeusz.szymko@tier4.jp>

github-actions bot added the type:documentation and component:perception labels (auto-assigned) Apr 25, 2024

style(pre-commit): autofix

b91fb8e

amadeuszsz marked this pull request as ready for review April 25, 2024 10:25

style(lidar_transfusion): cpplint

b8b2d2e

Signed-off-by: amadeuszsz <amadeusz.szymko@tier4.jp>

scepter914 self-requested a review April 26, 2024 01:21

knzo25 self-requested a review April 26, 2024 01:45

knzo25 and others added 2 commits April 26, 2024 10:45

Merge branch 'main' into feat/lidar_transfusion

972700a

style(lidar_transfusion): cspell

1abe6d7

Signed-off-by: Amadeusz Szymko <amadeusz.szymko@tier4.jp>

amadeuszsz and others added 2 commits May 2, 2024 17:51

fix(lidar_transfusion): CUDA mem allocation & inference input

7447bdd

Signed-off-by: amadeuszsz <amadeusz.szymko@tier4.jp>

style(pre-commit): autofix

a7caa15

amadeuszsz and others added 2 commits May 2, 2024 20:20

fix(lidar_transfusion): arrays size

e2126ee

Signed-off-by: amadeuszsz <amadeusz.szymko@tier4.jp>

style(pre-commit): autofix

c08d189

scepter914 reviewed May 8, 2024

perception/lidar_transfusion/package.xml (outdated)

chore(lidar_transfusion): update maintainers

f5e8146

Co-authored-by: Satoshi Tanaka <16330533+scepter914@users.noreply.github.com>
Signed-off-by: amadeuszsz <amadeusz.szymko@tier4.jp>

amadeuszsz force-pushed the feat/lidar_transfusion branch from 8506f8a to f5e8146 Compare May 8, 2024 06:50

amadeuszsz and others added 2 commits May 8, 2024 19:22

fix(lidar_transfusion): array size & grid idx

628d305

Signed-off-by: amadeuszsz <amadeusz.szymko@tier4.jp>

chore(lidar_transfusion): update maintainer email

df5ce91

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

scepter914 approved these changes May 10, 2024

knzo25 added 2 commits May 10, 2024 10:25

chore: added transfusion to the respective launchers

90c5a99

Signed-off-by: Kenzo Lobos-Tsunekawa <kenzo.lobos@tier4.jp>

Merge branch 'feat/lidar_transfusion' of github.com:amadeuszsz/autoware.universe into feat/lidar_transfusion

1ee5026

amadeuszsz requested a review from miursh as a code owner May 10, 2024 01:27

style(pre-commit): autofix

e47c571

amadeuszsz and others added 12 commits May 24, 2024 17:22

refactor(lidar_transfusion): use of config params

49cd20e

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

refactor(lidar_transfusion): remove unnecessary condition

dca8b5e

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

style(lidar_transfusion): switch naming (CPU to HOST)

389c6cb

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

refactor(lidar_transfusion): remove redundant device sync

fe15baf

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

style(lidar_transfusion): intensity naming

63a3009

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

feat(lidar_transfusion): full network shape validation

e83d0bf

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

feat(lidar_transfusion): validate objects' orientation in host processing

7a054cc

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

feat(lidar_transfusion): add json schema

ac01c4a

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

style(pre-commit): autofix

d80ea72

style(lidar_transfusion): affine matrix naming

3c0187f

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

style(lidar_transfusion): transformed point naming

2eca809

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

refactor(lidar_transfusion): add param descriptor & arrays size check

8434721

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

amadeuszsz force-pushed the feat/lidar_transfusion branch from 3cd4249 to 8434721 Compare May 28, 2024 07:46

amadeuszsz added 3 commits May 29, 2024 13:02

style(lidar_transfusion): affine matrix naming

8b59a37

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

feat(lidar_transfusion): caching cloud input as device ptr

98e31a9

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

fix(lidar_transfusion): logging

91e744c

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

amadeuszsz requested a review from knzo25 May 31, 2024 01:19

knzo25 reviewed May 31, 2024

perception/lidar_transfusion/lib/preprocess/pointcloud_densification.cpp

knzo25 requested changes May 31, 2024

amadeuszsz added 2 commits June 3, 2024 09:24

chore(tier4_perception_launch): revert to centerpoint

c68c6f5

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

fix(lidar_transfusion): typo

e6a657d

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>

knzo25 approved these changes Jun 3, 2024

Merge branch 'main' into feat/lidar_transfusion

5b9bf5c

YoshiRi approved these changes Jun 3, 2024

knzo25 reviewed Jun 4, 2024

docs(lidar_transfusion): use hook for param description

8c8cf86

Signed-off-by: amadeuszsz <amadeusz.szymko.2@tier4.jp>
