
[Feature] Add PoseFormer backbone #1215

Open · wants to merge 24 commits into base: dev-0.26
Conversation

@QwQ2000 QwQ2000 (Contributor) commented Mar 3, 2022

Motivation

Modification

Add the backbone, head and config of PoseFormer (ICCV 2021) to the repository.
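For reviewers' context, here is a minimal sketch of how the new modules might be wired into an MMPose 0.x config. The registry names (`PoseFormer`, `PoseFormerHead`), the `PoseLifter` detector type and all parameter values are assumptions for illustration, not the final API from this PR:

```python
# Hypothetical config sketch; module names, parameters and values are
# assumptions for illustration and may differ from the merged version.
model = dict(
    type='PoseLifter',              # 2D-to-3D lifting detector
    backbone=dict(
        type='PoseFormer',          # spatial + temporal transformer backbone (ICCV 2021)
        num_frames=81,              # length of the input 2D pose sequence
        num_joints=17,
        in_chans=2,                 # (x, y) coordinates per joint
        spatial_embed_dim=32,       # per-joint embedding dimension
        depth=4,
    ),
    keypoint_head=dict(
        type='PoseFormerHead',      # regresses the 3D pose of the center frame
        num_joints=17,
    ),
)
```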

BC-breaking (Optional)

Use cases (Optional)

Checklist

Before PR:

  • I have read and followed the workflow indicated in the CONTRIBUTING.md to create this PR.
  • Pre-commit or linting tools indicated in CONTRIBUTING.md are used to fix the potential lint issues.
  • Bug fixes are covered by unit tests; the case that causes the bug should be added to the unit tests.
  • New functionalities are covered by complete unit tests. If not, please add more unit tests to ensure correctness.
  • The documentation has been modified accordingly, including docstring or example tutorials.

After PR:

  • CLA has been signed and all committers have signed the CLA in this PR.

ly015 and others added 15 commits February 24, 2022 17:40
* update mmcv installation CI and doc

* fix lint
* add deprecation message for deploy tools

* change import warnings positions

* do yapf

* do isort
* hrformer

* modify cfg

* update url and readme for hrformer.

* add readme for hrformer paper

* modify readme

* fix publish year

Co-authored-by: ly015 <liyining0712@gmail.com>
* clean inference code

* LoadImageFromFile supports given img
(open-mmlab#1214)

* switch to open-mmlab/pre-commit-hooks

* deprecate .dev_scripts/github/update_copyright.py
* add windows-ci

* reduce input size to save memory

* fix codecov

* reduce input size to save memory

* skip voxelpose unittest for Windows

* remove win+cuda test
@ly015 ly015 changed the base branch from dev-0.24 to dev-0.25 March 8, 2022 03:04
@codecov codecov bot commented Mar 8, 2022

Codecov Report

Attention: 17 lines in your changes are missing coverage. Please review.

Files Coverage Δ
mmpose/models/backbones/__init__.py 100.00% <100.00%> (ø)
mmpose/models/heads/__init__.py 100.00% <100.00%> (ø)
mmpose/datasets/pipelines/pose3d_transform.py 83.78% <66.66%> (+8.45%) ⬆️
mmpose/models/heads/poseformer_head.py 75.00% <75.00%> (ø)
mmpose/models/backbones/poseformer.py 91.58% <91.58%> (ø)

... and 26 files with indirect coverage changes


@QwQ2000 QwQ2000 requested a review from ly015 March 8, 2022 11:28
@ly015 ly015 requested review from liqikai9 and jin-s13 March 9, 2022 05:28
@jin-s13 jin-s13 mentioned this pull request Mar 12, 2022
from .base_backbone import BaseBackbone


class MultiheadAttention(BaseModule):
Collaborator

mmcv also has MultiheadAttention module. Can we use that one?

Contributor Author

This MultiheadAttention module is adapted from mmcls. Compared with the MultiheadAttention in mmcv, its counterpart in mmcls is more similar to the implementation in the official PoseFormer repository.

Contributor Author

I will give the mmcv Transformer blocks a try. However, if the outputs and model weights of a PoseFormer re-implemented on the mmcv Transformer blocks cannot match the official PoseFormer, keeping the mmcls Transformer modules might be the better choice.
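For reference, the mmcls-style attention that the official PoseFormer follows fuses the Q, K and V projections into a single linear layer. A minimal self-contained sketch of that pattern (not the exact code in this PR):

```python
import torch
import torch.nn as nn


class MultiheadAttentionSketch(nn.Module):
    """ViT-style attention with a fused qkv projection, as in mmcls and the
    official PoseFormer. Illustrative sketch only, not the PR code."""

    def __init__(self, embed_dims, num_heads, attn_drop=0., proj_drop=0.):
        super().__init__()
        self.num_heads = num_heads
        self.head_dims = embed_dims // num_heads
        self.scale = self.head_dims ** -0.5
        self.qkv = nn.Linear(embed_dims, embed_dims * 3)
        self.attn_drop = nn.Dropout(attn_drop)
        self.proj = nn.Linear(embed_dims, embed_dims)
        self.proj_drop = nn.Dropout(proj_drop)

    def forward(self, x):
        B, N, C = x.shape
        # (B, N, 3, heads, head_dims) -> (3, B, heads, N, head_dims)
        qkv = self.qkv(x).reshape(B, N, 3, self.num_heads,
                                  self.head_dims).permute(2, 0, 3, 1, 4)
        q, k, v = qkv[0], qkv[1], qkv[2]
        attn = (q @ k.transpose(-2, -1)) * self.scale
        attn = self.attn_drop(attn.softmax(dim=-1))
        x = (attn @ v).transpose(1, 2).reshape(B, N, C)
        return self.proj_drop(self.proj(x))
```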

num_joints=17,
in_chans=2,
embed_dim_ratio=32,
depth=4,
Collaborator

Is the depth for spatial and temporal transformer always the same? If not, do we need to distinguish them?

Contributor Author

@QwQ2000 QwQ2000 Mar 26, 2022

In the official implementation of PoseFormer, the spatial and temporal transformers share the same depth.
However, I agree that distinguishing the two depths is clearer, and I will follow this suggestion.
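A minimal sketch of what the updated constructor could expose, with equal defaults so the official setting is preserved (argument names are illustrative, not necessarily the final API):

```python
import torch.nn as nn


class PoseFormerSketch(nn.Module):
    """Illustrative only: the two transformer depths become separate arguments.

    The defaults stay equal, matching the official implementation, but the
    spatial and temporal depths can now be configured independently.
    """

    def __init__(self, spatial_depth=4, temporal_depth=4, **kwargs):
        super().__init__()
        self.spatial_depth = spatial_depth
        self.temporal_depth = temporal_depth
```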

if norm_cfg is None:
norm_cfg = dict(type='LN')
# Temporal embed_dim is num_joints * spatial embedding dim ratio
embed_dim = embed_dim_ratio * num_joints
Collaborator

Does embed_dim mean the embedding dimension for each frame? How about using embed_dim_per_frame ?

Contributor Author

Now I use spatial_embed_dim and temporal_embed_dim instead of embed_dim_ratio and embed_dim.
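So the renamed quantities relate as follows (a tiny illustration; the numbers are just the defaults shown in the diff above):

```python
# Relation between the renamed embedding dimensions (illustrative values).
num_joints = 17
spatial_embed_dim = 32                                # embedding dimension per joint
temporal_embed_dim = spatial_embed_dim * num_joints   # 544, embedding dimension per frame
```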

init_cfg=init_cfg) for i in range(depth)
])

self.blocks = nn.ModuleList([
Collaborator

Does self.blocks mean the temporal transformer blocks? How about using self.temporal_blocks for clarity?

Contributor Author

self.temporal_blocks is a better choice.
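With the clearer names, the two block lists might look roughly like this (illustrative sketch only; a stock PyTorch encoder layer stands in for the PR's TransformerEncoderLayer so the snippet runs on its own):

```python
import torch.nn as nn

# Illustrative only: clearly named block lists for the two transformers.
spatial_embed_dim, temporal_embed_dim = 32, 32 * 17
spatial_depth, temporal_depth = 4, 4

spatial_blocks = nn.ModuleList([
    nn.TransformerEncoderLayer(d_model=spatial_embed_dim, nhead=8, batch_first=True)
    for _ in range(spatial_depth)
])
temporal_blocks = nn.ModuleList([
    nn.TransformerEncoderLayer(d_model=temporal_embed_dim, nhead=8, batch_first=True)
    for _ in range(temporal_depth)
])
```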

return x


class TransformerEncoderLayer(BaseModule):
Collaborator

mmcv has a similar layer, BaseTransformerLayer, under mmcv/cnn/bricks/transformer.py, which also includes MultiheadAttention. Can we use that one?

Contributor Author

This TransformerEncoderLayer module is adapted from mmcls. Compared with the BaseTransformerLayer in mmcv, its counterpart in mmcls is more similar to the implementation in the official PoseFormer repository.

Contributor Author

I will give the mmcv Transformer blocks a try. However, if the outputs and model weights of a PoseFormer re-implemented on the mmcv Transformer blocks cannot match the official PoseFormer, keeping the mmcls Transformer modules might be the better choice.
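For reference, the mmcls/ViT-style encoder layer that the official PoseFormer follows is pre-norm: LayerNorm is applied before the attention and before the feed-forward network, each wrapped in a residual connection. A minimal self-contained sketch of that pattern (not the exact code in this PR):

```python
import torch
import torch.nn as nn


class TransformerEncoderLayerSketch(nn.Module):
    """Pre-norm encoder layer in the ViT/mmcls style used by PoseFormer.
    Illustrative sketch only; the PR builds its layer from mmcls components."""

    def __init__(self, embed_dims, num_heads, feedforward_channels, drop=0.):
        super().__init__()
        self.norm1 = nn.LayerNorm(embed_dims)
        self.attn = nn.MultiheadAttention(embed_dims, num_heads,
                                          dropout=drop, batch_first=True)
        self.norm2 = nn.LayerNorm(embed_dims)
        self.ffn = nn.Sequential(
            nn.Linear(embed_dims, feedforward_channels),
            nn.GELU(),
            nn.Dropout(drop),
            nn.Linear(feedforward_channels, embed_dims),
            nn.Dropout(drop),
        )

    def forward(self, x):
        # Residual around (norm -> attention)
        h = self.norm1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]
        # Residual around (norm -> feed-forward)
        x = x + self.ffn(self.norm2(x))
        return x
```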

@ly015 ly015 mentioned this pull request Apr 8, 2022
7 tasks
@liqikai9 liqikai9 changed the base branch from dev-0.25 to dev-0.26 April 20, 2022 05:43
@darcula1993

What's wrong with this branch? Is there any update?
@QwQ2000 @ly015

@QwQ2000
Contributor Author

QwQ2000 commented Sep 7, 2022

Sorry, I resigned from the Shanghai AI Lab quite a while ago, and I'm busy with other projects, so I don't have much time for this. If someone could help finish this feature, I will always be willing to assist.
