Unofficial PyTorch implementation of U-CondDGCN from "Wenbo Hu, Changgong Zhang, Fangneng Zhan, Lei Zhang, Tien-Tsin Wong: Conditional Directed Graph Convolution for 3D Human Pose Estimation"

U-CondDGCN: Conditional Directed Graph Convolution for 3D Human Pose Estimation

This repo is an unofficial PyTorch implementation of "Conditional Directed Graph Convolution for 3D Human Pose Estimation" by Wenbo Hu, Changgong Zhang, Fangneng Zhan, Lei Zhang, and Tien-Tsin Wong. Many implementation details are omitted in the paper, so this implementation may differ from the authors' original; feedback on implementation errors is welcome.

Abstract

Graph convolutional networks have significantly improved 3D human pose estimation by representing the human skeleton as an undirected graph. However, this representation fails to reflect the articulated characteristic of human skeletons as the hierarchical orders among the joints are not explicitly presented. In this paper, we propose to represent the human skeleton as a directed graph with the joints as nodes and bones as edges that are directed from parent joints to child joints. By so doing, the directions of edges can explicitly reflect the hierarchical relationships among the nodes. Based on this representation, we further propose a spatial-temporal conditional directed graph convolution to leverage varying non-local dependence for different poses by conditioning the graph topology on input poses. Altogether, we form a U-shaped network, named U-shaped Conditional Directed Graph Convolutional Network, for 3D human pose estimation from monocular videos. To evaluate the effectiveness of our method, we conducted extensive experiments on two challenging large-scale benchmarks: Human3.6M and MPI-INF-3DHP. Both quantitative and qualitative results show that our method achieves top performance. Also, ablation studies show that directed graphs can better exploit the hierarchy of articulated human skeletons than undirected graphs, and the conditional connections can yield adaptive graph topologies for different poses.
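The core idea above can be sketched in PyTorch: messages flow along directed edges (parent joint to child joint), and a learned branch conditions the graph topology on the input pose. This is a minimal, illustrative sketch, not the paper's exact formulation; the class name, the conditioning MLP, and all parameter shapes are assumptions made for the example.

```python
import torch
import torch.nn as nn

class CondDirectedGraphConv(nn.Module):
    """Illustrative conditional directed graph convolution (hypothetical,
    simplified): a fixed directed adjacency carries parent->child messages,
    and a small MLP predicts a per-sample adjustment of that topology."""

    def __init__(self, in_features, out_features, num_joints, adj):
        super().__init__()
        # adj[i, j] = 1 if there is a directed edge from joint j to joint i
        self.register_buffer("adj", adj)
        self.weight = nn.Linear(in_features, out_features, bias=False)
        # conditioning branch: flattened input pose -> adjacency offset
        self.cond = nn.Sequential(
            nn.Linear(num_joints * in_features, num_joints * num_joints),
            nn.Tanh(),
        )

    def forward(self, x):
        # x: (batch, num_joints, in_features)
        b, j, _ = x.shape
        # pose-conditioned adjustment of the fixed directed adjacency
        delta = self.cond(x.reshape(b, -1)).reshape(b, j, j)
        a = self.adj.unsqueeze(0) + delta
        # aggregate along (adjusted) directed edges, then project features
        return torch.relu(self.weight(torch.bmm(a, x)))

# toy 5-joint kinematic chain: joint 0 is the root, edges point to children
adj = torch.zeros(5, 5)
for child, parent in [(1, 0), (2, 1), (3, 2), (4, 3)]:
    adj[child, parent] = 1.0

layer = CondDirectedGraphConv(in_features=2, out_features=8, num_joints=5, adj=adj)
out = layer(torch.randn(3, 5, 2))  # (batch=3, joints=5, features=8)
```

Because the adjacency offset depends on the input pose, different poses effectively see different graph topologies, which is the "conditional" part of the method.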

Visualization

Dependencies

  • CUDA 11.1
  • Python 3.8.11
  • PyTorch 1.9.0+cu111

Dataset setup

Please download the dataset from the Human3.6M website and refer to VideoPose3D to set up the Human3.6M dataset (in the './dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz
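To confirm the layout above is in place before training or testing, a small check like the following can help. The helper name is hypothetical; the two file names are the ones listed in the tree above.

```python
import tempfile
from pathlib import Path

def check_h36m_dataset(root):
    """Return the expected Human3.6M archives missing under root/dataset.
    File names follow the VideoPose3D convention shown above."""
    expected = ["data_3d_h36m.npz", "data_2d_h36m_cpn_ft_h36m_dbb.npz"]
    dataset_dir = Path(root) / "dataset"
    return [name for name in expected if not (dataset_dir / name).is_file()]

# quick self-check against a throwaway directory
root = Path(tempfile.mkdtemp())
(root / "dataset").mkdir()
missing_before = check_h36m_dataset(root)  # both archives absent
for name in missing_before:
    (root / "dataset" / name).touch()      # stand-ins for the real archives
missing_after = check_h36m_dataset(root)   # nothing missing now
```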

Test the model

To test on pretrained model on Human3.6M:

python main.py --reload --previous_dir 'checkpoint/pretrained'

Here, we compare our U-CondDGCN with recent state-of-the-art methods on the Human3.6M dataset. The evaluation metric is Mean Per Joint Position Error (MPJPE) in mm.

| Types | Models      | MPJPE (mm) |
|-------|-------------|------------|
| TCN   | VideoPose3D | 46.8       |
| ViT   | PoseFormer  | 44.3       |
| ViT   | MHFormer    | 43.0       |
| GCN   | UGCN        | 45.6       |
| GCN   | U-CondDGCN  | 43.4       |
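For reference, MPJPE is the mean Euclidean distance between predicted and ground-truth 3D joint positions, averaged over joints and frames. A minimal numpy sketch (the function name and toy shapes are illustrative):

```python
import numpy as np

def mpjpe(pred, gt):
    """Mean Per Joint Position Error: mean Euclidean distance between
    predicted and ground-truth joints, in the input units (mm here).
    pred, gt: arrays of shape (frames, joints, 3)."""
    return float(np.linalg.norm(pred - gt, axis=-1).mean())

# toy example: every joint offset by 3 mm along one axis -> MPJPE = 3.0
gt = np.zeros((2, 17, 3))
pred = gt.copy()
pred[..., 0] += 3.0
err = mpjpe(pred, gt)
```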

Train the model

To train on Human3.6M:

python main.py --train

Citation

If you find our work useful in your research, please consider citing:

@misc{hu2021conditional,
      title={Conditional Directed Graph Convolution for 3D Human Pose Estimation}, 
      author={Wenbo Hu and Changgong Zhang and Fangneng Zhan and Lei Zhang and Tien-Tsin Wong},
      year={2021},
      eprint={2107.07797},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgement

Our code is extended from the following repositories. We thank the authors for releasing their code.

License

This project is licensed under the terms of the MIT license.
