GitHub - fan23j/tcformer-vitpose-ensemble-annotator: An annotation tool that combines TCFormer and ViTPose as an ensemble setup for COCO Wholebody annotations.

Motivation

This repo utilizes a simple ensemble pipeline that combines pose predictions of TCFormer and ViTPose. Both approaches independently produce acceptable inference results, but annecdotally, it seems ViTPose performs better on the classic COCO keypoints (17) while TCFormer outperforms ViTPose on the foot keypoints provided in COCO Wholebody (6). This annotation tool can hopefully save a significant amount of annotation time for custom pose datasets. YoloV5l6 is used as the detector for top-down inference.

Selection Criteria

I only apply a simple criteria that chooses the prediction result with higher confidence (for each keypoint). Although there is no guarantee that higher confidence translates to more accurate prediction, the ensemble results do (anectodally) seem more accurate.

Setup / Installation

We use PyTorch 1.9.0 or NGC docker 21.06, and mmcv 1.3.9 for the experiments.

git clone https://github.com/fan23j/tcformer-vitpose-ensemble-annotator.git
cd tcformer-vitpose-ensemble-annotator
cd mmcv
MMCV_WITH_OPS=1 pip install -e .
# RTX 30 series cards use:
# MMCV_WITH_OPS=1 MMCV_CUDA_ARGS='-gencode=arch=compute_80,code=sm_80' pip install -e .
cd ..
pip install -v -e .

After install the two repos, install timm and einops, i.e.,

pip install timm==0.4.9 einops

Download pre-trained models

Download ViTPose COCO Wholebody pretrained model.

Download TCFormer COCO Wholebody pretrained model.

Run Annotation

Replace placeholders with paths to the pretrained models, custom dataset, and output folder in annotate.sh.

python pose/tools/top_down_img_demo_with_yolov5.py \
    --vitpose-config configs/wholebody/2d_kpt_sview_rgb_img/topdown_heatmap/coco-wholebody/ViTPose_huge_wholebody_256x192.py \
    --vitpose-checkpoint ${VITPOSE_WEIGHTS} \
    --tcformer-config pose/configs/wholebody/2d_kpt_sview_rgb_img/topdown_heatmap/coco-wholebody/tcformer_large_mta_coco_wholebody_384x288.py \
    --tcformer-checkpoint ${TCFORMER_WEIGHTS} \
    --img-root ${CUSTOM_DATASET} \
    --out-img-root ${OUTPUT_FOLDER} \

Acknowledgements

Expand

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
configs		configs
mmcv		mmcv
mmcv_custom		mmcv_custom
mmpose.egg-info		mmpose.egg-info
mmpose		mmpose
pose		pose
requirements		requirements
tcformer_module		tcformer_module
tools		tools
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
annotate.sh		annotate.sh
requirements.txt		requirements.txt
setup.py		setup.py

License

fan23j/tcformer-vitpose-ensemble-annotator

Folders and files

Latest commit

History

Repository files navigation

Motivation

Selection Criteria

Setup / Installation

Download pre-trained models

Run Annotation

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Languages