Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About AP #124

Closed
augenstern-lwx opened this issue Dec 17, 2020 · 11 comments
Closed

About AP #124

augenstern-lwx opened this issue Dec 17, 2020 · 11 comments

Comments

@augenstern-lwx
Copy link

Hi, thanks for your work! I have a question. Why is the accuracy of 61.8 in the original OpenPose paper and 48.6 in your analysis of the original OpenPose?

@Daniil-Osokin
Copy link
Owner

Hi! We have compared with the original model from the paper "Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields". As you can see in the table 4 of paragraph 3.2 the AP is 58.4%. It will increase to 61%, if do an additional refinement for each found person with a separate model for single person pose estimation (CPM). And those 58.4% was obtained in the multi-scale testing mode (6 scales). 48.6% of AP is obtained using a single scale for input data during testing.

@augenstern-lwx
Copy link
Author

augenstern-lwx commented Dec 18, 2020 via email

@Daniil-Osokin
Copy link
Owner

Network inference was performed 4 times (not 6, it is my mistake), each time with different input image resolution (different scale). Then all network outputs were averaged. You can check the validation script for the details, it supports multi-scale option.

@augenstern-lwx
Copy link
Author

augenstern-lwx commented Dec 19, 2020 via email

@augenstern-lwx
Copy link
Author

augenstern-lwx commented Dec 19, 2020 via email

@Daniil-Osokin
Copy link
Owner

Using single or multiple scales for inference is a speed/accuracy trade-off. Loss function is the same.

@augenstern-lwx
Copy link
Author

Thanks, I'd like to know how to calculate the loss after the combination of Heatmaps and PAFs stages?Because the original OpenPose is calculated by two stages.

@Daniil-Osokin
Copy link
Owner

It is just a sum of all losses for heatmaps and pafs. You may check the training script for more details.

@augenstern-lwx
Copy link
Author

augenstern-lwx commented Dec 22, 2020 via email

@lawo123
Copy link

lawo123 commented Dec 10, 2021

Hi! We have compared with the original model from the paper "Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields". As you can see in the table 4 of paragraph 3.2 the AP is 58.4%. It will increase to 61%, if do an additional refinement for each found person with a separate model for single person pose estimation (CPM). And those 58.4% was obtained in the multi-scale testing mode (6 scales). 48.6% of AP is obtained using a single scale for input data during testing.

You wrote in the essay ‘The accuracy of the optimized version nearly matches the baseline: Average Precision (AP) drop is
less than 1% ‘, How it was calculated?
@Daniil-Osokin

@Daniil-Osokin
Copy link
Owner

We have compared with the baseline with 1 refinement stage (see the table 2, AP after refinement stage 1 is 43.4%). Our final model has the 42.8% AP (see the table 5 in the paper). AP was measured with pycocotools.

@Daniil-Osokin Daniil-Osokin mentioned this issue Jun 27, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants