Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于预训练模型大小 #1

Closed
dazaier opened this issue Apr 23, 2024 · 4 comments
Closed

关于预训练模型大小 #1

dazaier opened this issue Apr 23, 2024 · 4 comments

Comments

@dazaier
Copy link

dazaier commented Apr 23, 2024

您好!
我下载了预训练模型,有个问题想要请教:为什么不同的数据集的预训练模型会大小不一样呀,我的理解当网络结构确定,模型大小不应该是固定的吗?

@yuell-science
Copy link
Contributor

Thanks for raising the interesting question on different model sizes per dataset! In our model architecture, we are inheriting the multi-path design in this paper: https://openaccess.thecvf.com/content_CVPR_2020/papers/Sundermeyer_Multi-Path_Learning_for_Object_Pose_Estimation_Across_Domains_CVPR_2020_paper.pdf. Basically it uses separate decoders per object. Indeed, this design is also adopted in several follow-up works. As different datasets have different number of objects, the model size will vary. Let me know if you have further questions.

@dazaier
Copy link
Author

dazaier commented Apr 24, 2024

Thanks for your quick response!
Since each object has its own decoder, as the number of object categories in the dataset increases, the size of the model also increases. I observed that for the LMO dataset with 15 classes, the model size is 1.5GB. However, for the YCBV dataset with 21 classes, the model size increases to 2.4GB. Is my understanding correct?

@yuell-science
Copy link
Contributor

It is not necessarily proportional since the model comprises encoder + decoder + classification and regression heads. Only the decoder size is proportional to the number of objects.

@yuell-science
Copy link
Contributor

Closing as no activity happens for a week.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants