Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

what is the image augmentation strategy while training on current demo release? #88

Open
jo4y4y94 opened this issue Apr 14, 2021 · 2 comments

Comments

@jo4y4y94
Copy link

jo4y4y94 commented Apr 14, 2021

hello,thanks for your fabulous work.I wonder what augmentation strategy was using when training on current release demo network?i noticed there are some tiny changes from current training strategy and the mentioned one in original paper.

Original paper seems not mentioning any augmentation,like just input 224x224 images into network,while current one use 176x176 and implement random cropped and mirrored.I wonder what augmentation strategy was using when training on current release demo network?and if it has go through some kind of cropping,where is the image cropped from?is it crop from the image that central crop from imagenet then resize to 256x256 , or just crop from original image from imagenet which has arbitrary resolution?

Please help me if you have free time,thanks!

@jo4y4y94 jo4y4y94 changed the title what is the image augmentation method while training on current demo release? what is the image augmentation strategy while training on current demo release? Apr 14, 2021
@richzhang
Copy link
Owner

I used random crops (176 out of 256) and flipping in the original paper. It was trained in caffe, a long time ago now

@jo4y4y94
Copy link
Author

jo4y4y94 commented Apr 14, 2021

@richzhang Thanks for you reply! Do you know how were those 256x256 images come from? i'm not sure is central cropped from an arbitrary size image then resize to 256x256,or directly resize an arbitrary size image to 256x256,it seems like second option is a little bit weird cause it change the ratio of image.


After reading caffe's official code,i have found out that if you set H and W to 256 while making lmdb file, it will resize all images to 256x256 no matter what original size is .Put it here in case someone is wondering this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants