Image 'features' property design question #2

Open
OlehOnyshchak opened this issue Apr 25, 2020 · 0 comments

Currently, the 'features' property is calculated as follows:

The output of the 5th convolutional stage of ResNet152 trained on the ImageNet dataset. That output, of shape (19, 24, 2048), is then max-pooled to shape (2048,). Features are extracted from the original images, downloaded in JPEG format with a fixed width of 600px. In practice, each one is a list of floats with len = 2048.
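For reference, the pooling step described above can be sketched as follows. This is only an illustration, not the project's actual extraction code: it assumes NumPy, uses a random array as a stand-in for the real ResNet152 output, and the variable names are made up.

```python
import numpy as np

# Stand-in for the convolutional output of one image.
# Random values, NOT real ResNet152 activations.
rng = np.random.default_rng(0)
feature_map = rng.random((19, 24, 2048), dtype=np.float32)

# Global max-pool: for each of the 2048 channels, keep the largest
# activation found anywhere on the 19 x 24 spatial grid.
pooled = feature_map.max(axis=(0, 1))

# Convert to a plain list of Python floats, as stored in the property.
features = pooled.tolist()

print(pooled.shape, len(features))
```

Note that the pooling discards where on the grid each maximum occurred, which is exactly the information the original tensor would preserve.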

But should we max-pool that tensor at all, rather than saving the original (19, 24, 2048) output? And if we really want only 2048 values, wouldn't it be better to simply use ResNet101, which outputs tensors of this shape? (TODO: double-check whether that's true)
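To put rough numbers on the trade-off, here is a small sketch comparing the per-image size of the two options. It assumes values are stored as 32-bit floats, which is an assumption and not something stated above; the helper name is hypothetical.

```python
BYTES_PER_VALUE = 4  # assumption: float32 storage


def count_values(shape):
    """Return the number of scalar values in a tensor of the given shape."""
    total = 1
    for dim in shape:
        total *= dim
    return total


full_count = count_values((19, 24, 2048))   # original tensor
pooled_count = count_values((2048,))        # max-pooled vector

full_bytes = full_count * BYTES_PER_VALUE
pooled_bytes = pooled_count * BYTES_PER_VALUE

print(f"full:   {full_count} values, {full_bytes / 1e6:.2f} MB per image")
print(f"pooled: {pooled_count} values, {pooled_bytes / 1e3:.1f} KB per image")
print(f"ratio:  {full_count // pooled_count}x")
```

Under that assumption, keeping the full tensor costs 456 times more space per image than the pooled vector, so the answer likely depends on how many images the dataset holds.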
