Scene graph baseline for GQA #22
Comments
Hi, what do you mean by scene graph baseline? The baseline reported in the supplementary? Where relation target is the object name of the other node participating in this relation. I may release it sooner. And then I get a set of all objects in the image and run standard MAC over that (instead of using visual features). *Note also that the accuracy there is when testing on the ground-truth scene graphs, whereas all other baselines work over the images directly, so the scores shouldn't really be compared directly to each other, as obviously the direct image task is more difficult. I included this experiment to show a simple ~upper bound of "how well would we do if vision was perfect", and an ideal model should in principle achieve 100% in that specific setting.
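For anyone trying to reproduce this, the object-set input described above could be built roughly as follows. This is only a sketch, not the released code: it assumes the GQA scene graph JSON layout (an `objects` map keyed by object id, where each object has `name`, `attributes`, and `relations` entries holding a relation `name` and a target `object` id), and the function name `objects_from_scene_graph` is hypothetical. How the resulting entries are embedded and fed to MAC is not specified in this thread.

```python
def objects_from_scene_graph(scene_graph):
    """Turn one image's scene graph into a list of per-object descriptions.

    Assumed input layout (hypothetical helper, GQA-style JSON):
      {"objects": {obj_id: {"name": str,
                            "attributes": [str, ...],
                            "relations": [{"name": str, "object": obj_id}, ...]}}}
    """
    objects = scene_graph["objects"]
    described = []
    for obj_id in sorted(objects):  # sorted only to make the order deterministic
        obj = objects[obj_id]
        # The relation target is the *name* of the other node in the relation.
        relations = [
            (rel["name"], objects[rel["object"]]["name"])
            for rel in obj.get("relations", [])
            if rel["object"] in objects  # skip dangling references
        ]
        described.append({
            "name": obj["name"],
            "attributes": list(obj.get("attributes", [])),
            "relations": relations,
        })
    return described


# Toy scene graph for illustration only (not taken from the dataset).
example_graph = {
    "objects": {
        "1": {"name": "cat", "attributes": ["black"],
              "relations": [{"name": "on", "object": "2"}]},
        "2": {"name": "sofa", "attributes": [], "relations": []},
    }
}
example_objects = objects_from_scene_graph(example_graph)
```

Each entry in the returned list would then stand in for one "visual" item that the model attends over, in place of a grid or region feature.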
Hi,
Nope, the CNN stem part was for the original MAC version that worked over the older grid features (extracted from ResNet).
Hello,
Is there a way to run the scene graph baseline reported in the paper or are there any available details on how to implement it?