Visual Grounding code #26

Open
lvchw opened this issue Mar 22, 2024 · 1 comment
Comments

lvchw commented Mar 22, 2024

Hi, thank you for your wonderful work!

I want to perform visual grounding (VG) on the Chest ImaGenome dataset to visualize the results of report generation, but I could not find any existing work that performs the VG task on this dataset. I noticed that Appendix B.2 of your paper presents three excellent VG results. How were they produced? Is the code in the repository? If not, could it be made available? I would greatly appreciate your help.

Owner

ttanida commented Mar 25, 2024

Hi,

I am afraid you will have to write some custom code for this, as I didn't commit the code back then.

You can adapt this script, e.g. by changing this line so that the model outputs the bounding boxes alongside the beam search output. Then it is only a matter of plotting the color-coded region sentences together with their associated bounding boxes.
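The plotting step could look roughly like the sketch below. It is not the code used for the paper's figures. It assumes the adjusted script yields one generated sentence per region plus a matching box in `(x_min, y_min, x_max, y_max)` pixel coordinates of the displayed image; that format, and the names `RegionAnnotation`, `pair_regions`, `plot_regions`, and `PALETTE`, are illustrative assumptions rather than anything defined in the repository.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

# Hypothetical palette; any set of distinguishable matplotlib colors works.
PALETTE = ["tab:red", "tab:blue", "tab:green", "tab:orange", "tab:purple", "tab:cyan"]


@dataclass
class RegionAnnotation:
    sentence: str
    # Assumed box format: (x_min, y_min, x_max, y_max) in image pixels.
    box: Tuple[float, float, float, float]
    color: str


def pair_regions(sentences: Sequence[str],
                 boxes: Sequence[Sequence[float]]) -> List[RegionAnnotation]:
    """Pair each sentence with its box, skip empty sentences, assign a color."""
    if len(sentences) != len(boxes):
        raise ValueError("expected exactly one box per sentence")
    annotations: List[RegionAnnotation] = []
    for sentence, box in zip(sentences, boxes):
        text = sentence.strip()
        if not text:
            continue  # regions without a generated sentence are not drawn
        color = PALETTE[len(annotations) % len(PALETTE)]
        annotations.append(RegionAnnotation(text, tuple(box), color))
    return annotations


def plot_regions(image, annotations: List[RegionAnnotation], out_path: str) -> None:
    """Draw each box in its color and list the sentences in a matching legend."""
    # matplotlib is imported here so pair_regions stays usable without it.
    import matplotlib
    matplotlib.use("Agg")  # render to file, no display needed
    import matplotlib.pyplot as plt
    from matplotlib.patches import Rectangle

    fig, ax = plt.subplots(figsize=(8, 8))
    ax.imshow(image, cmap="gray")
    for ann in annotations:
        x0, y0, x1, y1 = ann.box
        ax.add_patch(Rectangle((x0, y0), x1 - x0, y1 - y0, fill=False,
                               edgecolor=ann.color, linewidth=2,
                               label=ann.sentence))
    ax.axis("off")
    # The legend entries reuse the box colors, giving the color coding.
    ax.legend(loc="upper center", bbox_to_anchor=(0.5, -0.02), fontsize=8)
    fig.savefig(out_path, bbox_inches="tight")
    plt.close(fig)
```

For example, `plot_regions(image, pair_regions(sentences, boxes), "vg.png")` would write one figure per image, where `image` is a 2D array of the chest X-ray. If the model predicts boxes on a resized input, they would need to be rescaled to the displayed image first.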
