Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implementation of Image-to-Text (Captioning) #6

Open
bryanwong17 opened this issue May 2, 2024 · 0 comments
Open

Implementation of Image-to-Text (Captioning) #6

bryanwong17 opened this issue May 2, 2024 · 0 comments

Comments

@bryanwong17
Copy link

bryanwong17 commented May 2, 2024

Hi, I was wondering if CONCH is able to directly convert an image to text? From the code, it seems like CONCH is only available for "image-to-text retrieval," meaning that given an image and several texts, it will check which text is most similar to the given image. However, in the paper, there is also an example of CONCH doing captioning and a comparison between predicted and corrected captions. If so, could you please provide the code for doing captioning? Thanks!

@bryanwong17 bryanwong17 changed the title Implementation of Image-to-Text Implementation of Image-to-Text (Captioning) May 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant