Implementation of Image-to-Text (Captioning) #6

bryanwong17 · 2024-05-02T08:13:12Z

Hi, I was wondering whether CONCH can directly generate text from an image. From the code, it seems that CONCH only supports "image-to-text retrieval": given an image and several candidate texts, it checks which text is most similar to the image. However, the paper also shows an example of CONCH doing captioning, with a comparison between predicted and corrected captions. If captioning is supported, could you please provide the code for it? Thanks!
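To make the distinction concrete, here is a minimal sketch of what I mean by image-to-text retrieval: the candidate texts are ranked by cosine similarity to the image embedding, and no new text is generated. This is not the CONCH API. The function names `cosine_similarity` and `rank_texts` are hypothetical, and the embeddings are made-up toy vectors standing in for whatever the image and text encoders would produce.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_texts(image_embedding, text_embeddings):
    """Return (text, score) pairs sorted from most to least similar."""
    scores = {
        text: cosine_similarity(image_embedding, emb)
        for text, emb in text_embeddings.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy embeddings (assumption: illustrative values, not model outputs).
image_embedding = [0.9, 0.1, 0.0]
text_embeddings = {
    "caption A": [1.0, 0.0, 0.0],
    "caption B": [0.0, 1.0, 0.0],
    "caption C": [0.0, 0.0, 1.0],
}

ranking = rank_texts(image_embedding, text_embeddings)
best_text, best_score = ranking[0]
print(best_text, round(best_score, 4))
```

Retrieval like this can only pick among the texts it is given, whereas captioning would need a decoder that produces the text token by token, which is the part I could not find in the code.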

bryanwong17 changed the title ~~Implementation of Image-to-Text~~ Implementation of Image-to-Text (Captioning) May 2, 2024
