Regarding the training data composition #405

amundra15 · 2024-04-10T12:02:13Z

It is unclear whether the model was trained on object-centric images (e.g., ImageNet) or scene-level images. In Sec. 3, the authors mention retrieving the dataset by crawling the web. Does this mean that the dataset contains scenes composed of multiple objects?

Perhaps you could release a small sample dataset to give a general idea.

qasfb · 2024-05-13T14:15:26Z

I expect the dataset contains some images with multiple objects, as we haven't filtered those out in any way.
Releasing a sample dataset is a complex process, so I would recommend not planning around that.

qasfb closed this as completed May 13, 2024
