
Question: Is the SAM-HQ model applicable for predicting segmentation masks for input images without boxes, points, or labels? #106

Open
mzg0108 opened this issue Dec 6, 2023 · 2 comments

Comments

@mzg0108

mzg0108 commented Dec 6, 2023

If I understand it correctly, both SAM and SAM-HQ take box prompts, input points, and point labels as input along with the input image.
What about input images for which we don't have this information available?

If we want to take the human completely out of the loop and have the model take only the image as input and predict the masks, what changes do we need to make to the model?

@lkeab
Collaborator

lkeab commented Dec 9, 2023

We can use the everything mode, as demonstrated here, which feeds uniformly sampled points on the image as prompts.
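
For reference, a minimal sketch of everything mode, assuming SAM-HQ's fork keeps the original `segment_anything` API (`sam_model_registry`, `SamAutomaticMaskGenerator`) and that a checkpoint file is available locally; the checkpoint path and grid density below are illustrative:

```python
# Everything mode: no manual prompts are given; the generator
# seeds a uniform grid of point prompts across the image itself.
import cv2
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

# Checkpoint filename is an assumption; use whichever SAM-HQ weights you downloaded.
sam = sam_model_registry["vit_h"](checkpoint="sam_hq_vit_h.pth")
sam.to("cuda")

# points_per_side controls the density of the uniform point grid.
mask_generator = SamAutomaticMaskGenerator(sam, points_per_side=32)

image = cv2.cvtColor(cv2.imread("input.jpg"), cv2.COLOR_BGR2RGB)
masks = mask_generator.generate(image)  # list of dicts: 'segmentation', 'predicted_iou', ...
print(f"{len(masks)} masks generated")
```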

@jez-moxmo

If I'm not mistaken, you still need the prompt encoder to produce embeddings before an image can be masked. The name "automatic mask generator" is actually a bit misleading, as it just generates a point prompt roughly every 20 pixels. For each point prompt, embeddings are encoded, the candidate masks are scored by the model's predicted IoU, and the most probable mask (or the top 3) is kept.
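
To make that concrete, here is a hedged sketch of what the generator does under the hood, written against the standard `segment_anything` `SamPredictor` API; the grid spacing, foreground label, and IoU threshold are illustrative, not the library's exact defaults:

```python
# Under-the-hood sketch: seed a uniform grid of single-point prompts,
# predict candidate masks for each, and keep only the masks the model
# itself scores highly (predicted IoU).
import cv2
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="sam_hq_vit_h.pth")  # assumed checkpoint path
sam.to("cuda")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("input.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)  # image embedding is computed once and reused per prompt

h, w = image.shape[:2]
step = 20  # roughly "a point prompt every 20 pixels"
kept = []
for y in range(step // 2, h, step):
    for x in range(step // 2, w, step):
        masks, ious, _ = predictor.predict(
            point_coords=np.array([[x, y]]),
            point_labels=np.array([1]),  # 1 = foreground point
            multimask_output=True,       # three candidate masks per point
        )
        best = int(np.argmax(ious))
        if ious[best] > 0.88:            # illustrative confidence cutoff
            kept.append(masks[best])
```

In the real generator the surviving masks are additionally deduplicated with non-maximum suppression, which this sketch omits.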
