Unexpected label during prediction #66

yuerout · 2024-10-31T00:32:18Z

Hi! I'm running grounded_sam2_hf_model_demo.py on one of my images with the labels woman. swab. balcony. room. bucket. sky. However, the results returned by the processor look like this: 'labels': ['bucket', 'room', 'woman', 'womanab']. I'm not sure where the womanab label comes from.
Here is the image that I used

Here is the image post processing

Does anyone have any insights on why this might be happening? Thanks!

rentainhe · 2024-10-31T07:00:36Z

Hello @yuerout , we have similar discussions here: #50. You can refer to this issue for more details.

yuerout · 2024-10-31T21:21:57Z

Hi @rentainhe , thanks for the reply! I looked at the file, but it seems like the input to that is a text prompt ("There is a cat and a dog in the image ."). My input is already parsed into individual labels. I was wondering if you know how I can adapt the code there for my use? Thank you!

yuerout · 2024-10-31T21:42:05Z

I also noticed that for all the labels with an underscore, additional spaces were added into the output label after prediction. For example, in the demo grounded_sam2_hf_model_demo.py, if I change the label of the car to white_car, the predicted label in grounded_sam2_hf_model_demo_results.json becomes white _ car.
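The extra spaces are consistent with how BERT-style tokenizers usually treat punctuation: the underscore becomes its own token, and decoding joins tokens with spaces. The sketch below is a toy stand-in written for illustration, not the tokenizer the demo actually uses; `basic_tokenize`, `decode`, and `roundtrip` are hypothetical names.

```python
import re


def basic_tokenize(text: str) -> list[str]:
    """Toy pre-tokenizer: runs of letters/digits stay together, and every
    other non-space character (including "_") becomes its own token."""
    return re.findall(r"[A-Za-z0-9]+|[^\sA-Za-z0-9]", text)


def decode(tokens: list[str]) -> str:
    """Toy decoder: join tokens with single spaces."""
    return " ".join(tokens)


def roundtrip(label: str) -> str:
    """Tokenize a label and decode it again, as a predicted label would be."""
    return decode(basic_tokenize(label))


print(basic_tokenize("white_car"))  # ['white', '_', 'car']
print(roundtrip("white_car"))       # white _ car
print(roundtrip("white car"))       # white car
```

If this is indeed the cause, one possible workaround is to write the prompt with spaces (white car) and map the predicted label back to your own label name afterwards.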

rentainhe · 2024-11-01T01:59:36Z

Hi @yuerout

About where womanab comes from: Grounding DINO first computes the similarity between each box region and each piece of the text. If the maximum score is higher than the predefined box_threshold, the box is kept in the final output list, and its label is the combination of all pieces of text whose similarity scores with this box are higher than text_threshold. In this case, both woman and ab have high scores for this box, so the label becomes womanab. This can produce confusing results for users.
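The two-threshold rule described above can be sketched roughly as follows. This is a simplified illustration, not the library's code: `labels_for_boxes` is a hypothetical helper, the scores are made up, and the split of swab into the pieces sw and ##ab is an assumption about the tokenizer.

```python
def labels_for_boxes(tokens, scores, box_threshold, text_threshold):
    """For each box, keep it only if its best token score exceeds
    box_threshold, then collect every token scoring above text_threshold."""
    results = []
    for box_scores in scores:
        if max(box_scores) <= box_threshold:
            continue  # box is discarded entirely
        kept = [tok for tok, s in zip(tokens, box_scores) if s > text_threshold]
        results.append(kept)
    return results


# Assumed token sequence for the prompt "woman. swab. balcony. room. bucket. sky."
tokens = ["woman", "sw", "##ab", "balcony", "room", "bucket", "sky"]

# Made-up similarity scores: one row per candidate box, one column per token.
scores = [
    [0.62, 0.10, 0.31, 0.05, 0.04, 0.03, 0.02],  # box 0: "woman" and "##ab" pass
    [0.05, 0.04, 0.03, 0.02, 0.06, 0.55, 0.01],  # box 1: only "bucket" passes
    [0.12, 0.08, 0.09, 0.10, 0.11, 0.07, 0.05],  # box 2: below box_threshold
]

result = labels_for_boxes(tokens, scores, box_threshold=0.4, text_threshold=0.3)
print(result)  # [['woman', '##ab'], ['bucket']]
```

Note that the kept tokens for a box do not have to belong to the same input label, which is how a mixed label can appear.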

yuerout · 2024-12-12T00:12:21Z

Hi @rentainhe, thanks for the response! I'm still a bit confused -- Why would a score be computed for ab in the first place? This is not in the list of labels. It looks like a fusion of woman and swab.
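One plausible explanation, offered as an assumption rather than something confirmed in this thread: the text encoder scores subword tokens, not whole labels. If swab is not in the tokenizer vocabulary, a WordPiece-style tokenizer splits it into pieces such as sw and ##ab, and a ##ab piece that survives thresholding is glued onto whatever token precedes it when the label is decoded. The toy vocabulary and the helpers `wordpiece` and `decode` below are hypothetical.

```python
def wordpiece(word: str, vocab: set[str]) -> list[str]:
    """Greedy longest-match-first split; non-initial pieces carry a '##' prefix."""
    pieces, start = [], 0
    while start < len(word):
        end, piece = len(word), None
        while end > start:
            cand = word[start:end]
            if start > 0:
                cand = "##" + cand
            if cand in vocab:
                piece = cand
                break
            end -= 1
        if piece is None:
            return ["[UNK]"]  # no piece of the vocabulary matches
        pieces.append(piece)
        start = end
    return pieces


def decode(pieces: list[str]) -> str:
    """Join pieces with spaces, then merge '##' continuations into the previous piece."""
    return " ".join(pieces).replace(" ##", "")


# Toy vocabulary (an assumption): "swab" itself is missing, so it must be split.
VOCAB = {"woman", "sw", "##ab", "balcony", "room", "bucket", "sky"}

print(wordpiece("swab", VOCAB))   # ['sw', '##ab']
print(decode(["sw", "##ab"]))     # swab
print(decode(["woman", "##ab"]))  # womanab
```

Under that assumption, ab is never scored as a label of its own; it is the tail of swab that happened to pass text_threshold for the same box as woman.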

rentainhe pushed a commit that referenced this issue Dec 21, 2024

open README.md with unicode (to support Hugging Face emoji); fix va…

7e1596c

…rious typos (#218) (close #217, #66, #67, #69, #91, #126, #127, #145)
