Bounding box detection error (Gemini 2.0 (experimental) API) #345

Soclatos · 2024-12-12T13:23:34Z

Description of the bug:

I used the latest Gemini 2.0 (experimental) API for image recognition. My test image is a matrix of letters, but the results returned and the visualization do not match. The bounding boxes do not correctly correspond to the letter positions.
I used the same code as https://github.com/google-gemini/cookbook/blob/main/gemini-2/spatial_understanding.ipynb, and the result is shown in the image.
I also tried other visualization methods, modified the code myself, but it still didn't work.
I also used Google AI Studio's Starter Apps for testing and encountered the same issue.
So I'm not sure if there's an issue with the normalized coordinates returned by the API.

Actual vs expected behavior:

No response

Any other information you'd like to share?

No response

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bounding box detection error (Gemini 2.0 (experimental) API) #345

Bounding box detection error (Gemini 2.0 (experimental) API) #345

Soclatos commented Dec 12, 2024

Bounding box detection error (Gemini 2.0 (experimental) API) #345

Bounding box detection error (Gemini 2.0 (experimental) API) #345

Comments

Soclatos commented Dec 12, 2024

Description of the bug:

Actual vs expected behavior:

Any other information you'd like to share?