Bounding box detection error (Gemini 2.0 (experimental) API) #345

Open
Soclatos opened this issue Dec 12, 2024 · 0 comments

Description of the bug:

I used the latest Gemini 2.0 (experimental) API for image recognition. My test image is a matrix of letters, but when the returned bounding boxes are visualized, they do not line up with the letter positions.
I used the same code as https://github.com/google-gemini/cookbook/blob/main/gemini-2/spatial_understanding.ipynb, and the result is shown in the attached images.
I also tried other visualization methods and modified the code myself, but the boxes were still misplaced.
Testing with Google AI Studio's Starter Apps gave the same issue.
So I'm not sure whether the problem lies in the normalized coordinates returned by the API.
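
For reference, here is a minimal sketch of the coordinate conversion I would expect to be correct. It assumes the convention used in the cookbook notebook, where each box is `[ymin, xmin, ymax, xmax]` normalized to a 0-1000 range. The helper name `to_pixel_box` is my own and not part of any API.

```python
def to_pixel_box(box, width, height):
    """Convert a normalized box to pixel coordinates.

    Assumption: `box` is [ymin, xmin, ymax, xmax] on a 0-1000 scale.
    Returns (left, top, right, bottom) in pixels.
    """
    ymin, xmin, ymax, xmax = box
    left = round(xmin * width / 1000)
    top = round(ymin * height / 1000)
    right = round(xmax * width / 1000)
    bottom = round(ymax * height / 1000)
    return (left, top, right, bottom)


# Hypothetical values, for illustration only (not actual API output).
print(to_pixel_box([100, 200, 300, 400], width=2000, height=1000))
```

Note that `width` and `height` should be the dimensions of the image the boxes are drawn on. If the x and y values are swapped, or the image is resized between the request and the drawing step, the boxes would be shifted even when the returned coordinates are fine.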

[Attached images: test7, result_googleaitest, Snipaste_2024-12-12_21-21-56]

Actual vs expected behavior:

No response

Any other information you'd like to share?

No response
