Show HN: Simple demo tool for Gemini 2.0 Flash object detection

(langtail.com)

2 points · PetrBrzyBrzek · 1y ago · 4 comments

I built a straightforward demo to showcase Gemini's object detection capabilities. Upload any image, get bounding boxes and clean JSON output. It's currently the only model I've found that can accurately return object bounds.

Demo: https://langtail.com/gemini-bounding-boxes

The tool is pretty simple:

- Drop an image or use example ones (apples, llamas)

- Hit detect

- Get visual bounding boxes + JSON output

Happy to answer any questions!

4 comments

Jefro118 · 1y ago

Just been playing around with the bounding box feature myself - do you mind sharing how you figured out how to translate the bounding boxes back to the correct dimensions/position on the original image?
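
For context on the question above: Google's Gemini documentation describes the returned boxes as `[ymin, xmin, ymax, xmax]`, normalized to a 0-1000 range. Under that assumption, mapping a box back onto the image is a matter of scaling each coordinate by the image's width or height. The sketch below is not necessarily how the demo does it; the function name `to_pixel_box` and its parameters are hypothetical.

```python
from typing import Sequence, Tuple


def to_pixel_box(
    box_2d: Sequence[float], width: int, height: int
) -> Tuple[int, int, int, int]:
    """Scale a [ymin, xmin, ymax, xmax] box on a 0-1000 grid to pixels.

    Assumes the coordinate order and 0-1000 normalization described in
    Google's documentation. Returns (left, top, right, bottom).
    """
    ymin, xmin, ymax, xmax = box_2d
    left = round(xmin * width / 1000)    # x values scale with image width
    top = round(ymin * height / 1000)    # y values scale with image height
    right = round(xmax * width / 1000)
    bottom = round(ymax * height / 1000)
    return left, top, right, bottom


# Example: a box on a 640x480 image
print(to_pixel_box([100, 200, 500, 800], 640, 480))  # (128, 48, 512, 240)
```

If the image is shown at a different size than the original, pass the displayed width and height instead so the boxes line up with what is on screen.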

Alifatisk · 1y ago

Why Gemini 2.0 Flash? Any specific reason?

PetrBrzyBrzek (OP) · 1y ago

Gemini 2.0 is the first model that can return precise bounding boxes. Other providers and models can't handle this.

Alifatisk · 1y ago

Oh wow
