These models are marketed as being able to guide the blind or tutoring children using direct camera access.
Promoting those use cases and models failing in these ways is irresponsible. So, yeah, maybe the models are not embarrasing but the hype definitely is.