https://gist.github.com/simonw/68560eddb0b268a8417f80ceb7304...
The high one is notably better - the bicycle frame is the correct shape, unlike thinking level low.
For comparison, here's Opus 4.7: https://gist.github.com/simonw/afcb19addf3f38eb1996e1ebe749c...
Here's an article from 2 months ago for example: https://www.theguardian.com/technology/commentisfree/2026/ma...
It was also implicated in the bombing of a girls elementary school which left 168 dead. The US did a "triple tap" to kill any first responders.
https://www.theguardian.com/news/2026/mar/26/ai-got-the-blam...
https://www.theguardian.com/technology/2026/apr/01/dont-blam...
> Neither Claude nor any other LLMs detects targets, processes radar, fuses sensor data or pairs weapons to targets. LLMs are late additions to Palantir’s ecosystem. In late 2024, years after the core system was operational, Palantir added an LLM layer – this is where Claude sits – that lets analysts search and summarise intelligence reports in plain English
There’s a lot of humans in that loop who make those decisions.
if you kill somebody while trying to render a pelican on a bicycle it's a real problem.
Depending on the how pelicans are created, it is entirely possible to indirectly kill "somebody" due to the externalised costs of global warming etc.
No, the handlebar is wrong. The handle bar is rotating the frame instead of rotating the front wheel. The handle bar should be mounted on the same line as the front wheel is.
Hopefully 4.9 will read my comments :)
https://www.gianlucagimini.it/portfolio-item/velocipedia/
Turns out even humans can be pretty bad at drawing bicycles :)
https://duckduckgo.com/?q=cannondale+lefty&iar=images&t=ffab
Haha
No guarantees is why LLM is akin to gambling. Every new context is essentially picking someone out of the crowd.
https://tools.simonwillison.net/markdown-svg-renderer#url=ht...
medium: redesign bike so peli can reach bars
high: redesign bike so peli can rest on frame
xhigh: yolo
max: big peli reach bars
For max I used 25 input, 17,167 output which cost me 43 cents! https://www.llm-prices.com/#it=25&ot=17167&ic=5&oc=25&sel=cl...
UPDATE: My mistake, the API does support max. I added a max one at the bottom of this page (cost 43 cents): https://tools.simonwillison.net/markdown-svg-renderer#url=ht...
https://gist.github.com/fendy3002/3026a8c4d67d1301666ec40fc0...
looks like the model already trained well on both bicycle and pelicans
...but that pelican's little helmet is adorable.