irthomasthomas
22d ago
Here is an example where the prompt was only a few hundred tokens and the output reasoning chain was correct, but the actual function call was wrong:
https://x.com/xundecidability/status/2005647216741105962?s=2...
jmalicki
22d ago
As a human, I make typos too, and sometimes they're the hardest thing to catch in code review because you know what you meant.
Hopefully there is some kind of lint process to catch my human hallucinations and typos.