undefined | Better HN

0 pointswestoncb8mo ago0 comments

Gotcha. Yeah, give o3 a try. If you don't want to get a sub, you can use it over the api for pennies. They do have you do this biometric registration thing that's kind of annoying if you want to use over api though.

You can get the Google pro subscription (forget what they call it) that's ordinarily $20/mo for free right now (1 month free; can cancel whenever), which gives unlimited Gemini 2.5 Pro access.

0 comments

andrepd8mo ago

> Gotcha. Yeah, give o3 a try. If you don't want to get a sub, you can use it over the api for pennies. They do have you do this biometric registration thing that's kind of annoying if you want to use over api though.

I hope you appreciate just how crazy this sentence sounds, even in an age when this is normalised.

westoncbOP8mo ago

Yep, it's surreal.

slacktivism1238mo ago

Yeah, this model didn't work it seems.

You're holding it wrong. You need to utter the right series of incantations to get some semblance of truth.

What, you used the model that was SOTA one week ago? Big mistake, that explains why.

You need to use this SOTA model that came out one day ago instead. That model definitely wasn't trained to overfit the week-old benchmarks and dismiss the naysayers. Look, a pelican!

What? You haven't verified your phone number and completed a video facial scan and passed a background check? You're NGMI.

Hackbraten8mo ago

> Look, a pelican!

Love this reference :)

bradly8mo ago

Thank you for the tip on o3. I will switch to that and see how it goes. I do have a paid sub for ChatGPT, but from the dropdown model descriptions "Great at coding" sounded better than "Advanced reasoning". And 4 is like almost twice as much as 3.

TeMPOraL8mo ago

In my current experience:

- o3 is the bestest and my go-to, but its strength comes from it combining reasoning with search - it's the one model you can count on finding things out for you instead of going off vibe and training data;

- GPT 4.5 feels the smartest, but also has tight usage limits and doesn't do search like o3 does; I use it when I need something creative done, or switch to it mid-conversation to have it reason off an already primed context;

- o4-mini / o4-mini-hard - data transformation, coding stuff that doesn't require looking things up - especially when o3 looked stuff up already, and now I just need ChatGPT to apply it into code/diagrams;

- gpt-4o - only for image generation, and begrudgingly when I run out of quota on GPT 4.5

o3 has been my default starting model for months now; most of my queries generally benefit from having a model that does autonomous reasoning+search. Agentic coding stuff, that I push to Claude Code now.

andrepd8mo ago

I've heard my grandma talk about Catholic saints and their powers with a not dissimilar kind of discourse.

1 more reply

agos8mo ago

the fact that one needs to know stuff like this and that it changes every three months seriously limits the usefulness of LLMs for me

2 more replies

thom8mo ago

I’d also recommend basically always having search enabled. That’s eliminated major hallucinations for me.

westoncbOP8mo ago

lol yep, fully get that. And I mean I'm sure o4 will be great but the '-mini' variant is weaker. Some of it will come down to taste and what kind of thing you're working on too but personal preferences aside, from the heavy LLM users I talk to o3 and gemini 2.5 pro at the moment seem to be top if you're dialoging with them directly (vs using through an agent system).

j / k navigate · click thread line to collapse

0 comments

andrepd8mo ago

I hope you appreciate just how crazy this sentence sounds, even in an age when this is normalised.

westoncbOP8mo ago

Yep, it's surreal.

slacktivism1238mo ago

Yeah, this model didn't work it seems.

You're holding it wrong. You need to utter the right series of incantations to get some semblance of truth.

What, you used the model that was SOTA one week ago? Big mistake, that explains why.

You need to use this SOTA model that came out one day ago instead. That model definitely wasn't trained to overfit the week-old benchmarks and dismiss the naysayers. Look, a pelican!

What? You haven't verified your phone number and completed a video facial scan and passed a background check? You're NGMI.

Hackbraten8mo ago

> Look, a pelican!

Love this reference :)

bradly8mo ago

TeMPOraL8mo ago

In my current experience:

- gpt-4o - only for image generation, and begrudgingly when I run out of quota on GPT 4.5

andrepd8mo ago

I've heard my grandma talk about Catholic saints and their powers with a not dissimilar kind of discourse.

1 more reply

agos8mo ago

the fact that one needs to know stuff like this and that it changes every three months seriously limits the usefulness of LLMs for me

2 more replies

thom8mo ago

I’d also recommend basically always having search enabled. That’s eliminated major hallucinations for me.

westoncbOP8mo ago

j / k navigate · click thread line to collapse