undefined | Better HN

0 pointsfuryofantares12d ago0 comments

I'd been on Codex for a while and with Codex 5.2 I:

1) No longer found the dumb zone

2) No longer feared compaction

Switching to Opus for stupid political reasons, I still have not had the dumb zone - but I'm back to disliking compaction events and so the smaller context window it has, has really hurt.

I hope they copy OpenAI's compaction magic soon, but I am also very excited to try the longer context window.

0 comments

pjerem12d ago

If you use OpenCode (open source Claude Code implementation), you can configure compaction yourself : https://opencode.ai/docs/en/config/#compaction

furyofantaresOP11d ago

OpenAI has some magic they do on their standalone endpoint (/responses/compact) just for compaction, where they keep all the user messages and replace the agent messages or reasoning with embeddings.

> This list includes a special type=compaction item with an opaque encrypted_content item that preserves the model’s latent understanding of the original conversation.

Some prior discussion here https://news.ycombinator.com/item?id=46737630#46739209 regarding an article here https://openai.com/index/unrolling-the-codex-agent-loop/

comboy12d ago

Not sure if it's a common knowledge but I've learned not that long ago that you can do "/compact your instructions here", if you just say what you are working on or what to keep explicitly it's much less painful.

In general LLMs for some reason are really bad at designing prompts for themselves. I tested it heavily on some data where there was a clear optimization function and ability to evaluate the results, and I easily beat opus every time with my chaotic full of typos prompts vs its methodological ones when it is writing instructions for itself or for other LLMs.

brookst12d ago

You can also put guidance for when to compact and with what instructions into Claude.md. The model itself can run /compact, and while I try to remember to use it manually, I find it useful to have “If I ask for a totally different task and the current context won’t be useful, run /compact with a short summary of the new focus”

copperx11d ago

I ofter wonder if I'm missing something, but shouldn't we be able to edit the context manually???

In that way we could erase prompts and responses that didn't yield anything useful or derailed the model.

Why can't we do that?

genewitch12d ago

so you have to garbage collect manually for the AI?

also, i don't want to make a full parent post

1M tokens sounds real expensive if you're constantly at that threshold. There's codebases larger in LOC; i read somewhere that Carmack has "given to humanity" over 1 million lines of his code. Perhaps something to dwell on

karmasimida12d ago

This is true.

When I am using codex, compaction isn’t something I fear, it feels like you save your gaming progress and move on.

For Claude Code compaction feels disastrous, also much longer

mgambati12d ago

1m context in OpenAI and Gemini is just marketing. Opus is the only model to provide real usable bug context.

furyofantaresOP12d ago

I'm directly conveying my actual experience to you. I have tasks that fill up Opus context very quickly (at the 200k context) and which took MUCH longer to fill up Codex since 5.2 (which I think had 400k context at the time).

This is direct comparison. I spent months subscribed to both of their $200/mo plans. I would try both and Opus always filled up fast while Codex continued working great. It's also direct experience that Codex continues working great post-compaction since 5.2.

I don't know about Gemini but you're just wrong about Codex. And I say this as someone who hates reporting these facts because I'd like people to stop giving OpenAI money.

throwthrowuknow12d ago

I agree even though I used to be a die hard Claude fan I recently switched back to ChatGPT and codex to try it out again and they’ve clearly pulled into the lead for consistency, context length and management as well as speed. Claude Code instilled a dread in me about keeping an eye on context but I’m slowly learning to let that go with codex.

HarHarVeryFunny11d ago

Surely compaction is down to the agent rather than the model, so are you comparing Claude Code to Codex CLI?

1 more reply

sagarpatil12d ago

This has been my experience too.

1 more reply

hu312d ago

Source? I ask because I use 500k+ context on these on a daily basis.

Big refactorings guided by automated tests eat context window for breakfast.

8note12d ago

i find gemini gets real real bad when you get far into the context - gets into loops, forgets how to call tools, etc

3 more replies

Bolwin12d ago

How many big refactorings are you doing? And why?

1 more reply

johnebgd12d ago

Codex high reasoning has been a legitimately excellent tool for generating feedback on every plan Claude opus thinking has created for me.

radicality11d ago

Using Codex more for now, and there is definitely some compaction magic. I’m keeping the same conversation going and going for days, some at almost 1B tokens (per the codex cli counters), with seemingly no coherency loss

iknowstuff12d ago

Hmm I’ve felt the dumb zone on codex

nomel12d ago

From what I've seen, it means whatever he's doing is very statistically significant.

j / k navigate · click thread line to collapse

0 comments

pjerem12d ago

If you use OpenCode (open source Claude Code implementation), you can configure compaction yourself : https://opencode.ai/docs/en/config/#compaction

furyofantaresOP11d ago

OpenAI has some magic they do on their standalone endpoint (/responses/compact) just for compaction, where they keep all the user messages and replace the agent messages or reasoning with embeddings.

> This list includes a special type=compaction item with an opaque encrypted_content item that preserves the model’s latent understanding of the original conversation.

Some prior discussion here https://news.ycombinator.com/item?id=46737630#46739209 regarding an article here https://openai.com/index/unrolling-the-codex-agent-loop/

comboy12d ago

brookst12d ago

copperx11d ago

I ofter wonder if I'm missing something, but shouldn't we be able to edit the context manually???

In that way we could erase prompts and responses that didn't yield anything useful or derailed the model.

Why can't we do that?

genewitch12d ago

so you have to garbage collect manually for the AI?

also, i don't want to make a full parent post

karmasimida12d ago

This is true.

When I am using codex, compaction isn’t something I fear, it feels like you save your gaming progress and move on.

For Claude Code compaction feels disastrous, also much longer

mgambati12d ago

1m context in OpenAI and Gemini is just marketing. Opus is the only model to provide real usable bug context.

furyofantaresOP12d ago

I don't know about Gemini but you're just wrong about Codex. And I say this as someone who hates reporting these facts because I'd like people to stop giving OpenAI money.

throwthrowuknow12d ago

HarHarVeryFunny11d ago

Surely compaction is down to the agent rather than the model, so are you comparing Claude Code to Codex CLI?

1 more reply

sagarpatil12d ago

This has been my experience too.

1 more reply

hu312d ago

Source? I ask because I use 500k+ context on these on a daily basis.

Big refactorings guided by automated tests eat context window for breakfast.

8note12d ago

i find gemini gets real real bad when you get far into the context - gets into loops, forgets how to call tools, etc

3 more replies

Bolwin12d ago

How many big refactorings are you doing? And why?

1 more reply

johnebgd12d ago

Codex high reasoning has been a legitimately excellent tool for generating feedback on every plan Claude opus thinking has created for me.

radicality11d ago

iknowstuff12d ago

Hmm I’ve felt the dumb zone on codex

nomel12d ago

From what I've seen, it means whatever he's doing is very statistically significant.

j / k navigate · click thread line to collapse