Claude Code SDK (opens in new tab)

(docs.anthropic.com)

454 pointssync10mo ago194 comments

194 comments

The way Claude Code is going is exactly what I want out of a agentic coding tool with this "unix toolish" philosophy. I've been using Claude code since the initial public preview release, and have seen the direction over time.

The "golden" end state of coding agents is that you give it a Feature Request (EG Jira ticket), and it gives you a PR to review and give feedback on. Cursor, windsurf, etc, are dead ends in that sense as they are local editors, and can not be in CI.

If you are tooling your codebase for optimal AI usage (Rules, MCP, etc), you should target a technology that can bridge the gap to headless usage. The fact Claude Code can trivially be used as part of automation through the tools means it's now the default way I thinking about coding agents (Codex, the npm package, is the same).

Disclaimer, I focus on helping companies tool their codebases for optimal agent usage, so I might have a bias here to easily configurable tools.

jdmoreira10mo ago

Not sure about that golden end state. Mine would be being in a room surround by screens with AI agents coding, designing, testing, etc. I would be there in the center giving guidance, direction, applying taste, etc… All conversational, wouldn’t need to touch the keyboard 99% of the time.

That's what I want and look forward one day

Roritharr10mo ago

Is this a me thing, or a millenial thing?

I hate using voice for anything. I hate getting voice messages, I hate creating them. I get cold sweats just thinking about having to direct 10 AI Agents via voice. Just give me a keyboard and a bunch of screens, thanks.

10 more replies

csto1210mo ago

If that’s the future, that means a massive reduction in software engineers no? What you are describing would require one technical product manager, not a team of software engineers.

5 more replies

geertj10mo ago

I can easily see this happening in 2-3 years. Some chat apps already have outstanding voice mode, such as GPT-4o. It's just a matter of integrating that voice mode, and getting the understanding and generated code to be /slightly/ better than it is today.

1 more reply

rco878610mo ago

It seems unlikely that any one individual would be able to output a sufficient amount of context for that to not go off the rails really quickly (or just be extremely inefficient as most agents sit idle waiting for verification of their work)

cortesoft10mo ago

Basically the Star Trek model of computing.

1 more reply

arguflow10mo ago

In this "end state" what would the AI mind machine even have to code?

chamomeal10mo ago

That sounds like torture for me lol

dakiol10mo ago

No. The "golden" end state of coding agents is free and open source coding agents running on my machine (or in whatever machine I want). Can you imagine paying for every command you run in your terminal? For every `ls`, `ps`, `kill`? No sense, right? Well, same for LLMs.

I'm not saying "ban propietary LLMs", I'm saying: hackers (the ones that used to read sites like this) should have as their main tools free and open source ones.

dontlikeyoueith10mo ago

> Can you imagine paying for every command you run in your terminal?

Yes, because hardware and electricity aren't free.

I literally DO pay for every command. I just don't get an itemized bill so there's no transparency about it. Instead, I made some lump-sum hardware payment which is amortized over the total usage I get out of it, plus some marginal increase in my monthly electric bill when I use it.

1 more reply

notpushkin10mo ago

I agree with the sentiment, but isn’t Claude Code (the CLI) FOSS already? (Not sure it’s coupled to Claude the model API either, but if it is I imagine it’s not too hard to fix.)

1 more reply

syncOP10mo ago

Anthropic also announced something along those lines today as well, in beta: https://docs.anthropic.com/en/docs/claude-code/github-action...

MattSayar10mo ago

How did you find this? It doesn't pop up on any news sections on their site. I want to be on top of these kinds of things too!

2 more replies

breckenedge10mo ago

> Cursor, windsurf, etc, are dead ends in that sense as they are local editors, and can not be in CI.

I was doing this with Cursor and MCPs. Got about a full day of this before I was rate limited and dropped to the slowest, dumbest model. I’ve done it with Claude too and quickly exhaust my rate limits. And the PRs are only “good to go” about 25% of the time, and it’s often faster to just do it right than find out where the AI screwed up.

andrewstuart10mo ago

> The "golden" end state of coding agents is that you give it a Feature Request (EG Jira ticket), and it gives you a PR to review and give feedback on.

I see your point but in the other hand how depressing to be left only with the most soul crushing part of software entering - the Jira ticket.

d_watt10mo ago

I personally find figuring out what the product should be is the fun part. There still a need for architecting a plan, but the actual act of writing code isn't what gives me personal joy, it's the building of something new.

I understand the craft of code itself is what some people love though!

1 more reply

btbuildem10mo ago

Say what you will, but this would have the wonderful side effect of forcing people who write JIRA tickets to actually think through and clearly express what it is they want built.

3 more replies

pjmlp10mo ago

The moment I am able to outsource work for Jira tickets to a level that AI actually delivers a reasonable pull request, many corporate managers will seriously wonder why keep the offshoring team around.

ryandrake10mo ago

It seems like the Holy Grail here has become: "A business is one person, the CEO, sitting at his desk doing deals and directing virtual and physical agents to do accounting, run factories, manage R&D, run marketing campaigns, everything." That's it. A single CEO, (maybe) a lawyer, and a big AI/robotics bill = every business. No pesky employees to pay. That's the ultimate end game here, that's what these guys want. Is that what we want?

4 more replies

dgb2310mo ago

So far, automation has only ever increased the need for software development. Jevons Paradox plus the recursive nature of software means that there's always more stuff to do.

The real threats to our profession are things like climate change, extreme wealth concentration, political instability, cultural regression and so on. It's the stuff that software stands on that one should worry about, not the stuff that it builds towards.

chrsw10mo ago

Maybe I’m not think big picture enough… but have you ever tried using generative AI (i.e., a transformer) to create a circuit schematic? They fail miserably. Worse than Chat GPT-2 at generating text.

The current SOTA models can do some impressive things, in certain domains. But running a business is way more than generating JavaScript.

The way I see it, only some jobs will be impacted by generative AI in the near term. Not replaced, augmented.

yahoozoo10mo ago

Why would they pay you six figures to outsource to AI when they could pay offshore a fraction of that to do the same?

1 more reply

StefanBatory10mo ago

Offshoring team?

No, any team.

2 more replies

k__10mo ago

Can't you have that already?

Put the Aider CLI into a GitHub action that's triggered by an issue creation and you're good to go.

d_watt10mo ago

Aider is definitely in the same camp. Last time I checked, they weren't optimizing for the full "agent infinitely looping until completion" usecase, and didn't have MCP support.

But it's 100% the same class of tool and the awesome part of the unixy model is hopefully agents can be substituted in for each other in your pipeline for whichever one is better for the usecase, just like models are interoperable.

1 more reply

alvis10mo ago

The vision of submitting a feature request and receiving a ready-to-review PR is equally compelling and horrifying from the standpoint of strategy management.

Like Anthropic and most big tech companies, they don't want to show off the best until they need to. They used to stockpile some cool features, and they have time to think about their strategy. But now I feel like they are in a rush to show off everything and I'm worried whether the management has time to think about the big picture.

arkadiytehgraet10mo ago

You should use some of those agents yourself to fix some glaring issues at your landing page.

morsecodist10mo ago

Setting aside predictions about the future and what is best for humanity and all that for a moment this is just such a bummer on a personal level. My whole job would become the worst parts of my job.

max_on_hn10mo ago

(please pardon the self-promotion) This is exactly what my product https://cheepcode.com does (connects to your Linear/Jira/etc and submits PRs to GitHub) - I agree that’s the golden state, and that’s why I’m rushing to get out of private beta as fast as I can* :) It’s a bootstrapped operation right now which limits my speed a bit but this is the vision I’ve been working towards for the past few months.

*I have a few more safety/scalability changes to make but expecting public launch in a few weeks!

virgildotcodes10mo ago

> The "golden" end state of coding agents is that you give it a Feature Request (EG Jira ticket), and it gives you a PR to review and give feedback on. Cursor, windsurf, etc, are dead ends in that sense as they are local editors, and can not be in CI.

Isn’t that effectively the promise of the most recently released OpenAI codex?

From the reviews I’ve been able to find so far though, quality of output is ehh.

d_watt10mo ago

It totally is!

I bias a bit to wanting the agent to be a pluggable component into a flow I own, rather than a platform in a box.

It'll be interesting to see where the different value props/use cases of a Delvin/v0 vs a Codex Cloud vs Claude Code/Codex CLI vs Cursor land.

ramesh3110mo ago

Thats the promise. The reality is that it's just a subpar version of Claude Code which doesn't support MCP.

mistrial910mo ago

golden age consultant paycheck

naiv10mo ago

played around with connecting https://github.com/eyaltoledano/claude-task-master via mcp to create a prd which basically replaces the ticket grooming process and then executing it with claude code creating a branch named like the ticket and pushing after having created the unit tests and constant linting.

Vanclief10mo ago

Claude Code is my favorite way to use LLMs for coding.

However I feel what we really need is to have an open source version of it where you can pass any model and also you can compare different models answers.

(Aider and other alternatives really doesn't feel as good to use as Claude Code)

I know this is not what anthropic would want to do as it removes their moat, but as a consumer I just want the best model and not be tied to an ecosystem. (Which I imagine is the largest fear of LLM model providers)

ayargz10mo ago

OpenAI codex is probably the closest to what you're talking about, its open source and you can use models from any provider. It's not as good as claude code right now but I bet it wont take long for them to catch up.

https://github.com/openai/codex/tree/main

jennings_hunter10mo ago

You might be interested in the OpenCode project: https://github.com/opencode-ai/opencode

It's still under development but looks promising.

energy12310mo ago

> (Aider and other alternatives really doesn't feel as good to use as Claude Code)

What does Claude Code do better than Aider?

baalimago10mo ago

I've self-plugged too hard of late. But check my previous comments for a service I wrote in go, which is exactly what you're asking for.

greyman10mo ago

Question: I agree, but doesn't that other model need to be trained so it knows how to work with MCP servers? Or that isn't an issue?

Semtexzv10mo ago

https://termineer.io/

pram10mo ago

You can use Claude Code as an MCP server so you can kinda do this already.

anotherpaulg10mo ago

Aider has had support for Python and shell scripting [0] for a long time. I made a screencast [1] recently that included ad-hoc bash scripting aider as part of the effort to add support for 130 new programming languages. It may give a flavor for how powerful this approach can be.

[0] https://aider.chat/docs/scripting.html

[1] https://aider.chat/docs/recordings/tree-sitter-language-pack...

hztar10mo ago

Freaking love Aider. MCPs are supported soon as well. Testing a development branch. Then you can actually develop end to end using PR, tickets etc using models you trust.

jacob01910mo ago

That's great news! Love Aider too and that's the main thing that's missing right now. Oh the things I will build.

unshavedyak10mo ago

How close can you get Aider to Claude Code? Ie i liked the Claude Code UX, but i don't use it because i prefer Gemini 2.5 Pro.

I don't really want it committing and stuff, i mostly like the UX of Claude Code. Thoughts?

m3kw910mo ago

You can turn off auto commit

1 more reply

k__10mo ago

Aider could really profit from a polished GitHub Actions workflow.

Add a file to your repo and you can talk to any model via issues.

swyx10mo ago

more context from the claude code team: http://latent.space/p/claude-code

you can skim the transcript but some personal highlights:

- anthropic employees, with unlimited claude, average to $6/day of usage

- headless claude code as a "linux" utility that you use everywhere in CI is pretty compelling

- claude code as a user extensible platform

- future roadmap of claude code: sandboxing, branching, planning

- sonnet 3.7 as a persistent, agentic model

philosophty10mo ago

"- anthropic employees, with unlimited claude, average to $6/day of usage"

From the link:

"Apparently, there are some engineers inside of Anthropic that have spent >$1,000 in one day!"

The question is what is the P50, P75, and P95 spend per employee?

thesurlydev10mo ago

Agree. That would be a great insight as well as what type of activities cause the explosion in spend.

1 more reply

ipsum210mo ago

Maybe I'm holding it wrong, but I can easily spend $20+ using Claude Code for 2 hours. I've stopped using it because it was too expensive for my personal projects.

jasonjmcghee10mo ago

I briefly commented on how I approach cost control before, if useful.

https://news.ycombinator.com/item?id=43737060

2 more replies

d_watt10mo ago

Claude max plan has Claude code bundled into the price. $100/month isn't cheap, but the RoI is there for me personally.

3 more replies

sumeno10mo ago

Maybe engineers at Anthropic use it less because they fully understand the limitations and drawbacks

sagarpatil10mo ago

Why not use Claude Max Plan? Starts at $100.

big_toast10mo ago

I’ve really enjoyed the recent latent space podcasts. I don’t think there is any person†/podcast (or perhaps other content) approaching your general output while maintaining the high SNR. I am continually amazed at the volume and value of public work you’re producing over the last (half?) decade while still growing various businesses. I hope others can find similar productivity gradients. I know you roughly share what works for you but it is not so easy to reproduce.

† simonw, gwern

swyx10mo ago

thanks man, this was nice to read :) idk if it helps but my principles (tm) are here http://learninpublic.org/

i do feel like SNR * quantity could be higher, but its still a challenge to even keep it where it is today. my work life balance/stress levels aren't the best and everyone expects everything from me.

re5i5tor10mo ago

Agree 100%

shostack10mo ago

How neutral was the podcast vs being a sales pitch for this?

woah10mo ago

If I was making an AI code assistant, the last thing I would do is to lock it in to a particular foundation model provider.

The only possible way for this to be a successful offering is if we have just now reached a plateau of model effectiveness and all foundation models will now trend towards having almost identical performance and capabilities, with integrators choosing based on small niceties, like having a familiar SDK.

ChadMoran10mo ago

Other than the command/arguments there isn't much locking you in. It's just input/output. Swap it out for something else or simply wrap it. There's not much going on here.

ramoz10mo ago

"Lock i-"

At this point Claude Code is a software differentiator in the agent coding space.

I am building things related to AI code assistants - we were hacking ways to integrate Claude Code - it was the first thing we wanted to build around.

It's too early to care about lock in.

Need the best, will only build around the best.

Wowfunhappy10mo ago

Claude Code could already be used in non-interactive mode, and by extension it could be integrated into other apps in the same manner as any other UNIX command line utility.

This SDK currently supports only command line usage. Isn't that just what we already had?

I don't understand what's actually new here. What am I missing?

blueorange810mo ago

I don't either - not sure why noone has commented on this as far as i can see

jiangplus10mo ago

I would also recommend Codebuff (https://www.codebuff.com/), a great CLI code assistant comparable to Claude Code, which can save a lot on token costs.

(I am not affiliated with this project, just a user.)

bionhoward10mo ago

> You may not access or use, or help another person to access or use, our Services in the following ways: > 2. To develop any products or services that compete with our Services, including to develop or train any artificial intelligence or machine learning algorithms or models or resell the Services.

Can somebody please tell me what software product or service doesn’t compete with general intelligence?

Imagine selling intelligence with a legal term that, under strict interpretation, says you’re not allowed to use it for anything.

Is it so vague it’s unenforceable?

How do we own the output if we can’t use it to compete with a general intelligence?

Is it just a “lol nerd no one cares about the legal terms” thing? If no one cares then why would they have a blanket prohibition on using the service ?

We’re supposed to accept liability to lose a lawsuit just to accept their slop? So many questions

ChadMoran10mo ago

This is what happens when you let lawyers say what they want.

sumeno10mo ago

You changed the word artificial to general which obviously changes the meaning

m3kw910mo ago

When you have model lock in, it’s a big detriment to use because if anyone comes out with SOTA models, and you have already invested infra development on this, you are stuck. Even if you open it up, it’s likely not to work as your model is likely trained specifically on that CLI. Just look at Codex CLI, you can use Gemini 2.5 pro, but it will get randomly stuck or fail a lot vs OpenAI models

hosainnet10mo ago

The new GitHub action is exactly what I have been looking for https://docs.anthropic.com/en/docs/claude-code/github-action... but there doesn't seem to be a way to use it with the Claude Code's Max plan?

As it only accepts an API key as far as I can tell.

cube222210mo ago

This is great! Especially the GitHub Actions issue/PR integration[0] that’s paired with this is exactly what I’ve been wanting!

[0]: https://docs.anthropic.com/en/docs/claude-code/github-action...

mirekrusin10mo ago

I'll try when they start supporting claude via copilot. Can't use at work anything else.

sean_10mo ago

You can use this in the cloud through a nice UI in https://cloudcoding.ai/chat

sean_10mo ago

https://cloudcoding.ai/ is a way to use a similar claude code sdk in the cloud!

bilater10mo ago

I don't understand why this isn't just baked into cursor/windsurf? I think in reality that's where most devs will use it.

andrewstuart10mo ago

Claude has been left in the dust by Gemini with its million token session and ability to upload a zip file of my entire code base.

Sajarin10mo ago

I wonder if anyone has done an analysis on the HN user sentiment on the varying AI models over time. I'd be curious to see what that looks like. Increasingly, I'm seeing more and more people talk positively about Gemini and Google (and having used Gemini recently, I align with that sentiment)

I think Bard (lol) and Gemini got a late start and so lots of folks dismissed it but I feel like they've fully caught up. Definitely excited to see what Gemini 3 vs GPT-5 vs Claude 4 looks like!

fallinditch10mo ago

I'm using Windsurf IDE so have all the main models available. Mainly doing Python, JS, HTML, CSS, some Go. I have found Claude 3.7 outperforms Gemini 2.5 and ChatGPT 4.1, 4o, Deepseek, etc, for my work in most cases.

I suspect that I experience some performance throttling with Gemini 2.5 in my Windsurf setup because it's just not as good as anecdotal reports by others, and benchmarks.

I also seem to run up against a kind of LLM laziness sometimes when they seemingly can't be bothered to answer a challenging prompt ... a consequence of load balancing in action perhaps.

1 more reply

mbesto10mo ago

Who cares about sentiment when you can just look at a proxy for usage: https://openrouter.ai/rankings

EDIT: Specifically: https://openrouter.ai/rankings/programming?view=week

Karrot_Kream10mo ago

Gemini hit the top of a bunch of leaderboards recently so it probably prompted folks to try Gemini out and they found it useful.

ChadMoran10mo ago

Context is only one part of it. I tried using Gemini and got sub par results. comment-laden code with not not following instructions.

cube222210mo ago

I’ve tried Gemini 2.5 Pro a couple of times and honestly don’t like its output. Claude Sonnet 3.7 is much better at correctly understanding and executing my imprecise prompts.

Gemini 2.5 Flash on the other hand has excellent. I’ve started using it to rewrite whole files after talking the changes through with Claude, because it’s just so ridiculously fast (and dependable enough for applying already outlined changes).

ramoz10mo ago

This is Claude Code.

The two work really well with Gemini as a planner and Claude Code as an executor.

barefootford10mo ago

and then get the honor of copy and pasting all of the changes afterward?

danenania10mo ago

You can try my project Plandex[1] to use Gemini in a way that's comparable to Claude Code without copy-pasting. By default, it combines models from the major providers—Anthropic, OpenAI, and Google.

The default planning/coding models are still Sonnet 3.7 for context size under 200k, but you can switch to Gemini with `\set-model gemini-preview`.

1 - https://github.com/plandex-ai/plandex

andrewstuart10mo ago

“Make me a bash script which creates all the files using heredoc”

Works for a reasonable chunk of files say 5 to 10 that aren’t too big.

No doubt they’ll get to better file access.

Anyhow I’m quite happy to do the copy and paste because Geminis coding and debugging capability is far better than Claude.

termin310mo ago

I'm using this https://github.com/coffeegrind123/gemini-code to use Claude Code with Gemini and it's working perfectly

dimitri-vs10mo ago

Cursor with gemini-2.5 MAX and agentic mode.

I really like the idea of Claude Code but its rare that I fully spec out a feature on my first request and I can't see how it can be used for frontend features that require a lot of browser-centric iteration/debugging to get right.

andy12_10mo ago

I can't say that I don't love Gemini. I use it a lot, and the huge context window does help. But I can also say that I much prefer how Claude writes code.

dgellow10mo ago

claude code still has the best UX IMHO. But I would love to have the million token context, for sure

mickeyp10mo ago

I'm building a browser based tool that runs on your computer, with full tool access of course, that works with all the major models and is far better and more ergonomic to use than code, codex, etc.

If you (or anyone else reading this) wants to try out the upcoming beta give me a ping. (see profile.)

simonw10mo ago

How are you uploading zip files of code to Gemini?

andrewstuart10mo ago

In AI Studio select file upload then select a zip file.

karn9710mo ago

They just need to release a useful model which can actually code now!

hoppp10mo ago

Pretty cool ngl. I think I'll try it

doctorpangloss10mo ago

I’ve cancelled my subscription. Sorry guys…

baalimago10mo ago

Hasn't this been invented already in multiple shapes and forms..? I wrote my own version clai[1] over a year ago which does exactly this, only that it has tools support + is multi vendor.

[1]: https://github.com/baalimago/clai

simonw10mo ago

Looks quite similar to my https://llm.datasette.io tool as well.

Honestly though, CLI tools for accessing LLMs (including piping content in and out of them) is such a clearly good idea I'm glad to see more tools implementing the pattern.

dcre10mo ago

It's very surprising that it has taken this long to see a first-party CLI like this.

1 more reply

kristopolous10mo ago

What is really needed is a usable multiplexed pipeline management and event system.

Then you can instrument through metaprogramming. For instance, an alert system could be:

"If the threshold goes over 1.0, contact the on-call person through their preferred method" - which may work ... maybe.

Or:

if any( "check_condition {x}", condition_set ): find_person("on call", right now).contact("preferred")

... the point is to divide everything up into small one-shots, parallelize them, use it as glue/api. Then you get composability. If you can get a framework for coroutines going then it's real game on. The final step is "needs based pulling" which is an inversion of mcp - contextual streams as event based sub-systems.

Things are still too slow for this to be not painful but that won't be the case forever.

Currently everything is linear. Doesn't have to be ... really doesn't.

j / k navigate · click thread line to collapse

194 comments

d_watt10mo ago

Disclaimer, I focus on helping companies tool their codebases for optimal agent usage, so I might have a bias here to easily configurable tools.

jdmoreira10mo ago

That's what I want and look forward one day

Roritharr10mo ago

Is this a me thing, or a millenial thing?

10 more replies

csto1210mo ago

If that’s the future, that means a massive reduction in software engineers no? What you are describing would require one technical product manager, not a team of software engineers.

5 more replies

geertj10mo ago

1 more reply

rco878610mo ago

cortesoft10mo ago

Basically the Star Trek model of computing.

1 more reply

arguflow10mo ago

In this "end state" what would the AI mind machine even have to code?

chamomeal10mo ago

That sounds like torture for me lol

dakiol10mo ago

I'm not saying "ban propietary LLMs", I'm saying: hackers (the ones that used to read sites like this) should have as their main tools free and open source ones.

dontlikeyoueith10mo ago

> Can you imagine paying for every command you run in your terminal?

Yes, because hardware and electricity aren't free.

1 more reply

notpushkin10mo ago

I agree with the sentiment, but isn’t Claude Code (the CLI) FOSS already? (Not sure it’s coupled to Claude the model API either, but if it is I imagine it’s not too hard to fix.)

1 more reply

syncOP10mo ago

Anthropic also announced something along those lines today as well, in beta: https://docs.anthropic.com/en/docs/claude-code/github-action...

MattSayar10mo ago

How did you find this? It doesn't pop up on any news sections on their site. I want to be on top of these kinds of things too!

2 more replies

breckenedge10mo ago

> Cursor, windsurf, etc, are dead ends in that sense as they are local editors, and can not be in CI.

andrewstuart10mo ago

> The "golden" end state of coding agents is that you give it a Feature Request (EG Jira ticket), and it gives you a PR to review and give feedback on.

I see your point but in the other hand how depressing to be left only with the most soul crushing part of software entering - the Jira ticket.

d_watt10mo ago

I understand the craft of code itself is what some people love though!

1 more reply

btbuildem10mo ago

Say what you will, but this would have the wonderful side effect of forcing people who write JIRA tickets to actually think through and clearly express what it is they want built.

3 more replies

pjmlp10mo ago

ryandrake10mo ago

4 more replies

dgb2310mo ago

So far, automation has only ever increased the need for software development. Jevons Paradox plus the recursive nature of software means that there's always more stuff to do.

chrsw10mo ago

The current SOTA models can do some impressive things, in certain domains. But running a business is way more than generating JavaScript.

The way I see it, only some jobs will be impacted by generative AI in the near term. Not replaced, augmented.

yahoozoo10mo ago

Why would they pay you six figures to outsource to AI when they could pay offshore a fraction of that to do the same?

1 more reply

StefanBatory10mo ago

Offshoring team?

No, any team.

2 more replies

k__10mo ago

Can't you have that already?

Put the Aider CLI into a GitHub action that's triggered by an issue creation and you're good to go.

d_watt10mo ago

Aider is definitely in the same camp. Last time I checked, they weren't optimizing for the full "agent infinitely looping until completion" usecase, and didn't have MCP support.

1 more reply

alvis10mo ago

The vision of submitting a feature request and receiving a ready-to-review PR is equally compelling and horrifying from the standpoint of strategy management.

arkadiytehgraet10mo ago

You should use some of those agents yourself to fix some glaring issues at your landing page.

morsecodist10mo ago

Setting aside predictions about the future and what is best for humanity and all that for a moment this is just such a bummer on a personal level. My whole job would become the worst parts of my job.

max_on_hn10mo ago

*I have a few more safety/scalability changes to make but expecting public launch in a few weeks!

virgildotcodes10mo ago

Isn’t that effectively the promise of the most recently released OpenAI codex?

From the reviews I’ve been able to find so far though, quality of output is ehh.

d_watt10mo ago

It totally is!

I bias a bit to wanting the agent to be a pluggable component into a flow I own, rather than a platform in a box.

It'll be interesting to see where the different value props/use cases of a Delvin/v0 vs a Codex Cloud vs Claude Code/Codex CLI vs Cursor land.

ramesh3110mo ago

Thats the promise. The reality is that it's just a subpar version of Claude Code which doesn't support MCP.

mistrial910mo ago

golden age consultant paycheck

naiv10mo ago

Vanclief10mo ago

Claude Code is my favorite way to use LLMs for coding.

However I feel what we really need is to have an open source version of it where you can pass any model and also you can compare different models answers.

(Aider and other alternatives really doesn't feel as good to use as Claude Code)

ayargz10mo ago

https://github.com/openai/codex/tree/main

jennings_hunter10mo ago

You might be interested in the OpenCode project: https://github.com/opencode-ai/opencode

It's still under development but looks promising.

energy12310mo ago

> (Aider and other alternatives really doesn't feel as good to use as Claude Code)

What does Claude Code do better than Aider?

baalimago10mo ago

I've self-plugged too hard of late. But check my previous comments for a service I wrote in go, which is exactly what you're asking for.

greyman10mo ago

Question: I agree, but doesn't that other model need to be trained so it knows how to work with MCP servers? Or that isn't an issue?

Semtexzv10mo ago

https://termineer.io/

pram10mo ago

You can use Claude Code as an MCP server so you can kinda do this already.

anotherpaulg10mo ago

[0] https://aider.chat/docs/scripting.html

[1] https://aider.chat/docs/recordings/tree-sitter-language-pack...

hztar10mo ago

Freaking love Aider. MCPs are supported soon as well. Testing a development branch. Then you can actually develop end to end using PR, tickets etc using models you trust.

jacob01910mo ago

That's great news! Love Aider too and that's the main thing that's missing right now. Oh the things I will build.

unshavedyak10mo ago

How close can you get Aider to Claude Code? Ie i liked the Claude Code UX, but i don't use it because i prefer Gemini 2.5 Pro.

I don't really want it committing and stuff, i mostly like the UX of Claude Code. Thoughts?

m3kw910mo ago

You can turn off auto commit

1 more reply

k__10mo ago

Aider could really profit from a polished GitHub Actions workflow.

Add a file to your repo and you can talk to any model via issues.

swyx10mo ago

more context from the claude code team: http://latent.space/p/claude-code

you can skim the transcript but some personal highlights:

- anthropic employees, with unlimited claude, average to $6/day of usage

- headless claude code as a "linux" utility that you use everywhere in CI is pretty compelling

- claude code as a user extensible platform

- future roadmap of claude code: sandboxing, branching, planning

- sonnet 3.7 as a persistent, agentic model

philosophty10mo ago

"- anthropic employees, with unlimited claude, average to $6/day of usage"

From the link:

"Apparently, there are some engineers inside of Anthropic that have spent >$1,000 in one day!"

The question is what is the P50, P75, and P95 spend per employee?

thesurlydev10mo ago

Agree. That would be a great insight as well as what type of activities cause the explosion in spend.

1 more reply

ipsum210mo ago

Maybe I'm holding it wrong, but I can easily spend $20+ using Claude Code for 2 hours. I've stopped using it because it was too expensive for my personal projects.

jasonjmcghee10mo ago

I briefly commented on how I approach cost control before, if useful.

https://news.ycombinator.com/item?id=43737060

2 more replies

d_watt10mo ago

Claude max plan has Claude code bundled into the price. $100/month isn't cheap, but the RoI is there for me personally.

3 more replies

sumeno10mo ago

Maybe engineers at Anthropic use it less because they fully understand the limitations and drawbacks

sagarpatil10mo ago

Why not use Claude Max Plan? Starts at $100.

big_toast10mo ago

† simonw, gwern

swyx10mo ago

thanks man, this was nice to read :) idk if it helps but my principles (tm) are here http://learninpublic.org/

i do feel like SNR * quantity could be higher, but its still a challenge to even keep it where it is today. my work life balance/stress levels aren't the best and everyone expects everything from me.

re5i5tor10mo ago

Agree 100%

shostack10mo ago

How neutral was the podcast vs being a sales pitch for this?

woah10mo ago

If I was making an AI code assistant, the last thing I would do is to lock it in to a particular foundation model provider.

ChadMoran10mo ago

Other than the command/arguments there isn't much locking you in. It's just input/output. Swap it out for something else or simply wrap it. There's not much going on here.

ramoz10mo ago

"Lock i-"

At this point Claude Code is a software differentiator in the agent coding space.

I am building things related to AI code assistants - we were hacking ways to integrate Claude Code - it was the first thing we wanted to build around.

It's too early to care about lock in.

Need the best, will only build around the best.

Wowfunhappy10mo ago

Claude Code could already be used in non-interactive mode, and by extension it could be integrated into other apps in the same manner as any other UNIX command line utility.

This SDK currently supports only command line usage. Isn't that just what we already had?

I don't understand what's actually new here. What am I missing?

blueorange810mo ago

I don't either - not sure why noone has commented on this as far as i can see

jiangplus10mo ago

I would also recommend Codebuff (https://www.codebuff.com/), a great CLI code assistant comparable to Claude Code, which can save a lot on token costs.

(I am not affiliated with this project, just a user.)

bionhoward10mo ago

Can somebody please tell me what software product or service doesn’t compete with general intelligence?

Imagine selling intelligence with a legal term that, under strict interpretation, says you’re not allowed to use it for anything.

Is it so vague it’s unenforceable?

How do we own the output if we can’t use it to compete with a general intelligence?

Is it just a “lol nerd no one cares about the legal terms” thing? If no one cares then why would they have a blanket prohibition on using the service ?

We’re supposed to accept liability to lose a lawsuit just to accept their slop? So many questions

ChadMoran10mo ago

This is what happens when you let lawyers say what they want.

sumeno10mo ago

You changed the word artificial to general which obviously changes the meaning

m3kw910mo ago

hosainnet10mo ago

As it only accepts an API key as far as I can tell.

cube222210mo ago

This is great! Especially the GitHub Actions issue/PR integration[0] that’s paired with this is exactly what I’ve been wanting!

[0]: https://docs.anthropic.com/en/docs/claude-code/github-action...

mirekrusin10mo ago

I'll try when they start supporting claude via copilot. Can't use at work anything else.

sean_10mo ago

You can use this in the cloud through a nice UI in https://cloudcoding.ai/chat

sean_10mo ago

https://cloudcoding.ai/ is a way to use a similar claude code sdk in the cloud!

bilater10mo ago

I don't understand why this isn't just baked into cursor/windsurf? I think in reality that's where most devs will use it.

andrewstuart10mo ago

Claude has been left in the dust by Gemini with its million token session and ability to upload a zip file of my entire code base.

Sajarin10mo ago

I think Bard (lol) and Gemini got a late start and so lots of folks dismissed it but I feel like they've fully caught up. Definitely excited to see what Gemini 3 vs GPT-5 vs Claude 4 looks like!

fallinditch10mo ago

I suspect that I experience some performance throttling with Gemini 2.5 in my Windsurf setup because it's just not as good as anecdotal reports by others, and benchmarks.

I also seem to run up against a kind of LLM laziness sometimes when they seemingly can't be bothered to answer a challenging prompt ... a consequence of load balancing in action perhaps.

1 more reply

mbesto10mo ago

Who cares about sentiment when you can just look at a proxy for usage: https://openrouter.ai/rankings

EDIT: Specifically: https://openrouter.ai/rankings/programming?view=week

Karrot_Kream10mo ago

Gemini hit the top of a bunch of leaderboards recently so it probably prompted folks to try Gemini out and they found it useful.

ChadMoran10mo ago

Context is only one part of it. I tried using Gemini and got sub par results. comment-laden code with not not following instructions.

cube222210mo ago

I’ve tried Gemini 2.5 Pro a couple of times and honestly don’t like its output. Claude Sonnet 3.7 is much better at correctly understanding and executing my imprecise prompts.

ramoz10mo ago

This is Claude Code.

The two work really well with Gemini as a planner and Claude Code as an executor.

barefootford10mo ago

and then get the honor of copy and pasting all of the changes afterward?

danenania10mo ago

The default planning/coding models are still Sonnet 3.7 for context size under 200k, but you can switch to Gemini with `\set-model gemini-preview`.

1 - https://github.com/plandex-ai/plandex

andrewstuart10mo ago

“Make me a bash script which creates all the files using heredoc”

Works for a reasonable chunk of files say 5 to 10 that aren’t too big.

No doubt they’ll get to better file access.

Anyhow I’m quite happy to do the copy and paste because Geminis coding and debugging capability is far better than Claude.

termin310mo ago

I'm using this https://github.com/coffeegrind123/gemini-code to use Claude Code with Gemini and it's working perfectly

dimitri-vs10mo ago

Cursor with gemini-2.5 MAX and agentic mode.

andy12_10mo ago

I can't say that I don't love Gemini. I use it a lot, and the huge context window does help. But I can also say that I much prefer how Claude writes code.

dgellow10mo ago

claude code still has the best UX IMHO. But I would love to have the million token context, for sure

mickeyp10mo ago

I'm building a browser based tool that runs on your computer, with full tool access of course, that works with all the major models and is far better and more ergonomic to use than code, codex, etc.

If you (or anyone else reading this) wants to try out the upcoming beta give me a ping. (see profile.)

simonw10mo ago

How are you uploading zip files of code to Gemini?

andrewstuart10mo ago

In AI Studio select file upload then select a zip file.

karn9710mo ago

They just need to release a useful model which can actually code now!

hoppp10mo ago

Pretty cool ngl. I think I'll try it

doctorpangloss10mo ago

I’ve cancelled my subscription. Sorry guys…

baalimago10mo ago

Hasn't this been invented already in multiple shapes and forms..? I wrote my own version clai[1] over a year ago which does exactly this, only that it has tools support + is multi vendor.

[1]: https://github.com/baalimago/clai

simonw10mo ago

Looks quite similar to my https://llm.datasette.io tool as well.

Honestly though, CLI tools for accessing LLMs (including piping content in and out of them) is such a clearly good idea I'm glad to see more tools implementing the pattern.

dcre10mo ago

It's very surprising that it has taken this long to see a first-party CLI like this.

1 more reply

kristopolous10mo ago

What is really needed is a usable multiplexed pipeline management and event system.

Then you can instrument through metaprogramming. For instance, an alert system could be:

"If the threshold goes over 1.0, contact the on-call person through their preferred method" - which may work ... maybe.

Or:

if any( "check_condition {x}", condition_set ): find_person("on call", right now).contact("preferred")

Things are still too slow for this to be not painful but that won't be the case forever.

Currently everything is linear. Doesn't have to be ... really doesn't.

j / k navigate · click thread line to collapse