The first prompt (with o1) will get you 60% there, but then you have a different workflow. The prompts can reach a local minimum, where Claude/GPT-4/etc. just can't do any better. At that point you need to climb back out and try a different approach.
I recommend git branches to keep track of this. Keep a good working copy in main, and anytime you want to add a feature, make a branch. If you get it almost there, make another branch in case it goes sideways. The biggest issue with developing like this is that you are not a coder anymore; you are a puppet master of a very smart and sometimes totally confused brain.
This is one fact that people seem to severely under-appreciate about LLMs.
They're significantly worse at coding in many respects than even a moderately skilled and motivated intern, but for my hobby projects, until now I haven't had any intern who would even so much as take a stab at some of the repetitive or just not very interesting subtasks, let alone stick with them over and over again without getting tired of it.
In my experience of these tools, including the flagship models discussed here, this is a deal-breaking problem. If I have to waste time re-prompting to make progress, and reviewing and fixing the generated code, it would be much faster if I wrote the code from scratch myself. The tricky thing is that unless you read and understand the generated code, you really have no idea whether you're progressing or regressing. You can ask the model to generate tests for you as well, but how can you be sure they're written correctly, or covering the right scenarios?
More power to you if you feel like you're being productive, but the difficult things in software development always come in later stages of the project[1]. The devil is always in the details, and modern AI tools are just incapable of getting us across that last 10%. I'm not trying to downplay their usefulness, or imply that they will never get better. I think current models do a reasonably good job of summarizing documentation and producing small snippets of example code I can reuse, but I wouldn't trust them for anything beyond that.
[1]: https://en.wikipedia.org/wiki/Ninety%E2%80%93ninety_rule
If software engineering should look like this, oh boy am I happy to be retiring in a mere 17 years (fingers crossed) and not having to spend more time on such work. No way any quality complex code can come from such an approach, and people complain about the quality of software now.
So you're basically bruteforcing development, a famously efficient technique for... anything.
site:github.com map comparison
I guess the difference is that my way uses dramatically less time and resources, but requires directly acknowledging the original coders instead of relying on the plagiarism-ish capabilities of regurgitating something through an LLM.
Or
Can you easily come up with many things that LLMs have no clue about and hence will fail at?
I’ve been using Sonnet 3.5 to code and I’ve managed to build multiple full fledged apps, including paid ones
Maybe they’re not perfect, but they work and I’ve had no complaints yet. They might not scale to become the next Facebook, but not everything has to scale
Going to some new place meant getting a map, looking at it, making a plan, following the plan, keeping track of where you were on the map, that sort of thing.
Then I traveled somewhere new, for the first time, with GPS and navigation software. It was quite impressive, and rather easier. I got to my destination the first time, without any problems. And each time after that.
But I did remark that I did not learn the route. The 10th time, the 50th time, I still needed the GPS to guide me. And without it, I would have to start the whole thing from scratch: get a map, make a plan, and so on.
Having done the "manual" navigation with maps lots of times before, it never worries me what I would do without a GPS. But if you're "born" with the GPS, I wonder what you do when it fails.
Are you not worried how you would manage your apps if for some reason the AIs were unavailable?
If anyone else is frustrated by this experience, I've found that changing the setting in Google Maps to have the map always point north has helped me actually build a mental model of directions. Instead of just following the line, it forced me to think about whether I'm going north, south, east, or west at each step of the directions.
Make hay while the sun shines, friends. It might not last forever, but neither will you!
I never worried about what would happen if the internet were to become unavailable. Given that it's become an essential service, I just trust that the powers that be will make sure to get it back up.
Prior to an iPhone I’d have the general lay of a city memorised within 10min of landing, using a paper tourist map, and probably never feel disoriented, let alone lost.
This morning I walked 2 blocks further than needed (of a 1 block walk) because I wasn’t at all oriented while following Google maps.
I won’t spell out the AI comparison, other than I think more “apps” will be created, and predictable “followed the GPS off a bridge” revelations.
Python/JS and their ecosystem replacing OS hosted C/C++ which replaced bare metal Assembly which replaced digital logic which replaced analog circuits which replaced mechanical design as the “standard goto tool” for how to create programs.
Starting with punchcard looms and Ada Lovelace maybe.
In every case we trade resource efficiency and lower level understanding for developer velocity and raise the upper bound on system complexity, capability, and somehow performance (despite the wasted efficiency).
>I played around a lot with code when I was younger. I built my first site when I was 13 and had a good handle on Javascript back when jQuery was still a pipe dream.
>Started with the Codecademy Ruby track which was pretty easy. Working through RailsTutorial right now.
posted on April 15, 2015, https://news.ycombinator.com/item?id=9382537
>I've been freelancing since I was 17. I've dabbled in every kind of online trade imaginable, from domain names to crypto. I've built and sold multiple websites. I also built and sold a small agency.
>I can do some marketing, some coding, some design, some sales, but I'm not particularly good at any of those in isolation.
posted on Jan 20, 2023, https://news.ycombinator.com/item?id=34459482
So I don't really understand where this claim of only "6 months of coding experience" is coming from, when you clearly have been coding on and off for multiple decades.
Find a way to work around it.
Everybody ships nasty bugs to production that they themselves might find impossible to debug. Everybody.
Thus they will do the very same thing you, I, or anybody else on this planet would do: find a second pair of eyes, virtually or not, paying or not.
Do you think you could maintain and/or debug someone else's application?
Most of the things I’ve built are fun things
See: GoUnfaked.com and PlaybookFM.com as examples
PlaybookFM.com is interesting because everything from the code to the podcasts to the logo are AI generated
It's a slightly orthogonal way of thinking about this but if you are solving real problems, you get away with so much shit, it's unreal.
Maybe Google is not gonna let you code monkey on their monorepo, but you do not have to care. There's enough not-google in the world, and enough real problems.
In fact, my main reason for not doing any web development is that I find the amount of layers of abstraction and needless complexity for something that should really be simple quite deterring.
I'm sure e.g. React and GraphQL allow people to think about web apps in really elegant and scalable ways, but the learning curve is just way more than I can justify for a side project or a one-off thing at work that will never have more than two or three users opening it once every few months.
The browser is a great place to build voice chat, 3d, almost any other experience. I expect a renewed interest in granting fuller capabilities to the web, especially background processing and network access.
How about we go back to thick clients, with LLMs the effort required to do that for multiple operating systems will also be reduced, no?
MetHacker.io (has a lot of features I had to remove because of X API’s new pricing - see /projects on it)
GoUnfaked.com
PlaybookFM.com
TokenAI.dev (working with blowfish to remove the warning flag)
Maybe I'm "holding it wrong" -- I mean using it incorrectly.
True, it renders quite interesting mockups and has React code behind it, but then try to get this into even a demoable state for your boss or colleagues...
Even a simple "please create a Dockerfile with everything I need in a directory to get this up and running"... doesn't work.
The Dockerfile doesn't work (my fault, maybe, for not saying I'm on ARM64), the app is misconfigured, files are in the wrong directories, key things are missing.
Again just my experience.
I find Claude interesting for generating ideas-- but I have a hard time seeing how a dev with six months experience could get multiple "paid" apps out with it. I have 20 years (bla, bla) experience and still find it requires outrageous hand holding for anything serious.
Again I'm not doubting you at all -- I'm just saying me personally I find it hard to be THAT productive with it.
My only complaints are:
a) that it's really easy to hit the usage limit, especially when refactoring across a half dozen files. One thing that'd theoretically be easyish to fix would be automatically updating files in the project context (perhaps with an "accept"/"reject" prompt) so that the model knows what the latest version of your code is without having to reupload it constantly.
b) it oscillating between being lazy in really annoying ways (giving largeish code blocks with commented omissions partway through) and supplying the full file unnecessarily and using up your usage credits.
My hope is that Jetbrains give up on their own (pretty limited) LLM and partner with Anthropic to produce a super-tight IDE native integration.
At least 95% of the code was generated by AI (I reached the limit so had to add final bits on my own).
POCs and demos are easy to build by anyone these days. The last 10% is what separates student projects from real products.
any engineer who has spent time in the trenches understands that fixing corner cases in code produced by inexperienced engineers consumes a lot of time.
in fact, poor overall design and lack of diligence tanks entire projects.
There’s a daily 2.5 million token limit that you can use up fairly quickly with 100K context
So they may very well have completed the whole program with Claude. It’s just the machine literally stopped and the human had to do the final grunt work.
What stops you from using AI to explain the code base?
I can't think of a worse llm than Claude.
Not necessarily because users can identify AI apps, but more because due to the lower barrier of entry - the space is going to get hyper-competitive and it'll be VERY difficult to distinguish your app from the hundreds of nearly identical other ones.
Another thing that worries me (because software devs in particular seem to take a very loose moral approach to plagiarism and basic human decency) is that it'll be significantly easier for a less scrupulous dev to find an app that they like, and use an LLM to instantly spin up a copy of it.
I'm trying not to be all gloom and doom about GenAI, because it can be really nifty to see it generate a bunch of boilerplate (YAML configs, dev opsy stuff, etc.) but sometimes it's hard....
Take this very post for example. Imagine an artist forum having daily front-page articles on AI, and most of the comments are curious and non-negative. That's basically what HackerNews is doing, but with developers instead. The huge culture difference is curious, and makes me happy with the posters on this site.
You attribute it to the difficulty of using AI coding tools. But tools that cut out the programmer and make development available to the layman have always existed: libraries, game engines, website builders, and now web app builders. You also attribute it to the flooding of the markets. But the website and mobile markets are famously saturated, and yet we continue making stuff there, because we want to (and because quality things make more money).
I instead attribute it to our culture of free sharing (what one might call "plagiarism"... of ideas?!), adaptability, and curiosity. And that makes me hopeful.
People don't seem to realize that the same thing is going to happen to regular app development once AI tooling gets even easier.
I am looking forward to this type of real time app creation being added into our OSs, browsers, phones and glasses.
What do you see that being used for?
Surely, polished apps written for others are going to be best built in professional tools that live independently of whatever the OS might offer.
So I assume you're talking about quick little scratch apps for personal use? Like an AI-enriched version of Apple's Automator or Shortcuts, or of shell scripts, where you spend a while coaching an AI to write the little one-off program you need instead of visually building a workflow or writing a simple script? Is that something you believe there's a high unmet need for?
This is an earnest question. I'm sincerely curious what you're envisioning and how it might supersede the rich variety of existing tools that seem to only see niche use today.
Once a class was full, you could still get in if someone who was selected for the classes changed their mind, which (at an unpredictable time) would result in a seat becoming available in that class until another student noticed the availability and signed up.
So I wrote a simple PHP script that loaded the page every 60 seconds to check, and the script would send me a text message if any of the classes I wanted suddenly had an opening. I would then run to a computer and try to sign up.
These are the kind of bespoke, single-purpose things that I presume AI coding could help the average person with.
“Send me a push notification when the text on this webpage says the class isn’t full, and check every 60 seconds”
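A minimal sketch of that kind of one-off watcher, in Python rather than the original PHP. The URL, the page's "Class Full" marker text, and the notification step are all assumptions here, not the original script:

```python
import time
import urllib.request

COURSE_URL = "https://example.edu/schedule"  # placeholder, not a real page

def seat_available(html: str) -> bool:
    # The original script matched text on the page; here we just look
    # for the absence of a hypothetical "Class Full" marker.
    return "Class Full" not in html

def watch(url: str, interval: int = 60) -> None:
    # Poll the page every `interval` seconds until a seat opens up.
    while True:
        with urllib.request.urlopen(url) as resp:
            html = resp.read().decode("utf-8", errors="replace")
        if seat_available(html):
            print("Seat open! Go sign up.")  # the original sent an SMS here
            return
        time.sleep(interval)
```

An LLM can plausibly produce something of this shape from the one-sentence prompt above; the fiddly part is still telling it what text on the page actually signals an opening.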
Ask a bird what flying is good for and their answer will be encumbered by reality.
Kind of the opposite of “everything looks like a nail”.
Two ideas: "For every picture of food I take, create a recipe to recreate it so I can make it at home in the future" or "Create an app where I can log my food for today and automatically calculate the calories based on the food I put in".
https://github.com/williamcotton/search-input-query
Why multi-pass? So multiple semantic errors can be reported at once to the user!
The most important factor here is that I've written lexers and parsers beforehand. I was very detailed in my instructions and put it together piece-by-piece. It took probably 100 or so different chats.
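The multi-pass payoff, collecting every semantic error instead of stopping at the first one, might look roughly like this in Python (the field names and error format are illustrative, not the repo's actual TypeScript API):

```python
from dataclasses import dataclass

@dataclass
class Field:
    name: str
    pos: int  # character offset in the query string

# Hypothetical schema; the real project checks against its own field list.
KNOWN_FIELDS = {"title", "author", "status"}

def check_fields(fields: list[Field]) -> list[str]:
    # Walk the whole parsed query and accumulate errors rather than
    # raising on the first unknown field.
    errors = []
    for f in fields:
        if f.name not in KNOWN_FIELDS:
            errors.append(f"unknown field '{f.name}' at position {f.pos}")
    return errors

errs = check_fields([Field("ttile", 0), Field("status", 9), Field("autor", 18)])
# Both misspelled fields are reported at once, not just the first.
```

Separating the parse pass from the semantic-check pass is what makes this possible: the parser keeps going, and the checker reports over the whole tree.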
Try it out with the GUI you see in the gif in the README:
git clone git@github.com:williamcotton/search-input-query.git
cd search-input-query/search-input-query-demo
npm install
npm run dev
It's even documented on their site:
https://support.anthropic.com/en/articles/9519189-project-vi...
Click the "Share" button in the upper right corner of your chat.
Click the "Share & Copy Link" button to create a shareable link and add the chat snapshot to your project’s activity feed.
/edit: i just checked. i think they had a regression? or at least i cannot see the button anymore. go figure. must be pretty recently, as i shared a chat just ~2-3 weeks ago
Started off with having it create funny random stories, to slowly creating more and more advanced programs.
It’s shocking how good 3.5 Sonnet is at coding, considering the size of the model.
We don't know the size of Claude 3.5 Sonnet or any other Anthropic model.
So pretty simple flow, totally not scalable for bigger projects.
I need to read and check Cursor AI which can also use Claude models.
In Django I had it create a backend, set up an admin user, create requirements.txt, and then do a whole frontend in Vue as a test. It can even do screen testing, and it tested what happens if it puts in a wrong login.
There are plenty of website builder tools that will glue third party maps. Even the raw Google Maps API website will generate an HTML page with customized maps.
Next obvious steps: make it understand large existing programs, learn from the style of the existing code while avoiding the bad style where it's present, and then contribute features or fixes to that codebase.
There are so many small tasks that I could, but until now almost never would, automate (whether because it's not worth the time [1] or because I just couldn't bring myself to do it, as I don't really enjoy it). A one-off bitmask parser at work here, a proof-of-concept webapp at home there; it's literally opened up a new world of quality-of-life improvements, in a purely quantitative sense.
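That one-off bitmask parser is exactly the sort of thing an LLM can knock out in one prompt. A hypothetical Python version (the flag names and bit values are made up here, not from any real protocol) might be just:

```python
# Hypothetical flag definitions; real ones would come from the format's spec.
FLAGS = {0x1: "READ", 0x2: "WRITE", 0x4: "EXEC", 0x8: "HIDDEN"}

def parse_mask(mask: int) -> list[str]:
    # Collect the name of every flag whose bit is set in the mask.
    return [name for bit, name in FLAGS.items() if mask & bit]

parse_mask(0x5)  # ["READ", "EXEC"]
```

Ten lines nobody wants to write by hand at 4pm on a Friday, but trivially worth having once it exists.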
It extends beyond UI and web development too: Very often I find myself thinking that there must be a smarter way to use CLI tools like jq, zsh etc., but considering how rarely I use them and that I do already know an ineffective way of getting what I need, up until now I couldn't justify spending the hours of going through documentation on the moderately high chance of finding a few useful nuggets letting me shave off a minute here and there every month.
The same applies to SQL: After plateauing for several years (I get by just fine for my relatively narrow debugging and occasional data migration needs), LLMs have been much better at exposing me to new and useful patterns than dry and extensive documentation. (There are technical documents I really do enjoy reading, but SQL dialect specifications, often without any practical motivation as to when to use a given construct, are really not it.)
LLMs have generally been great at that, but being able to immediately run what they suggest in-browser is where Claude currently has the edge for me. (ChatGPT Plus can apparently evaluate Python, but that's server-side only and accordingly doesn't really allow interactive use cases.)
We’re getting there with some of the smaller open source models, but we’re not quite there yet. I’m looking forward to where we’ll be in a year!
In many professions, $5000 for tools is almost nothing.
If you want to pay that <$1k up front to just say "it was always just on my machine, nobody else's," then more power to you. Most just prefer this "pay as you go for someone else to have set it up" model. That doesn't imply it's unattainable if you want to run it differently, though.
I know we all love dunking on how expensive Apple computers are, but for $5000 you would be getting a Mac Mini maxed out with an M4 Pro chip with a 14-core CPU, 20-core GPU, 16-core Neural Engine, 64GB of unified memory, an 8TB SSD, and 10 Gigabit Ethernet.
M4 MacBook Pros start at $1599.
What I think GP was overlooking is that newer mid-range models like Qwen2.5-Coder 32B produce more than usable output for this kind of scenario on much lower-end consumer (rather than prosumer) hardware, so you don't need the high-memory stuff to do this kind of task locally, even if you may need it for serious AI workloads or training.