undefined | Better HN

0 pointssho_hn1y ago0 comments

Is this a local run of one of the smaller models and/or other-models-distilled-with-r1, or are you using their Chat interface?

I've also compared o1 and (online-hosted) r1 on Qt/C++ code, being a KDE Plasma dev, and my impression so far was that the output is roughly on par. I've given both models some tricky tasks about dark corners of the meta-object system in crafting classes etc. and they came up with generally the same sort of suggestions and implementations.

I do appreciate that "asking about gotchas with few definitive solutions, even if they require some perspective" and "rote day-to-day coding ops" are very different benchmarks due to how things are represented in the training data corpus, though.

0 comments

throwup2381y ago

I use it through Kagi Assistant which has the proper R1 model through Together.ai/Fireworks.ai

My standard test is to ask the model to write a QSyntaxHighlighter subclass that uses TreeSitter to implement syntax highlighting. O1 can do it after a few iterations, but R1’s output has been a mess. That said, its thought process revealed a few issues that I then fixed in my canonical implementation.

nialv71y ago

Tried this on chat.deepseek.com, it seems to be able to do it.

throwup2381y ago

Does it compile? Put the full chat in Pastebin and let’s check it out!

I haven’t used their official chat interface or API for privacy reasons.

CamperBob21y ago

Some have said (for what little that's worth) that Kagi's version is not the real thing, but one of the distillations.

sho_hnOP1y ago

Thanks for adding detail! My prompts have been very in-the-bubble-of-Qt I'd say, less so about mashing together Qt and something else, which I agree is a good real-world test case.

throwup2381y ago

I haven’t had the chance to try it out with R1 yet but if you implement a debugger class that screenshots the widget/QML element, dumps its metadata like GammaRay, and includes the source, you can feed that context into Sonnet and o1. They are scarily good at identifying bugs and making modifications if you include all that context (although you have to be selective with what metadata you include. I usually just dump a few things like properties, bindings, signals, etc).

j / k navigate · click thread line to collapse

0 pointssho_hn1y ago0 comments

Is this a local run of one of the smaller models and/or other-models-distilled-with-r1, or are you using their Chat interface?

0 comments

throwup2381y ago

I use it through Kagi Assistant which has the proper R1 model through Together.ai/Fireworks.ai

nialv71y ago

Tried this on chat.deepseek.com, it seems to be able to do it.

throwup2381y ago

Does it compile? Put the full chat in Pastebin and let’s check it out!

I haven’t used their official chat interface or API for privacy reasons.

CamperBob21y ago

Some have said (for what little that's worth) that Kagi's version is not the real thing, but one of the distillations.

sho_hnOP1y ago

Thanks for adding detail! My prompts have been very in-the-bubble-of-Qt I'd say, less so about mashing together Qt and something else, which I agree is a good real-world test case.

throwup2381y ago

j / k navigate · click thread line to collapse