undefined | Better HN

0 pointshereonout21mo ago0 comments

I was playing about with Chat GPT the other day, uploading screen shots of sheet music and asking it to convert it to ABC notation so I could make a midi file of it.

The results seemed impressive until I noticed some of the "Thinking" statements in the UI.

One made it apparent the model / agent / whatever had read the title from the screenshot and was off searching for existing ABC transcripts of the piece Ode to Joy.

So the whole thing was far less impressive after that, it wasn't reading the score anymore, just reading the title and using the internet to answer my query.

0 comments

nobodywillobsrv1mo ago

Yes I have found that grok for example actually suddenly becomes quite sane when you tell it to stop querying the internet And just rethink the conversation data and answer the question.

It's weird, it's like many agents are now in a phase of constantly getting more information and never just thinking with what they've got.

Szpadel1mo ago

but isn't it what we wanted? we complained so much that LLM uses deprecated or outdated apis instead of current version because they relied so much on what they remembered

nobodywillobsrv1mo ago

To be clear, what I mean is that grok will query 30 pages and then answer your question vaguely or wrongly and then ask for clarification of what it meant and then it goes and requeries everything again ... I can imagine why it might need to revisit pages etc and it might be a UI thing but it still feels like until you yell at it to stop searching for answers to summarise it doesn't activate it's "think with what you got" mode.

I guess we could call this gathering and then do your best conditional on what you found right now.

HappMacDonald1mo ago

2010's: Google Search is making humans who constantly rely on it dumber

2020's: LLMs are making humans who constantly rely on them dumber

2026: Google Search is making LLMs who constantly rely on it dumber

bestham1mo ago

Touché, that is what we humans are doing to some degree as well.

anomaly_1mo ago

Sounds pretty human like! Always searching for a shortcut

lpcvoid1mo ago

It sounds like it's lying and making stuff up, something everybody seems to be okay with when using LLMs.

LeanderK1mo ago

I am not sure why...you want the LLM to solve problems not come up with answers itself. It's allowed to use tools, precisely because it tends to make stuff up. In general, only if you're benchmarking LLMs you care about whether the LLM itself provided the answer or it used a tool. If you ask it to convert the notation of sheet music it might use a tool, and it's probably the right decision.

1 more reply

kouunji1mo ago

For structured outputs like that wouldn’t it be better to get the LLM to create a script to repeatably make the translation?

j / k navigate · click thread line to collapse

0 pointshereonout21mo ago0 comments

I was playing about with Chat GPT the other day, uploading screen shots of sheet music and asking it to convert it to ABC notation so I could make a midi file of it.

The results seemed impressive until I noticed some of the "Thinking" statements in the UI.

One made it apparent the model / agent / whatever had read the title from the screenshot and was off searching for existing ABC transcripts of the piece Ode to Joy.

So the whole thing was far less impressive after that, it wasn't reading the score anymore, just reading the title and using the internet to answer my query.

0 comments

nobodywillobsrv1mo ago

Yes I have found that grok for example actually suddenly becomes quite sane when you tell it to stop querying the internet And just rethink the conversation data and answer the question.

It's weird, it's like many agents are now in a phase of constantly getting more information and never just thinking with what they've got.

Szpadel1mo ago

but isn't it what we wanted? we complained so much that LLM uses deprecated or outdated apis instead of current version because they relied so much on what they remembered

nobodywillobsrv1mo ago

I guess we could call this gathering and then do your best conditional on what you found right now.

HappMacDonald1mo ago

2010's: Google Search is making humans who constantly rely on it dumber

2020's: LLMs are making humans who constantly rely on them dumber

2026: Google Search is making LLMs who constantly rely on it dumber

bestham1mo ago

Touché, that is what we humans are doing to some degree as well.

anomaly_1mo ago

Sounds pretty human like! Always searching for a shortcut

lpcvoid1mo ago

It sounds like it's lying and making stuff up, something everybody seems to be okay with when using LLMs.

LeanderK1mo ago

1 more reply

kouunji1mo ago

For structured outputs like that wouldn’t it be better to get the LLM to create a script to repeatably make the translation?

j / k navigate · click thread line to collapse