For example, an AI would already use linux commands like tree to traverse the code base. And again it already has good training in this.
The other problem is that it is easy to cook up examples which demonstrate the efficacy of tools like these - but actually proving that the cognitive deficit that such tools result it, is surmounted by their efficacy in long horizon runs. My first contact instinct is that this will result in a net negative 'deployable intelligence' over long horizon runs - make the agent perform worse than using existing tools.
Proving the opposite is a non-trivial problem - but maybe it might be something you want to take up.
No comments yet.