The problem comes with thinking you can bridge both of those use cases - vague task descriptions to final output. The work described in the article of getting an LLM itself to break down a task seems to work sometime but struggles in many scenarios. Products that can define their domain narrowly enough, and embed enough domain knowledge into the system, and can ask the feedback at the right points, and going to be successful and more generalized systems will either need to act more like tools rather than complete solutions.