I don’t think I’m naively expecting LLMs to magically fix things. Voice is currently a bad interface because the agents are dumb: they just map verbs to hand-coded action trees. The missing pieces are, first, intelligence and, second, long-term adaptability/personalization.
LLMs can plausibly solve both.