Thanks for the feedback. The main performance focus was rendering.
Claude Code and other TUIs (except Codex) use a layer of abstraction over the raw terminal escape sequences.
I directly used `crossterm`, which gave me more control and lower latency.
For example if nothing is going on, I don't render anything in the terminal. Or only render at keypress.