And even then - the LLMs are going to be so much slower than the MCP server that I've never quite understood the point of trying to optimise it. Even FastMCP is pretty good because everything is async.
But even given all that - this is very impressive :)
Then, with a C++ implementation, you get more portability than with Python or other interpreted languages.
Unless you're working on 10 features in parallel with smaller models in best-of-4 groups.
So MCP optimization may be something that’s targeted at those areas.
I also ported the Claude Agent SDK to C++, so fastmcpp is needed for that: https://github.com/0xeb/claude-agent-sdk-cpp
RapidJSON is a good suggestion, maybe I will switch to it at some point.