This is a bit different than the "read a paper" TTS app. I mentioned the idea just to say it's possible and coming.
The blend of the two isn't out of the question though.
Think of asking for a reading of a paper wherein you could interject at any time.
System: "This work is presented fromainly 3 groups: Deepmind, University of Pennsylvania, and ETH Zurich - the authors are Matthew Botvinick, Dani Bassett, and Bastian Rieck. They uncover a useful meta-learning program that relies on an AT methodology rooted in the bifiltration of the Ricci curvature of the embeddings and training step, wherein ..."
You: "Wait a second - the algebraic topology method - what are the prior works in that area and why would that be the starting point for this paper"
System: "It appears that the relevant citations point to Anne Sizemore's work while in Bassett's lab, with a few other key authors such as Guisti. The titles suggest that..."
(...)
System: "Now that we've cleared that up a bit (and added it to a research list for further exploration later), to continue on the paper ..."
And so on.
This is very achievable today with a little bit of work.
Perhaps not easy to work _super well_ - but likely easy enough to get working to _some degree_. A well polished product that does work super well certajnly isn't out of the question though.