Actually, in the past few days o3 has proven fairly unreliable for me. I've gone back to o1-pro. But when I wrote the above it was reasonably reliable.
o3 with a pdf or in deep research mode is excellent. Especially if you’re disciplined about staying to what’s research. But really, it’s excellent, better than benchmarks indicate, I’d say.