I'm not sure for text it's a better performing model. I was just testing GPT-4o on a use case (generating AP MCQ questions) and -4o is repeatedly generating questions with multiple correct answers and will not fix it when prompted.
(Providing the history to GPT-4Turbo results in it fixing the MCQ just fine).