Speech is more than articulating words. It is also about rhythm and melody and idealy also body language.
The way someone speaks is very unique ... and it is actually very, very important how you speak to bring your point across. Or ... to convince people.
A robot voice might present the best arguments, but it will very likely loose to a good speaker who can (literally) tune in to his audience.
Speech is a complex pattern of sound waves, containing much more information, than binary encoded words.
So if there was a ML tool to make people with strong accent more understandable, why not. But you can also numble without any accent.
And I can enjoy and understand certain people with strong accents much better than natives, because they are just good speakers.
And having subtitles is one thing, but changing their voice .. would require consent I believe. (unless you run the tool for yourself, but I believe parents point was, he speaks and then automatically a tool enhances his voice, I would not like that, too)