How else would a study scientifically determine the accuracy of an AI model in diagnosis? By testing it on real people before they know how good it is?
Why not? Have the AI do it, then have a human doctor do a follow-up/review? I might not be a fan of this for urgent care, but for general visits I wouldn't mind spending a bit of extra time if it was followed by an expert exam.