I'd imagine that you'd have some hybrid of ML and traditional code, and may be able to reason statistically about the ML sections, and user traditional (verified) code to cut the tail on the distribution.
All pipe dreams of mine, but the research potential here could be worth flaming truckloads of grant money :-)