That is NOT what the first study you've cited says at all:
> "The empirically-keyed, response-option scored biodata scale demonstrated incremental validity over the computerized aptitude test battery in predicting scores representing the core technical skills of en route controllers."
I.e the aptitude test battery is WORSE than the biodata scale.
The second citation you offered merely notes that the AT-SAT battery is a better predictor than the older OPM battery, not that is the best.
I'd also say at a higher level that both of those papers absolutely reek of non-reproduceability and low N problems that plague social and psychological research. I'm not saying they're wrong. They are just not obviously definitive.