You just look at the distribution of p values that are used to support the authors' hypotheses. If the distribution is skewed high, then something fishy is going on.
But this article feels like reading a teaser chapter of a bigger story.
> The amount of (toil) required to actually create data like this from scratch if (very) nightmarish. It’s a task drastically out of reach of the (foolish people) who’d try such a bush league stunt in the first place.
This assumes that all experiments lead to publications. We know there's a strong publication bias, and that the bias favors positive results, and dramatically favors unintuitive positive results. Which means you need to find correlations where none were expected. How many experiments do you need to get a significant correlation when there is none? Hint: more than one.
It's also worth noting that it wouldn't be difficult to produce a genetic algorithm using various statistical checks, including this one, as a fitness function.
https://medium.com/@jamesheathers/a-lot-of-people-are-hung-u...
> A lot of people are hung up the fact that it’s possible to collect ages in finer grains — to the nearest month, or day, etc.
> Remember a) this is just a hypothetical illustration and b) the vast majority of the time, age is collected exactly how I’ve described here.