It's because reproduced audio doesn't have the bass the same as you hearing it conducted through your jawbone (though of course this will sound too bassy to everyone else!)
Makes sense. I think also until YouTube and podcasting became popular, most of the mics that the typical person would accidentally stumble on were probably bright rather than warm.