The problem is that careful (conference) reviews don’t scale. Large conferences end up suffering from a highly stochastic behavior where excellent work is borderline rejected on a regular basis while mediocre/incorrect work gets accepted every so often. github/arxiv are no silver bullet but offer an interesting alternative (with their own set of challenges, though).