If a school system is designed so that the average 3rd grader is expected to be in 4th grade the following year, then a sizable subset of kids not being able to meet that bar is a sign that the system is failing those kids.
What's the goal here? Is it to get pretty metrics by filtering out the failures, or is it to provide an effective education to all kids?