This is the big one. You pay a 50% penalty for actual CPU bound, iteration heavy code with bounds checking enabled.
https://github.com/matklad/bounds-check-cost
Running this with 1.65 on an Intel 12400 gets a nearly 4x speedup when bounds checking is not needed. Just wow.
Bounds checking avoidance is important when it becomes a significant chunk of your hot-path.