Counterintuitive thing I learned: when I tried to skip the explicit gate and 'let the model learn it', accuracy dropped meaningfully. The deterministic preprocessing wins over end-to-end here. Kind of an inverse of Padgham's 'accident becomes intent' — the technique survives, just on the analysis side now instead of the production side.