For me, the big surprise is that Copperhead departs from NESL-like flattening transformations (e.g., those used by Data Parallel Haskell). It's a bit less surprising when you realize the creator is a GPU expert :)
Edit: Vasily, the guy behind the paper advertised in Continuum's blog post, is also from our lab ;-)