i had the same thoughts, but one usecase was to quickly create GPU calculations. i wanted to allow users to create GPU kernels but without having to write a kernel.
Data has to get transferred from the GPU eventually, so breaking it up into steps back and forth does seem like over (under?) kill. Hopefully, since they're pipes in memory it won't be so bad as going to disk.