So the question where the speech recognition happens isn’t easily answered.
http://www.soundhound.com/hound#!