I'm really interested in doing more machine learning work (my current projects, as interesting as they are, dont really require it).
I've done a few weirdo projects with NLTK, tho, and its great fun. By stream hacking do you mean offloading learning sets (active or initial) and that heavy overhead into the "cloud", or am I misunderstanding the terminology?