That's pretty broad, so here are some questions that might help you find something relevant: 1) is it a big batch job or incremental/streaming analysis, 2) does the data reasonably fit on one machine, 3) does your existing analysis depend on any complex third-party libraries (which are hard to port away from), 4) would you be willing to use a cloud provider's proprietary tool, 5) what level of commercial support do you need?
You can get pretty far with R or pandas + SciPy on a fast machine; after that, you start taking on the extra hassle of Spark or whatever fits your situation.
Oh, and 0) what pain is motivating the rebuild? Feel free to e-mail me, even just to rubber-duck your thinking.