Online random forest in C++, uses the GPU with OpenCL. (opens in new tab)

(github.com)

4 pointsilijavanil12y ago2 comments

My little machine learning project.. Serializable, online random forest algorithm implemented in C++, uses the GPU over OpenCL.

2 comments

switch3312y ago

This is awesome. Using the GPU with OpenCL means they will be decent performance. Any test case with data would be nice to read though.

Also, how comparable this is to something that does handle random forests well like BigML would be interesting.

ilijavanilOP12y ago

The implementation uses shared pointers to replicate as little data as possible. The memory management is there to maximize the amount of data you can process on a single computer with limited memory. There is a simple zmq wrapper that can be used to link bunch of these in parallel(or any other structure) on multiple computers. The same zmq wrapper can be used for other online learning algorithms. I just didn't implement the others yet. If there will be more people interested, I could whip up a few test runs and examples.

j / k navigate · click thread line to collapse

2 comments

switch3312y ago

This is awesome. Using the GPU with OpenCL means they will be decent performance. Any test case with data would be nice to read though.

Also, how comparable this is to something that does handle random forests well like BigML would be interesting.

ilijavanilOP12y ago

j / k navigate · click thread line to collapse