Jubatus:…Realtime Analysis of Big Data [XLDB2012 Presentation]

Saturday, October 20th, 2012

XLDB2012: Jubatus: Distributed Online Machine Learning Framework for Realtime Analysis of Big Data by Hiroyuki Makino.

I first pointed to Jubatus here.

The presentation reviews some impressive performance numbers and one technique that merits special mention.

Intermediate results are shared among the servers during processing to improve accuracy. That may be common in distributed machine learning systems, but it is the first mention I have encountered.
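
To make the idea of sharing intermediate results concrete, here is a minimal sketch in Python. It is not taken from Jubatus or from the presentation: the perceptron learner, the plain averaging step, and every name in it (perceptron_update, mix, train_distributed, predict) are my own assumptions for illustration. Each "server" is simulated as a list of examples, learns locally, and then all servers average their weights before continuing.

```python
def perceptron_update(weights, features, label, lr=1.0):
    """One online update of a binary linear classifier (label is +1 or -1)."""
    score = sum(w * x for w, x in zip(weights, features))
    if label * score <= 0:  # misclassified or on the boundary: adjust weights
        return [w + lr * label * x for w, x in zip(weights, features)]
    return weights


def mix(models):
    """Average the per-server weight vectors into one shared vector.

    Assumption: simple averaging stands in for whatever sharing scheme
    a real system uses.
    """
    n = len(models)
    return [sum(ws) / n for ws in zip(*models)]


def train_distributed(shards, dim, rounds=5):
    """Simulate several servers; each shard is one server's local data."""
    models = [[0.0] * dim for _ in shards]
    for _ in range(rounds):
        # Local phase: every server learns only from its own shard.
        for i, shard in enumerate(shards):
            for features, label in shard:
                models[i] = perceptron_update(models[i], features, label)
        # Sharing phase: exchange intermediate weights and average them.
        shared = mix(models)
        models = [list(shared) for _ in shards]
    return models[0]


def predict(weights, features):
    """Return +1 or -1 according to the sign of the linear score."""
    return 1 if sum(w * x for w, x in zip(weights, features)) > 0 else -1


if __name__ == "__main__":
    shards = [
        [([1.0, 0.0], 1), ([-1.0, 0.0], -1)],   # data seen by server 1
        [([2.0, 1.0], 1), ([-2.0, -1.0], -1)],  # data seen by server 2
    ]
    weights = train_distributed(shards, dim=2)
    print(weights, predict(weights, [3.0, 0.0]))
```

The point of the sketch is only that no server ever sees another server's data, yet each ends up with a model informed by all of it.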

In parallel processing of topic maps, has anyone considered sharing merging information across servers?


Saturday, October 29th, 2011

Jubatus: Distributed Online Machine Learning Framework

The Jubatus library is an online machine learning framework that runs in a distributed environment. The Jubatus library includes these functions:

  • multi-class/binary classification,
  • pre-processing data (for natural language), and
  • process management.

Talk about something that will make you perk up on a rainy afternoon!