Archive for the ‘Hadapt’ Category

HadoopDB: Efficient Processing of Data Warehousing Queries in a Split Execution Environment

Sunday, January 29th, 2012

From the post:

The buzz about Hadapt and HadoopDB has been around for a while now as it is one of the first systems to combine ideas from two different approaches, namely parallel databases based on a shared-nothing architecture and map-reduce, to address the problem of large scale data storage and analysis.

This early paper that introduced HadoopDB crisply summarizes some reasons why parallel database solutions haven’t scaled to hundreds of machines. The reasons include:

  1. As the number of nodes in a system increases, failures become more common.
  2. Parallel databases usually assume a homogeneous array of machines, which becomes impractical as the number of machines rises.
  3. They have not been tested at larger scales, as applications haven’t demanded more than tens of nodes for performance until recently.

Interesting material to follow on the HPCC vs. Hadoop post.

Not to take sides, just the beginning of the type of analysis that will be required.

Hadapt is moving forward

Friday, November 25th, 2011

A bullet-point type review, mostly a summary of information from the vendor. Not a bad thing, can be useful. But, you would think that when reviewing a vendor or their product, there would be a link to the vendor/product. Yes? None that I can find in that post.

Let me make it easy for you: Hadapt.com. How hard was that? Maybe 10 seconds of my time and that is because I have gotten slow? The point of the WWW, at least as I understand it, is to make information more accessible to users. But it doesn’t happen by itself. Put in hyperlinks where appropriate.

There is a datasheet on the Adaptive Analytic Platform™.

You can follow the link for the technical report and register, but it is little more than a sales brochure.

More informative is: Efficient Processing of Data Warehousing Queries in a Split Execution Environment.

I don’t have a local setup that would exercise Hadapt. If you do, or if you are using it in the cloud, I would appreciate any comments or pointers you have.