Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

April 19, 2012

7 top tools for taming big data

Filed under: BigData,Jaspersoft,Karmasphere,Pentaho,Skytree,Splunk,Tableau,Talend — Patrick Durusau @ 7:20 pm

7 top tools for taming big data by Peter Wayner.

Peter covers:

  • Jaspersoft BI Suite
  • Pentaho Business Analytics
  • Karmasphere Studio and Analyst
  • Talend Open Studio
  • Skytree Server
  • Tableau Desktop and Server
  • Splunk

Not as close to the metal as Lucene/Solr, Hadoop, HBase, Neo4j, and many other packages but not bad starting places.

Do be mindful of Peter’s closing paragraph:

At a recent O’Reilly Strata conference on big data, one of the best panels debated whether it was better to hire an expert on the subject being measured or an expert on using algorithms to find outliers. I’m not sure I can choose, but I think it’s important to hire a person with a mandate to think deeply about the data. It’s not enough to just buy some software and push a button.

February 3, 2012

Karmasphere Studio Community Edition

Filed under: Hadoop,Hive,Karmasphere — Patrick Durusau @ 4:52 pm

Karmasphere Studio Community Edition

From the webpage:

Karmasphere Studio Community Edition is the free edition of our graphical development environment that facilitates learning Hadoop MapReduce jobs. It supports the prototyping, developing, and testing phases of the Hadoop development lifecycle.

The parallel and parameterized queries features in their Analyst product attracted me to the site:

From the webpage:

According to Karmasphere, the updated version of Analyst offers a parallel query capability that they say will make it faster for data analysts to iteratively query their data and create visualizations. The company claims that the new update allows data analysts to submit queries, view results, submit a new set and then compare those results across the previous outputs. In essence, this means users can run an unlimited number of queries concurrently on Hadoop so that one or more data sets can be viewed while the others are being generated.

Karmasphere also says that the introduction of parameterized queries allows users to submit their queries as they go, while offering them output in easy-to-read graphical representations of the findings, in Excel spreadsheets, or across a number of other outside reporting tools.

Hey, it says “…in Excel spreadsheets,” do you think they are reading my blog? (Spreadsheet -> Topic Maps: Wrong Direction? 😉 I didn’t really think so either.) I do take that as validation of the idea that offering users a familiar interface is more likely to be successful than an unfamiliar one.

Powered by WordPress