SIMR – Spark on top of Hadoop by Danny Bickson.
From the post:
Just learned from my collaborator Aapo Kyrola that the Spark team have now released a plugin which allows running Spark on top of Hadoop, without installation anything and without administrator privileges. This will probably encourage many more companies to try out Spark, which significantly improves on Hadoop performance.
The tools for data are getting easier to use every day.
Which moves the semantic wall a little closer with each improvement.
Efficiently processing TB of data only to confess it isn’t clear what the data may or may not mean, isn’t going to win IT any friends.