Impala: Real-time Queries in Hadoop
From the description:
Learn how Cloudera Impala empowers you to:
- Perform interactive, real-time analysis directly on source data stored in Hadoop
- Interact with data in HDFS and HBase at the “speed of thought”
- Reduce data movement between systems & eliminate double storage
You can also grab the slides here
.Almost fifty-nine minutes.
Speedup on Hive over MapReduce reported to be 4-30X faster.
I’m not sure about #2 above but then I lack a skull-jack. Maybe this next Christmas. 😉