Dealing with Data in the Hadoop Ecosystem – Hadoop, Sqoop, and ZooKeeper by Rachel Roumeliotis.
From the post:
Kathleen Ting (@kate_ting), Technical Account Manager at Cloudera, and our own Andy Oram 0:22]
ZooKeeper, the canary in the Hadoop coal mine [Discussed at 1:10] Leaky clients are often a problem ZooKeeper detects [Discussed at 2:10] Sqoop is a bulk data transfer tool [Discussed at 2:47] Sqoop helps to bring together structured and unstructured data [Discussed at 3:50] ZooKeep is not for storage, but coordination, reliability, availability [Discussed at 4:44]
Conference interview so not deep but interesting.
For example, reported that 44% of production errors could be traced to misconfiguration errors.
[…] Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity « Dealing with Data in the Hadoop Ecosystem… […]
Pingback by Hadoop Ecosystem Configuration Woes? « Another Word For It — November 7, 2013 @ 3:15 pm