Million Song Dataset in Minutes! (Video)
Actually 5:35 as per the video.
The summary of the video reads:
Created Web Project [zero install]
Loaded data from S3
Developed in Pig and Python [watch for the drop down menus of pig fragments]
ILLUSTRATE’d our work [perhaps the most impressive feature, tests code against sample of data]
Ran on Hadoop [drop downs to create a cluster]
Downloaded results [50 “densest songs”, see the video]
It’s not all “hands free” or without intellectual effort on your part.
But, a major step towards a generally accessible interface for Hadoop/MapReduce data processing.