Facebook open sources its SQL-on-Hadoop engine, and the web rejoices by Derrick Harris.
From the post:
Facebook has open sourced Presto, the interactive SQL-on-Hadoop engine the company first discussed in June. Presto is Facebook’s take on Cloudera’s Impala or Google’s Dremel, and it already has some big-name fans in Dropbox and Airbnb.
Technologically, Presto and other query engines of its ilk can be viewed as faster versions of Hive, the data warehouse framework for Hadoop that Facebook created several years ago. Facebook and many other Hadoop users still rely heavily on Hive for batch-processing jobs such as regular reporting, but there has been a demand for something letting users perform ad hoc, exploratory queries on Hadoop data similar to how they might do them using a massively parallel relational database.
Presto is 10 times faster than Hive for most queries, according to Facebook software engineer Martin Traverso in a blog post detailing today’s news.
I think my headline is the more effective one. 😉
You won’t really know anything until you download Presto and read the documentation for yourself.
A headline’s first job is to get your attention; after that, you have to gather the information you need to be informed.
Judging from Derrick’s post, which points to other SQL-on-Hadoop options, interesting times are ahead!