Apache Tajo SQL-on-Hadoop engine now a top-level project by Derrick Harris.
From the post:
Apache Tajo, a relational database warehouse system for Hadoop, has graduated to to-level status within the Apache Software Foundation. It might be easy to overlook Tajo because its creators, committers and users are largely based in Korea — and because there’s a whole lot of similar technologies, including one developed at Facebook — but the project could be a dark horse in the race for mass adoption. Among Tajo’s lead contributors are an engineer from LinkedIn and members of the Hortonworks technical team, which suggests those companies see some value in it even among the myriad other options.
It is far too early to be choosing winners in the Hadoop ecosystem.
There are so many contenders, with their individual boosters, that if you don’t like the solutions offered today, wait a week or so, another one will pop up on the horizon.
Which isn’t a bad thing. There isn’t any reason to think IT has uncovered the best data structures or algorithms for your data. Anymore than you would have thought that twenty years ago.
The caution I would offer is to hold tightly to your requirements and not those of some solution. Compromise may be necessary on your part, but fully understand what you are giving up and why.
The only utility that software can have, for any given user, is that it performs some task they require to be performed. For vendors, adopters, promoters, software has other utilities, which are unlikely to interest you.