Open Source Distributed Analytics Engine with SQL interface and OLAP on Hadoop by eBay – Kylin by Avkash Chauhan.
From the post:
Key Features:
- Extremely Fast OLAP Engine at Scale:
- Kylin is designed to reduce query latency on Hadoop for 10+ billions of rows of data
- ANSI-SQL Interface on Hadoop:
- Kylin offers ANSI-SQL on Hadoop and supports most ANSI-SQL query functions
- Interactive Query Capability:
- Users can interact with Hadoop data via Kylin at sub-second latency, better than Hive queries for the same dataset
- MOLAP Cube:
- User can define a data model and pre-build in Kylin with more than 10+ billions of raw data records
- Seamless Integration with BI Tools:
- Kylin currently offers integration capability with BI Tools like Tableau.
- Other Highlights:
- Job Management and Monitoring
- Compression and Encoding Support
- Incremental Refresh of Cubes
- Leverage HBase Coprocessor for query latency
- Approximate Query Capability for distinct Count (HyperLogLog)
- Easy Web interface to manage, build, monitor and query cubes
- Security capability to set ACL at Cube/Project Level
- Support LDAP Integration
Find it at Github: https://github.com/KylinOLAP/Kylin
Learn more at: http://www.kylin.io/index.html
More info:
A useful write-up for an overview of Kylin: Announcing Kylin: Extreme OLAP Engine for Big Data, the blog post from eBay that announces the open sourcing of Kylin.
What caught my eye was the use of pre-calculation of combinations of dimensions using Hadoop. Sounds promising!