So, what’s brewing with HCatalog?
From the post:
Apache HCatalog announced the release of version 0.5.0 in the past week. Along with that, it has initiated steps to graduate from an incubator project to an Apache Top Level project or sub-project. Let’s look at the current state of HCatalog, its increasing relevance and where it is heading.
HCatalog, for a small introduction, is a “table management and storage management layer for Apache Hadoop” which:
- enables Pig, MapReduce, and Hive users to easily share data on the grid
- provides a table abstraction for a relational view of data in HDFS
- ensures format indifference (viz. RCFile format, text files, sequence files)
- provides a notification service when new data becomes available
Nice summary of the current state of HCatalog, pointing to a presentation by Alan Gates from Big Data Spain 2012.
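
The four properties quoted above can be made concrete with a toy model. What follows is a minimal sketch in plain Python, not the HCatalog API: the `ToyCatalog` class, its method names, and the two stand-in formats (`csv` and `jsonl` in place of RCFile, text files, or sequence files) are all hypothetical and exist only to illustrate the idea of a shared table abstraction, format indifference, and notifications.

```python
import csv
import io
import json


class ToyCatalog:
    """Hypothetical stand-in for a table-management layer (not HCatalog itself)."""

    def __init__(self):
        self._tables = {}     # table name -> format, column names, partitions
        self._listeners = []  # callbacks invoked when a partition is added

    def create_table(self, name, columns, fmt):
        # Register the schema and storage format once, for every consumer.
        self._tables[name] = {"format": fmt, "columns": list(columns), "partitions": {}}

    def subscribe(self, callback):
        # Notification service: callback(table, partition) fires on new data.
        self._listeners.append(callback)

    def add_partition(self, table, partition, raw_data):
        self._tables[table]["partitions"][partition] = raw_data
        for callback in self._listeners:
            callback(table, partition)

    def read(self, table, partition):
        # Format indifference: callers always get a list of row dicts,
        # whatever the underlying storage format is.
        meta = self._tables[table]
        raw = meta["partitions"][partition]
        if meta["format"] == "csv":
            rows = csv.reader(io.StringIO(raw))
            return [dict(zip(meta["columns"], row)) for row in rows]
        if meta["format"] == "jsonl":
            return [json.loads(line) for line in raw.splitlines() if line]
        raise ValueError("unsupported format: " + meta["format"])


catalog = ToyCatalog()
events = []
catalog.subscribe(lambda table, partition: events.append((table, partition)))

# The same logical data stored in two different formats.
catalog.create_table("clicks_csv", ["user", "url"], "csv")
catalog.create_table("clicks_json", ["user", "url"], "jsonl")
catalog.add_partition("clicks_csv", "day=1", "alice,/home\nbob,/docs\n")
catalog.add_partition(
    "clicks_json",
    "day=1",
    '{"user": "alice", "url": "/home"}\n{"user": "bob", "url": "/docs"}\n',
)

rows_csv = catalog.read("clicks_csv", "day=1")
rows_json = catalog.read("clicks_json", "day=1")
```

Both reads return the same list of row dictionaries, and `events` records one notification per added partition. In this sketch the reader never has to know where or how the data is stored, which is the point the quoted bullets make about sharing data across Pig, MapReduce, and Hive.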