Pig Macro for TF-IDF Makes Topic Summarization 2 Lines of Pig by Russell Jurney.
From the post:
In a recent post we used Pig to summarize documents via the Term-Frequency, Inverse Document Frequency (TF-IDF) algorithm.
In this post, we’re going to turn that code into a Pig macro that can be called in one line of code:
Any Pig macros in your trick bag?