Working with Pig by Dan Morrill. (video)
From the description:
Pig is a SQL-like command language for use with Hadoop. We review a simple Pig script line by line to help you understand how Pig works, and regular expressions to help parse data. If you want a copy of the slide presentation, it is over on SlideShare: http://www.slideshare.net/rmorrill.
Very good intro to Pig!
Mentions a couple of resources you need to bookmark:
Input Validation Cheat Sheet (Open Web Application Security Project – OWASP) – regexes to re-use in Pig scripts, plus lots of pointers to other regex cheat sheets. (Be mindful that “\” must be escaped in Pig.)
Regular-Expressions.info – a more general resource on regexes.
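To make the escaping point concrete, here is a minimal Python sketch (an illustration only, not from the video). It takes a regex as you would find it on a cheat sheet and doubles each backslash so it can be pasted into a single-quoted Pig string. The helper name to_pig_literal, the relation name records, and the field name zip are all made up for the example, the ZIP-code-style pattern is just a sample, and patterns that contain single quotes are not handled.

```python
import re


def to_pig_literal(regex):
    """Hypothetical helper: wrap a regex in a single-quoted Pig string literal.

    Each backslash is doubled so that Pig's string parser hands a single
    backslash on to the Java regex engine. Single quotes inside the
    pattern are NOT handled in this sketch.
    """
    return "'" + regex.replace("\\", "\\\\") + "'"


# A ZIP-code-style pattern, written the way a regex cheat sheet would show it.
zip_regex = r"^\d{5}(-\d{4})?$"

# Sanity-check the pattern in Python before embedding it in a script.
assert re.fullmatch(zip_regex, "98101-1234")
assert not re.fullmatch(zip_regex, "9810")

# Build a Pig FILTER statement; "records" and "zip" are assumed names.
pig_line = "valid = FILTER records BY zip MATCHES " + to_pig_literal(zip_regex) + ";"
print(pig_line)
```

The printed statement contains '^\\d{5}(-\\d{4})?$', that is, every "\d" from the cheat sheet becomes "\\d" in the Pig script.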
I first saw this at: This Quick Pig Overview Brings You Up to Speed Line by Line.