Spark in Clojure by Mykhailo Kozik.
From the post:
Apache Spark is a fast and general engine for large-scale data processing.
100 times faster than Hadoop.
Everyone knows SQL. But traditional databases are not good in hadling big amount of data. Nevertheless, SQL is a good DSL for data processing and it is much easier to understand Spark if you have similar query implemented in SQL.
This article shows how common SQL queries implemented in Spark.
Another long holiday weekend appropriate posting.
Good big data practice too.