Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

November 5, 2013

Hadoop for Data Science: A Data Science MD Recap

Filed under: Data Science,Hadoop — Patrick Durusau @ 2:02 pm

Hadoop for Data Science: A Data Science MD Recap by Matt Motyka.

From the post:

On October 9th, Data Science MD welcomed Dr. Donald Miner as its speaker to talk about doing data science work and how the hadoop framework can help. To start the presentation, Don was very clear about one thing: hadoop is bad at a lot of things. It is not meant to be a panacea for every problem a data scientist will face.

With that in mind, Don spoke about the benefits that hadoop offers data scientists. Hadoop is a great tool for data exploration. It can easily handle filtering, sampling and anti-filtering (summarization) tasks. When speaking about these concepts, Don expressed the benefits of each and included some anecdotes that helped to show real world value. He also spoke about data cleanliness in a very Baz Luhrmann Wear Sunscreen sort of way, offering that as his biggest piece of advice.

What?

Hadoop is not a panacea for every data problem????

😉

Don’t panic when you start the video. The ads, etc., take almost seven (7) minutes but Dr. Miner is on the way.


Update: Slides for Hadoop for Data Science. Enjoy!

No Comments

No comments yet.

RSS feed for comments on this post.

Sorry, the comment form is closed at this time.

Powered by WordPress