Microsoft’s plan for Hadoop and big data by Edd Dumbill.
From the post:
Microsoft has placed Apache Hadoop at the core of its big data strategy. It’s a move that might seem surprising to the casual observer, being a somewhat enthusiastic adoption of a significant open source product.
The reason for this move is that Hadoop, by its sheer popularity, has become the de facto standard for distributed data crunching. By embracing Hadoop, Microsoft allows its customers to access the rapidly-growing Hadoop ecosystem and take advantage of a growing talent pool of Hadoop-savvy developers.
Microsoft’s goals go beyond integrating Hadoop into Windows. It intends to contribute the adaptions it makes back to the Apache Hadoop project, so that anybody can run a purely open source Hadoop on Windows.
If MS is taking the data integration road, isn’t that something your company needs to be thinking about?
There is all that data diversity that Hadoop processing is going to uncover, but I have some suggestions about that issue. 😉
Nothing but good can come of MS using Hadoop as an integration data appliance. MS customers will benefit and parts of MS won’t have to worry about stepping on each other. A natural outcome of hard coding into formats. But that is an issue for another day.