Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

May 20, 2012

Top 10 challenging problems in data mining

Filed under: Data Mining — Patrick Durusau @ 1:12 pm

Top 10 challenging problems in data mining by Sandro Saitta (March 27, 2008)

I mention the date of this post because the most recent response to it was four days ago, May 15, 2012.

I should write a post that gets comments that long after publication!

Sandro writes:

In a previous post, I wrote about the top 10 data mining algorithms, a paper that was published in Knowledge and Information Systems. The “selective” process is the same as the one that has been used to identify the most important (according to answers of the survey) data mining problems. The paper by Yang and Wu has been published (in 2006) in the International Journal of Information Technology & Decision Making. The paper contains the following problems (in no specific order):

  • Developing a unifying theory of data mining
  • Scaling up for high dimensional data and high speed data streams
  • Mining sequence data and time series data
  • Mining complex knowledge from complex data
  • Data mining in a network setting
  • Distributed data mining and mining multi-agent data
  • Data mining for biological and environmental problems
  • Data Mining process-related problems
  • Security, privacy and data integrity
  • Dealing with non-static, unbalanced and cost-sensitive data
  • It’s a little over five years later.

    Same list? Different list?

    BTW, the 2006 article by Yang and Wu, along with slides, can be found at: 10 Challenging Problems in Data Mining Research

    The full citation of the article is:

    Qiang Yang and Xindong Wu (Contributors: Pedro Domingos, Charles Elkan, Johannes Gehrke, Jiawei Han, David Heckerman, Daniel Keim, Jiming Liu, David Madigan, Gregory Piatetsky-Shapiro, Vijay V. Raghavan, Rajeev Rastogi, Salvatore J. Stolfo, Alexander Tuzhilin, and Benjamin W. Wah), 10 Challenging Problems in Data Mining Research, International Journal of Information Technology & Decision Making, Vol. 5, No. 4, 2006, 597-604.

    While searching for this paper I encountered:

    Xindong Wu’s Publications in Data Mining and Machine Learning

    Pick any paper at random and you are likely to learn something new.

    No Comments

    No comments yet.

    RSS feed for comments on this post.

    Sorry, the comment form is closed at this time.

    Powered by WordPress