Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

March 10, 2011

Top Ten Algorithms in Data Mining

Filed under: Algorithms,Data Mining — Patrick Durusau @ 9:13 am

Top Ten Algorithms in Data Mining

Summary of paper on data mining algorithms nominated and voted on by ACM KDD Innovation Award and IEEE ICDM Research Contributions Award winners to come up with a top 10 list.

I was curious about how the entries on the list from 2007 have fared.

I searched CiteseerX limiting the publication year to 2010.

The results, algorithm followed by citation count, were as follows:

  1. C4.5 – 41
  2. The k-Means algorithm – 86
  3. Support Vector Machines – 64
  4. The Apriori algorithm – 46
  5. Expectation-Maximization – 41
  6. PageRank – 19
  7. AdaBoost – 11
  8. k-Nearest Neighbor Classification – 36*
  9. Naive Bayes – 25
  10. CART (Classification and Regression Trees) – 11

*Searched as “k-Nearest Neighbor”.

Not a scientific study but enough variation to make me curious about:

  1. Broader survey of algorithm citation.
  2. What articles cite more than one algorithm?
  3. Are there any groupings by subject of study?

Not a high priority item but something I want to return to examine more closely.

1 Comment

  1. […]      […]

    Pingback by Top Ten Algorithms in Data Mining | Statistika | Scoop.it — August 27, 2012 @ 11:17 am

RSS feed for comments on this post.

Sorry, the comment form is closed at this time.

Powered by WordPress