Call for contribution: the RDataMining package – an R package for data mining by Yanchang Zhao.
Join the RDataMining project to build a comprehensive R package for data mining http://www.rdatamining.com/package
We have started the RDataMining project on R-Forge to build an R package for data mining. The package will provide various functionalities for data mining, with contributions from many R users. If you have developed or will implement any data mining algorithms in R, please participate in the project to make your work available to R users worldwide.
Background
Although there are many R packages for various data mining functionalities, there are many more new algorithms designed and published every year, without any R implementations for them. It is far beyond the capability of a single team, even several teams, to build packages for oncoming new data mining algorithms. On the other hand, many R users developed their own implementations of new data mining algorithms, but unfortunately, used for their own work only, without sharing with other R users. The reason could be that they donot know or donot have time to build packages to share their code, or they might think that it is not worth building a package with only one or two functions.
Objective
To forester the development of data mining capability in R and facilitate sharing of data mining codes/functions/algorithms among R users, we started this project on R-Forge to collaboratively build an R package for data mining, with contributions from many R users, including ourselves.
Definitely worth considering if you are using R for data mining.
It also makes me think of the various public data dumps. I assume someone has mined some (most?) of those and has gained insights into their quirks.
Are there any projects gathering data mining tips or experiences with public data sets? Or are those buried in footnotes or asides, when they are recorded at all?