Large Scale Data Mining Using Genetics-Based Machine Learning. Authors: Jaume Bacardit, Xavier Llorà
Tutorial on data mining with genetics-based machine learning algorithms.
It uses the usual examples of exploding information, from genetics to high energy physics.
While those are good examples, it really isn’t necessary to go there in order to get large scale data sets.
Imagine constructing a network for all the entities and their relationships in a single issue of the New York Times.
That data isn’t as easy to obtain or to process as genetic databases or results from the Large Hadron Collider.
But that is a question of ease of access and processing, not of whether the data is large scale.
The finance pages alone have listings for all the major financial institutions in the country. What about mapping their relationships to each other?
Or for that matter, mapping the phone calls, emails and other communications between the stock trading houses, broken down by subjects discussed?
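To make the idea of a relationship network concrete, here is a minimal sketch in Python. It assumes the entities in each article have already been identified somehow (the hard part, and not shown); the article sets, the placeholder names such as "Bank A", and the function name cooccurrence_network are all hypothetical, invented for illustration. It simply counts how often two entities are mentioned in the same article.

```python
from collections import Counter
from itertools import combinations

# Hypothetical, hand-labeled input: the set of entities mentioned in each
# article. The names are placeholders, not real data.
articles = [
    {"Bank A", "Bank B", "Regulator X"},
    {"Bank A", "Bank B"},
    {"Bank B", "Fund C"},
]

def cooccurrence_network(articles):
    """Count how many articles mention each pair of entities together."""
    edges = Counter()
    for entities in articles:
        # Sort so each undirected pair has exactly one canonical key.
        for pair in combinations(sorted(entities), 2):
            edges[pair] += 1
    return edges

network = cooccurrence_network(articles)
for (a, b), weight in network.most_common():
    print(f"{a} -- {b}: {weight}")
```

An article that mentions n entities contributes n(n-1)/2 pairs, so even one issue of a newspaper can produce a sizable network.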
Important problems, as often as not, have data that is difficult to acquire. That doesn’t make them any less important.