Knowledge Discovery from Mining Big Data – Presentation by Kirk Borne by Bruce Berriman.
From the post:
My friend and colleague Kirk Borne, of George Mason University, is a specialist in the modern field of data mining and astroinformatics. I was delighted to learn that he was giving a talk on an introduction to this topic as part of the Space Telescope Engineering and Technology Colloquia, and so I watched on the webcast. You can watch the presentation on-line, and you can download the slides from the same page. The presentation is a comprehensive introduction to data mining in astronomy, and I recommend it if you want to grasp the essentials of the field.
Kirk began by reminding us that responding to the data tsunami is a national priority in essentially all fields of science – a number of nationally commissioned working groups have been unanimous in reaching this conclusion and in emphasizing the need for scientific and educational programs in data mining. The slides give a list of publications in this area.
Deeply entertaining presentation on big data.
The first thirty minutes or so are good for “big data” quotes and hype but the real meat comes at about slide 22.
Extends the 3 V’s (Volume, Variety, Velocity) to include Veracity, Variability, Venue, Vocabulary, Value.
And outlines classes of discovery:
- Class Discovery
- Finding new classes of objects and behaviors
- Learning the rules that constrain the class boundaries
- Novelty Discovery
- Finding new, rare, one-in-a-million(billion)(trillion) objects and events
- Correlation Discovery
- Finding new patterns and dependencies, which reveal new natural laws or new scientific principles
- Association Discovery
- Finding unusual (improbable) co-occurring associations
A great presentation with references and other names you will want to follow on big data and astroinformatics.