My impression is that data mining isn’t cumulative.
That is when I read about a new data mining technique, even over a known data set, like the ones used at TREC, they all make a fresh start on the data.
It is like having read a book and to find a particular passage, you start over at page one. That seems like a poor use of resources.
Another approach would be to record previously discovered relevant documents. Subsequent users can then benefit from what has been found. (Note the use of past tense.)
Can anyone suggest examples of cumulative data mining?