CDG – Community Data Generator
From the post:
CDG is a datawarehouse generator and the newest member of the Ctools family. Given the definition of dimensions that we want, CDG will randomize data within certain parameters and output 3 different things:
- Database and table ddl for the fact table
- A file with inserts for the fact table
- Mondrian schema file to be used within pentaho
While most of the documentation mentions the usage within the scope of Pentaho there’s absolutely nothing that prevents the resulting database to be used in different contexts.
I had mentioned ctools before but not in any detail. This was the additional resource that made me pick them back up.
It isn’t hard to see how this data generator will be useful.
For subject-centric software, generating files with known “same subject” characteristics would be more useful.
Thoughts, suggestions or pointers to work on generation of such files?