Master Indexing and the Unified View by David Loshin.
From the post:
1) Identity resolution – The master data environment catalogs the set of representations that each unique entity exhibits in the original source systems. Applying probabilistic aggregation and/or deterministic rules allows the system to determine that the data in two or more records refers to the same entity, even if the original contexts are different.
2) Data quality improvement – Linking records that share data about the same real-world entity enable the application of business rules to improve the quality characteristics of one or more of the linked records. This doesn’t specifically mean that a single “golden copy” record must be created to replace all instances of the entity’s data. Instead, depending on the scenario and quality requirements, the accessibility of the different sources and the ability to apply those business rules at the data user’s discretion will provide a consolidated view that best meets the data user’s requirements at the time the data is requested.
3) Inverted mapping – Because the scope of data linkage performed by the master index spans the breadth of both the original sources and the collection of data consumers, it holds a unique position to act as a map for a standardized canonical representation of a specific entity to the original source records that have been linked via the identity resolution processes.
In essence this allows you to use a master data index to support federated access to original source data while supporting the application of data quality rules upon delivery of the data.
It’s been a long day but does David’s output have all the attributes of a topic map?
- Identity resolution – Two or more representatives the same subject
- Data quality improvement – Consolidated view of the data based on a subject and presented to the user
- Inverted mapping – Navigation based on a specific entity into original source records