The Data Strategy Advisor
Master Data Management
Information Management Magazine, July 2005
An Old Problem Continues to Challenge both Users and Vendors
The role of master data management is back on the front burner again. When you think about it, it never really went away. Wherever you have data, you have master data. The care and management of that data is how the information system comes to represent the business context in which it operates. Master data is one of the ways of setting the standard for defining data and information quality. If the master data is out of line, then so is the quality of the information. The enterprise resource planning (ERP) revolution raised the hope of finally consolidating master data around a single transactional system of record. However, these hopes were dashed as proliferating instances of ERP applications were supplemented with customer relationship management (CRM), supply chain management (SCM), and analytic applications (data marts) corresponding to each. In short, the single version of the truth and its representation of the system of record continues to be a point on the horizon toward which our system development efforts converge, but which we never seem to be able to reach. If it is supposed to be a master file, then why are there so many of them? We are chasing a moving target. Meanwhile, the critical path to enterprise data warehousing continues to lie through the design and implementation of consistent and unified representations of customers, products and whatever other master data entities are needed to run your business.
Advertisement
Because the relational database does not support different versions of the same table with the same identifying name in the relational catalog, some installations have used the object-relational features of user-defined objects to simulate different versions of these dynamic dimensions. Though this is not intended to be a product evaluation (or endorsement), the Kalido data warehouse product targets this issue with an abstraction layer that makes possible the manipulation of master data across different dimensions. In my opinion, Kalido has finally come into its own in describing itself as a master data management tool first and a data warehouse second.
In the final analysis, data integration requires schema integration. Schema integration is the determination, reconciliation and rationalization of the underlying meaning of the data models representing the business entities and related functions. Technologies such as ETL (extract, transform and load) tools, meta data repositories and message brokers can be useful in rationalizing and conforming data to a consistent and unified representation of customer, products and other essential data dimensions; however, they cannot solve the problem of understanding how and why the definition of customer in your ERP system is different than that in the CRM or SCM system. For that, the insight of a data administrator is still needed. That is, human insight is needed, at least until advances in semantic analysis or perhaps even the elusive semantic chip comes forth from the labs.
The trend is to continue to expect a variety of technologies to be repositioned in an opportunistic way into the master data management market. In general, this is a good thing, provided that the tools really do offer automation and support for business methods in addressing master data management. Ultimately, the issue is about data architecture and there the recommendation is still to design and implement a centralized architecture for the economies of scale of centralized processing. If the enterprise is a highly distributed one, then it is still advantageous to design a single consistent representation of the master data even if the implementation requires incremental, stepwise rationalization across multiple instances. The compromises necessitated by the heterogeneous data environment of the modern multidivisional corporation mean that many firms will spend the better part of their lives in federation on the path to unification.
Lou Agosta is an independent industry analyst in data warehousing. A former industry analyst at Giga Information Group, Agosta has published extensively on industry trends in data warehousing, data mining and data quality. He can be reached at LAgosta@acm.org.
For more information on related topics, visit the following channels:






