In my column last month, I described the data transformation chain. In summary, all systems that transform raw data about business entities into information that can be used for business insight must go through a set of common steps: The quality derived at each step in the transformation is heavily dependent on the quality of the preceding steps. Thus, solving data quality problems, particularly those found when consolidating across databases, requires that you start remediation at the top and work down the chain. The example that I will use to illustrate the logic is a problem I have encountered at three different companies. In all three cases, the CEO asked what appeared to be a simple question: What 10, 25 or 50 companies are our largest customers? The last time I checked, smart and hardworking people at all three companies were still trying to answer the question. In all three companies, significant software expenditures were made for state-of-the-art customer data integration (CDI) and/or master data management (MDM) software, but the answer continues to elude the teams working on the question. Why is it so difficult? In large measure, it is because business entities are very complex. There are both external and internal reasons for the complexity. The following are examples of factors that introduce complexity. External factors include: Internal factors include: Whenever the exercise to determine the 25 largest customers is undertaken, start with a focus on the sourcing, matching and identifying steps of the transformation chain because linking up related entities within corporate family trees is the hardest step. You dont want to do this heavy lifting with data that is dirty when it arrives at the relationship process. The only way to solve the problem of determining the 25 largest customers is to: Lets make this concrete with an example. Suppose you work for a company with multiple databases supporting sales, customer service, order to cash and fulfillment. Your company has information about another company, lets call it ABCD, Inc. ABCD was created by the merger of AC Corp. and BD Inc. In the various databases from around the world, there might be business entities with names like AC Ltd., BD GmB, plus other business entities with names like AC Enterprises, BD Petroleum and other names that appear to be unrelated. Your sales teams around the globe may use shorthand names like A-D, etc. A salesperson may have a name and contact address for a subsidiary that was acquired last year, while the order-to-cash team has changed to the post-merger name. The point is that the family trees of multinational corporations are complicated and are regularly changing. If you want to determine your largest customers and understand how they interact with your company, start at the top of the data transformation chain. Simply tackling linkages will provide your CEO with a seriously flawed answer.
Vicki P. Raeburn is president of Scofield Ridge Associates, Inc., a business consultancy focused on data governance and data quality.She has nearly 30 years of leadership experience in the information industry, where she has held positions in global business and product development, marketing, and data operations. Most recently, Raeburn was chief quality officer at Dun & Bradstreet, consulting with customers on the successful implementation of customer information solutions. She has a B.A. from New College of Florida and a Ph.D. from Yale University. She may be reached at vicki@scoridge.com.










Be the first to comment on this post using the section below.