Question: I am writing a thesis on data warehousing. I would like to know how to decide when to implement Kimball or Inmon's approach. When is it better to have normalized data to create data marts and when is it better to have dimensional data? Are there some criteria or guidelines
Chuck Kelleys Answer: Hoping not be bombarded by emails, I still dont get the difference between Kimballs and Inmons approach. Kimball looks at the data from the usage of the business and hence concentrates on data marts and star schemas, which is the right thing to do.
Inmon on the other hand looks at it from a data management prospective. That is, how you manage the data, which is the right thing to do.
From a high level, Kimball and Inmon say the same thing. Extract data from your sources (and external data) and put them in a big blob (Kimball calls it a staging area and Inmon calls it a data warehouse). Next, extract the data required to solve business issues and load them into a data mart. Yes, I know that there are some little nuances that separate them, but on the whole, I think they are the same, except for the perspective.
Joe Oates Answer: First of all, in the current Inmon approach, the enterprise data warehouse (EDW) is normalized and has denormalized or dimensional data marks off of the EDW for ease of querying. Also, there is a lot of confusion over what a dimensional model is.
Initially, dimensional designs were described as being strictly star schemas with all of the dimension tables as flat tables without many-to-many relationships. As dimensional modeling has matured, permissible snowflaking (a normalization technique) has become part of dimensional modeling as well as helper or bridge tables which are associative or junction tables that resolve a many-to-many relationship. An example of permissible snowflaking is the heterogeneous product problem. An example of a helper table is multiple insured drivers associated with a single insurance policy. Both of these are covered in The Data Warehouse Toolkit Second Edition by Ralph Kimball et al. The former example is found on page 213 and the latter is found on page 318.
So, the bottom line is that, while there are still real differences, the Inmon and Kimball approaches have grown closer over time.
While having successfully done data warehouses using both approaches, I personally have found that a combination of the two approaches works well for enterprise data warehouses where there is value in things like supertypes and subtypes. On the other hand, the dimensional approach is almost always adequate for data marts.
Larissa Mosss Answer: Both of Steve Hobermans books have great guidelines for data modeling: Data Modelers Workbench (ISBN 0-471-11175-9) and Data Modeling Made Simple (ISBN 0-9771400-0-8). You can also sign up for his Design Challenges.
Chuck Kelley is an internationally known expert in database and data warehousing technology. He has 30 years of experience in designing and implementing operational/production systems and data warehouses. Kelley has worked in some facet of the design and implementation phase of more than 50 data warehouses and data marts. He also teaches seminars, co-authored four books on data warehousing and has been published in many trade magazines on database technology, data warehousing and enterprise data strategies. He can be contacted at firstname.lastname@example.org.
Joe Oates is an internationally known speaker, author and consultant on data warehousing. Oates has more than 30 years of experience in the successful management and technical development of business, real-time and data warehouse applications for industry and government clients. He has designed or helped design and implement more than 30 successful data warehouse projects.
Larissa Moss is founder and president of Method Focus Inc., a company specializing in improving the quality of business information systems. She has more than 20 years of IT experience with information asset management. Moss is coauthor of three books: Data Warehouse Project Management (Addison-Wesley, 2000), Impossible Data Warehouse Situations (Addison-Wesley, 2002) and Business Intelligence Roadmap: The Complete Project Lifecycle for Decision- Support Applications (Addison-Wesley, 2003). Moss can be reached at email@example.com.