Doesn't it require a lot of data duplication to keep tons of data in a central data warehouse, then export that data to various data marts? How do you keep down storage and management costs for huge amounts of data?
Sid Adelman's answer: Remember when they said that database management systems would eliminate redundant data? It never happened and never will. The data warehouse will result in a certain amount of data duplication, and a central (enterprise) data warehouse feeding data marts will result in data duplication. The best you can do is try to control unrestrained growth. See the article entitled, "The Data Warehouse Explosion" (by Sid Adelman) in the December, 1996 Issue of DM Review for ways to keep storage contained.
David Marco's answer: A data warehouse feeding dependent data marts (marts fed from a warehouse) certainly creates some data duplication, however the alternative of creating a series of independent data marts (sourced directly from operational systems) creates a much greater amount of data redundancy. This redundancy occurs because each of the independent data marts requires its own, typically duplicated copy of the detailed corporate data. Often a great deal of this detailed data is not required in the data marts, which typically provide summarized views.
Register or login for access to this item and much more
All Information Management content is archived after seven days.
Community members receive:
- All recent and archived articles
- Conference offers and updates
- A full menu of enewsletter options
- Web seminars, white papers, ebooks
Already have an account? Log In
Don't have an account? Register for Free Unlimited Access