| What are the differences between active data warehousing and real-time data warehousing? | |
|
A: |
Larissa Moss'Answer: Active data warehousing is Teradata's term for real-time data warehousing. "Real time" is somewhat of a misnomer since there is no "two-phase commit" updating of the DW database(s) and the operational database(s) synchronizing the databases at the same time (the impact to operational systems would be too great). In other words, DW databases are updated in small batch streams independently and usually lag behind at least a few minutes. "Near real-time" data warehousing might be a more accurate term. Les Barbusinski's Answer: Although these terms are often thought of as synonymous, they're actually quite different. A real-time data warehouse is one that acquires, cleanses, transforms, stores and disseminates information in near real time. An active data warehouse, on the other hand, is one that operates in a closed loop with one-or-more operational systems. It receives raw transactional data from the operational systems and disseminates cleansed, integrated and enhanced information back to them via Web services, specialized APIs, message brokers and database triggers. In this way, the data warehouse becomes an "active" participant in driving the operational processes of the enterprise. Hope this helps. |
Larissa Moss is founder and president of Method Focus Inc., a company specializing in improving the quality of business information systems. She has more than 20 years of IT experience with information asset management. Moss is coauthor of three books: Data Warehouse Project Management (Addison-Wesley, 2000), Impossible Data Warehouse Situations (Addison-Wesley, 2002) and Business Intelligence Roadmap: The Complete Project Lifecycle for Decision- Support Applications (Addison-Wesley, 2003). Moss can be reached at methodfocus@earthlink.net.










Be the first to comment on this post using the section below.