OCT 23, 2007 11:28am ET

Related Links

Oracle to Buy Taleo
February 9, 2012
Birst Automates Connections to Big Data
February 8, 2012
PaaS Matures, But With Doubts
February 3, 2012

Web Seminars

Deliver Better Enterprise Data through Better Reference Data Management
Available On Demand
The Strategic Move - Migrating to a New Data Warehouse Platform
Available On Demand
The Data Dilemma: Is the Answer in the Cloud?
Available On Demand

2007 ISA for Data Integration

Print
Reprints
Email

Data integration involved the acquiring, integrating and reconciling disparate data for analytic purposes.

Solution Implementer: RLPTechnologies

Solution Provider: DataFlux Corporation

Business Pain

R.L. Polk & Co. has set the standard for automotive vehicle and consumer data for years, continually improving its data management methods along the way. Polk was looking to maintain and extend its competitive advantage amid significant industry, regulatory and technology change. Polk recognized it needed to move beyond incremental improvements and develop a new, innovative approach that would greatly enhance its existing core foundational data warehouse.

  1. Industry challenges - In the automotive industry, original equipment managers (OEMs) and dealers alike are constantly pushing for the timeliest, most complete data and analytics to compete more effectively in a flat U.S. market.
  2. Regulatory compliance - Issues facing business today regarding data privacy made it clear that a flexible and agile IT environment was required to proactively get ahead of the likelihood of stricter regulations in the future.
  3. Technology change - As an organization collecting automotive data since 1922, Polk had grown a very complex data management environment that was difficult to maintain. Emerging technologies showed promise to streamline the environment, enabling the ability to introduce new data or application offerings faster, while lowering IT total cost of ownership.

As a result, RLPTechnologies, a subsidiary of R.L. Polk & Co., was asked to lead a re-engineering effort to implement breakthrough technologies that would collect, standardize and enhance data from disparate sources and compile them into a “single source of the truth” for distribution to analytical and operational applications. The project led to the development of OneView360° as a comprehensive, integrated application for data integration.

Successful Solution

OneView360° is a fully automated data integration solution designed to capture feeds from many disparate data sources, centrally maintain and manage master reference data, incorporate multipoint data quality inspections and load data into one or many databases.

The sophisticated service orchestration engine in OneView360° optimizes data production throughput, enabling real-time data integration. With an open, service-oriented architecture (SOA), OneView360° can seamlessly integrate investments in legacy, commercial off-the-shelf applications and Web services.

Within the implementation at Polk North America, the solution must handle large-scale, complex data management needs as Polk compiles data from more than 240 different sources, representing 500 million unique vehicle transactions per year, with data on over 246 million unique households. The system has been built and was deployed in phases over a 17-month period. The data operations group at Polk uses the solution to manage this wealth of data.

Two significant projects within the re-engineering effort had a very specific data quality focus - name and address standardization and data profiling. RLPTechnologies selected DataFlux dfPower Studio to analyze, improve and control data quality with the external interface file (EIF). The DataFlux Integration Server allowed RLPTechnologies to expose business rules for data quality via SOA, creating a real-time processing architecture to ensure that data entering the EIF meets corporate standards for data integrity. The name and address project was tasked with implementing decades of unique and complex business rules used to both parse and standardize name and address data.

The second data quality-focused initiative was data profiling, which was designed to catch data quality issues as soon as possible in the lifecycle of the data. Issues monitored by this process include incomplete data files with missing or incomplete records and content and layout changes made by a data supplier without notice.

The DataFlux tools used to perform name and address standardization included Architect and the Quality Knowledge Base (QKB) from DataFlux’s dfPower Studio. The Architect tool provided a series of predefined processing nodes that enabled Polk to properly separate and subsequently parse data elements into tokens such as first, last, street and town names. Polk also tapped the power of DataFlux to perform address standardization, gender analysis and generate match codes to perform householding. The QKB provided Polk with the flexibility to create custom rules that would easily integrate with all future releases of DataFlux software. DataFlux’s powerful source data profiling engine provides all the raw data necessary for EIF Custom Profiling engine to evaluate rules, measure thresholds and alter for violations.

Innovation

We believe our originality and uniqueness stem from several areas:

  • Applying lean manufacturing principles to the discipline of data management, our solution is built as a “data factory” to automate data management and database production.
  • We have built a proprietary service orchestration engine, of which we have a provisional patent, for the uniqueness and sophistication employed to improve data processing throughput in complex, large-scale data management needs.
  • Our solution provides inherent data governance and workflow capabilities - simplifying how data and business analysts can work in one application vs. many.

We have been recognized in several ways that speak to the innovativeness and originality of the solution.

  • On September 25, 2006, Polk was selected by Computerworld as a recipient of its “Best Practices in Business Intelligence” awards program in the category of “Planning, Designing and Building the BI Infrastructure.”
  • RLPTechnologies was recognized by JBoss in June, 2006 as the Innovator of the Year.
  • In October, 2006, RLPTechnologies was honored by DataFlux with its Innovation Award for exemplifying superlative vision and creativity.
  • On June 4, 2007, Polk was recognized as a Laureate by the Computerworld Honors Program for its use of OneView360° within its information technology to benefit society.
  • In the Gartner research note titled, Cross-Functional Analytics Are Key to Auto Industry Agility, of August 25, 2006 by Thilo Koslowski, Research Vice President, he notes that RLPTechnologies' OneView360° is the first offered solution that specifically addresses the needs of the automotive industry.”

Filed under:

Advertisement

Comments (0)

Be the first to comment on this post using the section below.

Add Your Comments:
You must be registered to post a comment.
Not Registered?
You must be registered to post a comment. Click here to register.
Already registered? Log in here
Please note you must now log in with your email address and password.
Twitter
Facebook
LinkedIn
Login  |  My Account  |  White Papers  |  Web Seminars  |  Events |  Newsletters |  eBooks
FOLLOW US
Please note you must now log in with your email address and password.