Data integration involves acquiring, integrating and reconciling disparate data for analytic purposes.
Solution Implementer: RLPTechnologies
Solution Provider: DataFlux Corporation
Business Pain
R.L. Polk & Co. has set the standard for automotive vehicle and consumer data for years, continually improving its data management methods along the way. Polk was looking to maintain and extend its competitive advantage amid significant industry, regulatory and technology change. Polk recognized it needed to move beyond incremental improvements and develop a new, innovative approach that would greatly enhance its existing core foundational data warehouse.
- Industry challenges - In the automotive industry, original equipment manufacturers (OEMs) and dealers alike are constantly pushing for the timeliest, most complete data and analytics to compete more effectively in a flat U.S. market.
- Regulatory compliance - Data privacy issues facing businesses today made it clear that a flexible and agile IT environment was required to stay ahead of the stricter regulations likely to come.
- Technology change - As an organization collecting automotive data since 1922, Polk had grown a very complex data management environment that was difficult to maintain. Emerging technologies showed promise to streamline the environment, making it possible to introduce new data or application offerings faster while lowering IT total cost of ownership.
As a result, RLPTechnologies, a subsidiary of R.L. Polk & Co., was asked to lead a re-engineering effort to implement breakthrough technologies that would collect, standardize and enhance data from disparate sources and compile them into a single source of the truth for distribution to analytical and operational applications. The project led to the development of OneView360° as a comprehensive, integrated application for data integration.
Successful Solution
OneView360° is a fully automated data integration solution designed to capture feeds from many disparate data sources, centrally maintain and manage master reference data, incorporate multipoint data quality inspections and load data into one or many databases.
The sophisticated service orchestration engine in OneView360° optimizes data production throughput, enabling real-time data integration. With an open, service-oriented architecture (SOA), OneView360° can seamlessly integrate investments in legacy, commercial off-the-shelf applications and Web services.
Within the implementation at Polk North America, the solution must handle large-scale, complex data management needs: Polk compiles data from more than 240 different sources, representing 500 million unique vehicle transactions per year, with data on over 246 million unique households. The system was built and deployed in phases over a 17-month period. The data operations group at Polk uses the solution to manage this wealth of data.
Two significant projects within the re-engineering effort had a very specific data quality focus - name and address standardization and data profiling. RLPTechnologies selected DataFlux dfPower Studio to analyze, improve and control data quality with the external interface file (EIF). The DataFlux Integration Server allowed RLPTechnologies to expose business rules for data quality via SOA, creating a real-time processing architecture to ensure that data entering the EIF meets corporate standards for data integrity. The name and address project was tasked with implementing decades of unique and complex business rules used to both parse and standardize name and address data.
The second data quality-focused initiative was data profiling, which was designed to catch data quality issues as soon as possible in the lifecycle of the data. Issues monitored by this process include incomplete data files with missing or incomplete records and content and layout changes made by a data supplier without notice.
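The two kinds of issues named above (incomplete records and unannounced layout changes) can be illustrated with a minimal Python sketch. It is not the DataFlux or EIF implementation; the function names, the `ProfileReport` type and the 5 percent threshold are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ProfileReport:
    """Summary of one profiling pass over a supplier feed (hypothetical type)."""
    layout_changed: bool
    incomplete_rows: int
    total_rows: int

    @property
    def incomplete_ratio(self) -> float:
        return self.incomplete_rows / self.total_rows if self.total_rows else 0.0


def profile_feed(header, rows, expected_columns, required_columns):
    """Compare the feed layout to the expected one and count incomplete rows."""
    # A layout change is any difference in column names or their order.
    layout_changed = list(header) != list(expected_columns)
    incomplete = 0
    for row in rows:
        record = dict(zip(header, row))
        wrong_width = len(row) != len(header)
        missing_value = any(
            not (record.get(col) or "").strip() for col in required_columns
        )
        if wrong_width or missing_value:
            incomplete += 1
    return ProfileReport(layout_changed, incomplete, len(rows))


def violations(report, max_incomplete_ratio=0.05):
    """Return alert messages; the threshold is an assumed example value."""
    alerts = []
    if report.layout_changed:
        alerts.append("layout differs from the expected column list")
    if report.incomplete_ratio > max_incomplete_ratio:
        alerts.append(
            f"{report.incomplete_rows} of {report.total_rows} rows are incomplete"
        )
    return alerts
```

Running checks like these as each file arrives, before any load step, is what lets a supplier-side change surface early in the data lifecycle instead of after it has reached downstream databases.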
The DataFlux tools used to perform name and address standardization included Architect and the Quality Knowledge Base (QKB) from DataFlux's dfPower Studio. The Architect tool provided a series of predefined processing nodes that enabled Polk to properly separate and subsequently parse data elements into tokens such as first, last, street and town names. Polk also tapped the power of DataFlux to perform address standardization and gender analysis and to generate match codes for householding. The QKB provided Polk with the flexibility to create custom rules that would easily integrate with all future releases of DataFlux software. DataFlux's powerful source data profiling engine provides all the raw data necessary for the EIF Custom Profiling engine to evaluate rules, measure thresholds and alert on violations.
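As a rough illustration of parsing, standardization and householding, the Python sketch below splits a name into tokens, normalizes a street line and derives a household key. It does not use the DataFlux API or QKB definitions: every function name and the abbreviation table are hypothetical, and the exact-match key is a simplified stand-in for real match codes, which are typically designed to tolerate spelling variation.

```python
import re

# Hypothetical, deliberately tiny abbreviation table for illustration only.
STREET_ABBREVIATIONS = {"street": "ST", "avenue": "AVE", "road": "RD", "drive": "DR"}


def parse_name(full_name):
    """Split a free-form name into first and last tokens (middle parts ignored)."""
    parts = re.sub(r"[^A-Za-z\s'-]", " ", full_name).split()
    if not parts:
        return {"first": "", "last": ""}
    last = parts[-1].title() if len(parts) > 1 else ""
    return {"first": parts[0].title(), "last": last}


def standardize_street(street):
    """Uppercase a street line, drop punctuation and abbreviate known suffixes."""
    words = re.sub(r"[^\w\s]", "", street).split()
    return " ".join(STREET_ABBREVIATIONS.get(w.lower(), w.upper()) for w in words)


def household_match_code(last_name, street, postal_code):
    """Build a key so records with the same surname and address group together."""
    surname = re.sub(r"[^A-Z]", "", last_name.upper())
    return "|".join([surname, standardize_street(street), postal_code.strip()[:5]])


# Two differently formatted records for the same household yield the same key.
a = household_match_code(parse_name("John Q. Smith")["last"], "123 Main Street", "48075")
b = household_match_code("SMITH ", "123 main st.", "48075-1234")
same_household = a == b
```

Grouping on a derived key rather than on raw text is the design point: formatting differences between suppliers are absorbed during standardization, so the householding step only compares normalized values.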
Innovation
We believe our originality and uniqueness stem from several areas:
- Applying lean manufacturing principles to the discipline of data management, our solution is built as a data factory to automate data management and database production.
- We have built a proprietary service orchestration engine, for which we hold a provisional patent, recognizing the uniqueness and sophistication it employs to improve data processing throughput in complex, large-scale data management environments.
- Our solution provides inherent data governance and workflow capabilities, letting data and business analysts work in one application rather than many.
We have been recognized in several ways that speak to the innovativeness and originality of the solution.
- On September 25, 2006, Polk was selected by Computerworld as a recipient of its Best Practices in Business Intelligence awards program in the category of Planning, Designing and Building the BI Infrastructure.
- RLPTechnologies was recognized by JBoss in June 2006 as the Innovator of the Year.
- In October 2006, RLPTechnologies was honored by DataFlux with its Innovation Award for exemplifying superlative vision and creativity.
- On June 4, 2007, Polk was recognized as a Laureate by the Computerworld Honors Program for its use of OneView360° within its information technology to benefit society.
- In the Gartner research note "Cross-Functional Analytics Are Key to Auto Industry Agility" (August 25, 2006), Research Vice President Thilo Koslowski notes that RLPTechnologies' OneView360° is the first solution offered that specifically addresses the needs of the automotive industry.









