Every few years, the corporate information factory (CIF) is extended, as architecture and technological advances occur in the industry. The highlights of the additions to the 2004/2005 CIF are the inclusion of:

Unstructured data has been around for a long time. Unstructured data includes e-mail, spreadsheets, text files, Word documents and more. Typically, unstructured data is what you find on the desktop. Interestingly, there is a large world of structured data and a large world of unstructured data, but there is very little intersection between the two. There is a lot of very valuable data in the world of unstructured data, and it is a shame that there has been little intersection between the two environments all these years. Now there is unstructured ETL technology and there is the potential for intersection of the two worlds at last. One of the most intriguing new possibilities is unstructured visualization. Visualization today is really visualization of numbers and quantities. There are summarizations, drill down, drill across, detailed analysis and KPIs. All of this manipulation and visualization is based on the properties of numerical data. However, the fiber of unstructured data is made up of text, not numbers. Now there is unstructured visualization, based on text, which is the business intelligence of the unstructured world.

Register or login for access to this item and much more

All Information Management content is archived after seven days.

Community members receive:
  • All recent and archived articles
  • Conference offers and updates
  • A full menu of enewsletter options
  • Web seminars, white papers, ebooks

Don't have an account? Register for Free Unlimited Access