The Internet is spectacular, 90 billion pages and growing, 352 million authors, 171 million domain names and 945 million users. The Internet is a result of rapid deployable technology that allows anyone with a computer to place their thoughts, concerns and expertise online. The majority of the information placed on the Web is not to sell something but rather to simply be heard. The Internet allows us all to be heard, perceived and branded by our own imagination. The Internet is dynamic, untethered, built with passion, unedited, governed only by technology and available to all. The Internet works 24x7, 365 days per year and works equally well in Sharpsburg, Georgia, as it does in Shanghai, China. The Internet is not always accurate, sometimes confusing, without management or guidance and many times fails to deliver accurate information to the end user.
The Internet is a thing of beauty that works precisely because it is uncontrolled and the sheer volume of information that is available. The Internet works because of four specific reasons: diversity, independence, decentralization and aggregation of information. The first three are obvious and will continue to expand beyond our imagination. Aggregation is the real problem with the Internet, and the question remains how you aggregate 90 billion sources of information. The Internet is meta data's greatest failure. We revel in the success of search engines such as Yahoo! and Google with their indexing millions of pages of distributed content. Yet, type in "meta data" and you get 10,900,000 hits which has grown from 2 million just three years ago. How many of those sites actually provide a solid foundation of meta data? Type in "enterprise meta data" and you drop just below a million hits. Again, how many of these links are you going to follow? Better yet, what about the other 70 billion pages not indexed by the search engines? (It is estimated that only 10 percent of pages are indexed). What about the semantic Web?
Register or login for access to this item and much more
All Information Management content is archived after seven days.
Community members receive:
- All recent and archived articles
- Conference offers and updates
- A full menu of enewsletter options
- Web seminars, white papers, ebooks
Already have an account? Log In
Don't have an account? Register for Free Unlimited Access