for Information Management Blogs
FEB 22, 2011 8:25am ET

Blogroll

Has Data Become a Four-Letter Word?

Print
Reprints
Email

In her excellent blog post “'The Bad Data Ate My Homework'’' and Other IT Scapegoating,” Loraine Lawson explained how “there are a lot of problems that can be blamed on bad data. I suspect it would be fair to say that there’s a good percentage of problems we don’t even know about that can be blamed on bad data and a lack of data integration, quality and governance.”

Lawson examined whether bad data could have been the cause of the bank foreclosure fiasco, as opposed to, as she concludes, the more realistic causes being bad business and negligence, which, if not addressed, could lead to another global financial crisis.

“Bad data,” Lawson explained, “might be the most ubiquitous excuse since ‘the dog ate my homework.’ But while most of us would laugh at the idea of blaming the dog for missing homework, when someone blames the data, we all nod our heads in sympathy, because we all know how troublesome computers are. And then the buck gets (unfairly) passed to IT.”

Unfairly blaming IT, or technology in general, when poor data quality negatively impacts business performance is ignoring the organization’s collective ownership of its problems, and its shared responsibility for the solutions to those problems, and causes, as Lawson explained in “Data’s Conundrum: Everybody Wants Control, Nobody Wants Responsibility,” an “unresolved conflict on both the business and the IT side over data ownership and its related issues, from stewardship to governance.”

In organizations suffering from this unresolved conflict between IT and the Business – a dysfunctional divide also known as the IT-Business Chasm – bad data becomes the default scapegoat used by both sides.

Perhaps, in a strange way, placing the blame on bad data is progress when compared with the historical notions of data denial, when an organization’s default was to claim that it had no data quality issues whatsoever.

However, admitting bad data not only exists, but that bad data is also having a tangible negative impact on business performance doesn’t seem to have motivated organizations to take action.  Instead, many appear to prefer practicing bad data blame-storming, where the Business blames bad data on IT and its technology, and IT blames bad data on the Business and its business processes.

Or perhaps, by default, everyone just claims that “the bad data ate my homework.”

Are your efforts to convince executive management that data needs to treated like a five-letter word (“asset”) being undermined by the fact that data has become a four-letter word in your organization?

 

Advertisement

Comments (9)
Bad data is only a valid excuse if management refuses to dedicate any resources to its resolution. Once bad data is identified and the source is identified it's time for action to correct it, rather than using it as a crutch. However, it has been my experience that the owners of the systems that are generating bad data are often able to convince management that it is too costly and difficult to fix it at the source - so why don't we just make IT "fix" it as it is moved from source to destination. Architecturally this is the worst option (well, second worst to "make the reports account for bad data"). But the difference is that IT can usually accomplish this so we are victims of our own proficiency. I have actually heard first hand conversations where the owners of reports who have coded for bad data are upset when the data is cleaned up and insist that IT "unfix" the data so the reports work. To my amazement, management agreed to this solution - they purposefully messed up their own data. I chose to leave that company, for obvious reasons.
Posted by Mark V | Wednesday, February 23 2011 at 10:41AM ET
I would suggest something completely different: it is not data or data quality that is the problem. It is that people, and by extension organizational cultures, that think that data is absolute that are the problem. The phrase: "the data is bad" is the epitome of the innumeracy that is plaguing our progress in the 21st century. All raw information has some uncertainty associated with it and any process that uses that data needs to take that into account. So by definition there isn't any 'bad' data, just data with a certain amount of uncertainty.

The reason that organizations like Amazon and Google run circles around their competition is because in their corporate cultures they embrace that uncertainty and bake it right into their business processes.

Talking about data in the absolute is missing the bigger picture.

Posted by Theo O | Wednesday, February 23 2011 at 10:41AM ET
Add Your Comments:
You must be registered to post a comment.
Not Registered?
You must be registered to post a comment. Click here to register.
Already registered? Log in here
Please note you must now log in with your email address and password.

Blog Archive for Jim Harris

Pondering a Big Data Philosophy
Galileo, the Hubble and Clear Data Insight
When Poor Data Quality Lands on the Ledger
Poor Data Quality That Kills
Data Quality and the OK Plateau

More from Jim Harris »

Blog Index »

Where do young IT professionals (30 and under) obtain information to aid with daily role responsibilities and career development?

Trade publication websites 14%
Social media 23%
Vendor websites 4%
Vendor/community forums 7%
Newsletters 1%
Trade conferences/meetups 2%
RSS feeds 6%
Web search 44%

 

Twitter
Facebook
LinkedIn
Login  |  My Account  |  White Papers  |  Web Seminars  |  Events |  Newsletters |  eBooks
FOLLOW US
Please note you must now log in with your email address and password.