Information Management

Information Management

Advertisement

Free Site RegistrationFree Site Registration

Blogs

The Rise of the Columnar Database

Evan Levy
Information Management Blogs, May 19, 2009

I’m continually surprised that more vendors haven’t hurled themselves onto the columnar database bandwagon. The more this space matures, the more evident it becomes that analytics is a perfect match for column-based database architectures.

One of the most frustrating phenomena to IT is adherence to a theoretical view. In the 1970s the entire relational database industry implemented what was really an academic precept. For those pragmatists who haven’t dusted off their textbooks recently, I’ll recall the writings of Codd and Date. They introduced the concepts of organizing data in tuples, organizing primary values along with their descriptive details (aka: attributes). Vendors interpreted this to mean that data should be physically stored in this fashion, architecting their products to store data in tables, populated with rows consisting of columns. If you wanted to access a value, you had to retrieve the entire row.

With all due respect, this approach has been cumbersome since Day 1. The fact is, storing data the way the business looks doesn’t lend itself to the way people ask questions. When I create an outbound marketing list, I need a name, a phone number, and an address. I don’t need information on household, demographic segment, or the name of a customer’s dog. 

While I do need to store all the customer data, I don’t want to be bogged down by processing all that data in order to answer my question. Herein lies the quandary: do I structure the data based on all the information we have, or based on the information I might access?

Vendors have tried to bridge the gap. We’ve seen partitioning, star indexes, query pre-processing, bitmap and index joins, and even hashing in an attempt to support more specific data retrieval. Such solutions still require examining the contents of the entire row.

Although my background is in engineering, I know enough about Occam’s razor to know that it applies here: the simplest solution is the best one. Vendors like Kickfire, Vertica, Paraccel, and Sybase—whose pioneering IQ product launched over a dozen years ago--went back to the drawing board and fixed the problem, architecting their products structure and store the data the way people ask questions—in columns. 

For you SQL jockeys, most of the heavy-lifting in database processing is in the where clause. Columnar databases are faster because their processing isn’t inhibited by unnecessary row content. Because many database tables can have upwards of 100 columns, and because most business questions only request a handful of them, this just makes business sense. And In these days of multi-billion row tables and petabyte-sized systems, columnar databases make more sense than ever.

As the data warehouse market continues to consolidate through acquisitions, look for column-based startups—including several open-source solutions—to fill the void. If you ask me, there’s plenty of room.

22 Comments

Occam's razor, in latin, is translated to something like: "Entities should not be unnecessarily multiplied." I think he preferred normalization, or at least discouraged denormalization.

Posted by: Steve R | January 4, 2010 1:13 PM

Report this Comment


We've implemented our columnar database for some major companies in Brazil - credit cards, banks, telecom, credit bureau industries - with hundreds of columns and billions of rows, and we are getting outstanding performance for OLAP applications, running queries on wintel platform, with 1 to 5 sec of response time. We developed this technology using Codd and Date concepts, strictly as is, twenty years ago.

Posted by: Waldo | October 26, 2009 3:31 PM

Report this Comment


In the years that I've been working in the technology industry (we won't get into how much gray hair or wrinkles I have) there's always lively discussions regarding the effectiveness of a new technical paradigm or design. Some of my experineces include when...

* Most vendors and IT groups fought the effectiveness and benefits of relational database technology in the early 80's. The argument was speed, maturity, and reliability of the incumbant technology.

* Most folks didn't believe parallel processing was practical in commercial application in the mid- and late-80's. The masses said it was too complex, and impractical.

* I myself battled (and lost) an argument in the late 80's about my belief that windows was more likely to succeed than desktop Unix (Everyone thought that was settled...but they're back...)

I'd also be remiss if I didn't mention the number of "can't miss technologies" that delivered incremental (or possibly invisible) benefits -- the Apple Newton, Next, BeOS, PC-RT, AI for the masses, Apple Lisa, XT/370, etc.

Tne one thing that's become clear -- as processor speeds have increased, memory costs have dropped, and disk capacity expanded, many "failed" technologies became practical and possible.

At the end of the day, the proof will be in functioning software with real world results. I wouldn't bet against columnar databases -- some of the numbers the early customers are seeing challenge the traditional paradigm.

It's important to consider that the focus of these newer companies isn't to replace existing applications, but to target the applications that simply aren't possible with "today's" proven technology.

Time will tell.

E.

Posted by: Evan L | June 11, 2009 12:58 AM

Report this Comment


Several comments in this thread suggest that Columnar and MPP are mutually exclusive. That is not the case. The latest generation of columnar databases including Vertica's Analytic Database are based on MPP architectures. This combines the best of two different worlds. Less I/O and the scalability/performance of MPP architectures. The I/O advantages of colunmnar databases are dramatic for appropriate workloads. (SELECT * is not typically one of them!) However I disagree with David N's assertion that the number of columns required for "true analytics" is beyond the reach of columnar databases. We have customers who are regularly accessing dozens of columns in their queries and enjoy orders of magnitude better performance over row-based alternatives. I would also suggest that Ben Werther's comment about tuple-assembly cost does not apply to Vertica the way it may to other colunmnar databases because tuple reassembly (and conversion of compressed data to its original expanded form) is deferred until result sets are materialized as part of the query results.

FULL DISCLOSURE: As you might have determined from my comments I work for Vertica.

Posted by: DMenninger | June 4, 2009 4:29 PM

Report this Comment


Columnar databases are great if: 1) Your table has lots of columns (100s) 2) You only want to query a small number of them (low 10s or less)

Problem is, if you deviate from these, the tuple re-assembly costs kill you. Products like Sybase IQ and Vertica do good business targeting the niche data mining workloads that have these characteristics. But good luck trying to find either of these products in use in a general-purpose EDW or analytical data mart.

'Columnar-oriented storage' is really just one of many possible features of a database. Far more is the degree of parallelism -- hence the dominance of MPP (massively-parallel-processing) databases like Teradata, Greenplum and Netezza for high-end data warehousing and analysis. These guys squash the columnar guys on most data warehouse usage scenarios and handle any schema and arbitrary ad-hoc queries without the fragility of a columnar approach.

Posted by: Ben Werther | May 27, 2009 6:09 PM

Report this Comment


Michael M: Yes, that's the right quote, and the right spelling of (William of) Ockham, even though Occam has become the prevalent way to spell it, as in Occam's Razor. However, the implication is still to "favor" the simpler solution, but not to assume it is correct. During his life, theologians and rational philosophers worked very hard to catch up myth with science, creating extremely complex cosmologies, for example, to explain the motion of the heavenly bodies that still conformed to the geocentric approach rather than the simpler heliocentric model.

So my original point was, if a database approach seems simpler, you may favor it, but not accept it until proven.

-Neil Raden

Posted by: Neil R | May 27, 2009 1:32 PM

Report this Comment


Leslie S: I didn't get acquainted with APL until 1977, but what you describe, matrix inversion, was a primitive of APL. It's a stretch calling APL a database, though. It was just a cryptic interpretive language for data and text manipulation. IBM did, however, develop a columnar database based on APL called ADRS (A Departmental Reporting System) which used APL to extract and transform record-type data into ADRS' columnar structure using a sort of proto-ETL tool called APLDI (APL-Data Interchange).

In the early 80's, I worked with a guy (Clark French) who invented a columnar database called Expressway, which he sold to Sybase and was renamed SybaseIQ. And as someone else has pointed out, Model204 has been around for decades and still endures.

So columnar has been around for a long time, but that doesn't mean that newer or mature implementations of it aren't far better suited to our technology today. for example, none of these early tools could handle parallelism, large memory models or operate with standards-based APLI's and SDKs.

As far as performance is concerned, the case can be made that reading whole pages of data to access a few bits of it is wasteful, but it isn't yet clear to me that it's such as case of black and white. Row-based databases can index and compress too, and once the data you need is in memory, how much difference does it make? We'll have to see.

-Neil Raden

Posted by: Neil R | May 27, 2009 1:22 PM

Report this Comment


Good article and fair portrayal of column databases. However, column databases are but one take on the challenges posed by traditional single-processor RBDMS architectures. MPP is another approach, and has had far more success in the short time that the approach has been around (most vendors, except Teradata, are this decade's startups).

Column databases are challenged by true analytics - their effectiveness wanes as increasing number of colums are pulled into the query. Requesting client information is not analytics. True real-time analytics requires far more complexity and chips away at the benefits of column databases.

Posted by: David N | May 27, 2009 7:48 AM

Report this Comment


In "Summa Totius Logicae", Ockham stated "Frustra fit per plura quod potest fieri per pauciora" which translates to "It is futile to do with more things that can be done with fewer."

Posted by: Michael M | May 27, 2009 6:12 AM

Report this Comment


It would be nice to see more support for the claims made. In my opinion, the hardcore of analytics lies on aggregated data at various grains. It would be a strong advantage if columnar databases could outperform relational databases in this competition.

It was an interesting point about asking questions. As a part of my PhD I was trying to find a balance between natural languages and the strict formalism of artificial languages. Eventually, any question applies an attribute to a concept. An answer to a question is always a functional dependency that tries to obtain a value.

You can find more about this model in my book at Amazon ("Operational Product Performance ACCEL: Analytical Approach Towards Learning How to Design Successful Products")

Maxim 4suc6

Posted by: Maxim I | May 27, 2009 5:31 AM

Report this Comment


I think an even more radical approach to database architecture and storage is needed.

I have 25 years experience with Oracle starting with Oracle 5.

I am presently developing applications with LazySoftware's Sentences Associational DBMS lazysoft.com. It is far easier to use than relational technology and provides excellent performance.

Posted by: Grant Morgan C | May 27, 2009 3:25 AM

Report this Comment


Way way back in 1970, APL was a popular (geeky) language for we mathematicians.

One of the workspaces dealt with table transformations. where the i rows by j columns was converted to a table of j columns with i row entries.

Research into this table, using integer conversion of ascii data was phenomenal in terms of speed.

So there is nothing new here, except that inserts take a long time in the columner arrays vs row inserts.

It was back in 1970, before Oracle came on the scene.

Posted by: lsatenstein | May 26, 2009 9:03 PM

Report this Comment


What about MonetDB? I've had it running four years for some specialty apps at a university and am not sure it has ever been rebooted of failed - I am sure it has never been tuned. My humble opinion is it belongs in the list of established CO databases - especially as IQ is still burdened with the biggest marketing blunder since new coke* and Stonebreaker (Vertica) is a really brilliant scientist who has contributed a lot but cant seem to market squat

*should never have been branded so strongly as Sybase - POS (plain ole Sybase) is a great product but was really on the corporate outs when IQ was purchased. IQ could have changed the world but instead started a cult.

Posted by: joe b | May 26, 2009 7:28 PM

Report this Comment


I'd like to point those interested in Evan's column to a different point of view in David Raab's piece in the latest issue of Information Management. I found it interesting in that David is not so much concerned with the DBA aspect as much as he is in the needs of the marketing functions who in this example access it. Click on our June issue cover on the homepage to scroll to David's article. We've also had some lively DB discussions lately on DM Radio, this one with some very sharp people in the mix including Forrester's Boris Evelson: http://www.information-management.com/dmradio/-10015384-1.html -ed

Posted by: jericson | May 26, 2009 7:13 PM

Report this Comment


Measuring an OLAP engine by OLTP criteria is misleading at best and may distroy inventive initiative at worst.

I have used a proprietary application from Hilbert Technologies, Inc., which employed the columar db concept. The resulting array dbs (another name for columar dbs) where made even more efficient by using text-to-integer transforms. This technique put the text data into the native (integer) language of the computer (ASCII data is intrinsically costly.)

The columns being integer arrays were manipulated via memory mapped files on a PC (could be other platforms by now) and resulted in retrivals, compares and merges of multiple databases (millions of records) thousands of times faster than any hierachical or relational db engine. Skip analytics for a moment - did I hear someone say Data Cleansing?

The fact that we are having this friendly discussion proves that db industry momentum is a burden to progress.

But then I could be wrong...

Posted by: David O | May 26, 2009 6:37 PM

Report this Comment


It seems to me that the answer also depends on what you call "analytics". In the example above, just pulling a name and address doesn't seem like a good example of a complex analytic question, which would be both i/o and cpu intensive.

The relative performance of columnar/row stores would probably vary a lot depending on the type of question being asked (complexity of predicates, etc.), the analytic schema, etc., and in that sense, it seems hard to generalize about the "best" approach for "analytics".

Maybe exadata/db machine, which cuts i/o by performing much of the projection on the storage level, is the best of both?

See also: Read-Optimized Databases, In Depth, by Allison L. Holloway and David J. DeWitt:

http://pages.cs.wisc.edu/~ahollowa/paper377.pdf

Posted by: Kolin O | May 26, 2009 5:40 PM

Report this Comment


I can verify this much, I did the comparison on my laptop with Oracle 10.2 and Sybase IQ. 12.7, running the same query against an identical set of data, where IQ was column-based, Oracle is row-based, 1.2 M rows.

Results: Oracle: 42 seconds, Sybase IQ: 13 seconds.

I am a certified Oracle DBA having specialized in performance tuning both databases and SQL queries, so this kind of changed my thinking about 2 years ago.

Sybase IQ has made some significant strides since then, significantly lowering the entry price to IQ so that it makes a very compelling alternative to smaller organizations. If only their marketing was as good as their technology, they would be a much bigger player in the market.

Evan, who are the worthy open-source players in this market?

Posted by: Ferenc M | May 26, 2009 5:38 PM

Report this Comment


You've missed a whole generation of column based databases, think M204, think Sand Technologies and their DOD funded Nucleus database...and if you want to think about marketing segmentation, the modelling principle is more akin to a star schema, here start with AT

Posted by: Chris Day Big Data - Done Right | May 26, 2009 4:33 PM

Report this Comment


In answer to Peter, the columnar db should perform better on I/O because many values (rows) will be stored in the equivalent of a physical page, and in the analytics context you would normally be reading many rows at a time, not a single row via a primary key. However, I quite agree that the RDBMS didn't win on speed (although the latest TPC benchmarks have DB2 clocking a whopping 6 million transactions a minute, which isn't exactly slow). The concurrency and content-based referential integrity built into the RDBMS model make it enormously powerful for high-volume operational systems, and it can run analytical processes pretty fast as well. The columnar db might have some speed advantages in analytics but I don't think it has any OLTP capability at all (I shudder to think of the locking implications of a big update!), which makes the ROI much poorer than for the all-purpose RDBMS (although to be fair, big companies often have warehouses and analytic engines built on separate instances if not totally different products).

It is also not correct to say that the whole row has to be retrieved with every query from an RDBMS. Most of them will return data from an index-only read if the index covers all the columns required by the query (so no data page physical I/O is necessary). So if you create indexes for every column in a dimension (for example), you will end up with data structures very similar to those in a columnar db

Posted by: Caroline B | May 26, 2009 3:32 PM

Report this Comment


How is the "columns" based database any different from the CJ Date's description of tubes? The principles of getting/accessible only the data you need is indeed a core concept of an RDBMS. Projection is a core component of SQL. A table is a collection of related tubes (the common misconception of the word relations as being between tables is of course - a misconception).

I also fail to see where the speed increase from column only objects is found. With a traditional DB, all my columns are in one block. Accessing a single record by primary key is a matter of 2-4 IOs depending on how big my index tree is and regardless of getting 1 or 50 columns. With column based, you duplicate this access path per key per column? How can that ever be any faster?

If we want to improve the conceptual speed of databases, we should abandon the relational principle and just go back to the old network or hierarchical database designs. A lot less overhead and much faster to search. The point being, that relational databases didn't win because of their performance speed by their dynamics.

Posted by: Peter L | May 26, 2009 3:08 PM

Report this Comment


From what little I know about Occam's Razor, it says if you have two theories that predict the same results, the simpler theory is better.

I think Evan has extended it to say if you have two solutions that produce the same results, the simpler is better. I like the analogy.

So not sure about the notion of only one being correct if they're both predicting the same results?

On a different note, how does columnar stack up to MPP?

Posted by: Terri_Rylander | May 26, 2009 2:12 PM

Report this Comment


I of course would never argue with Evan, but I would like to point out that his description of Occam's Razor is a common misconception. What Occam really said was to "favor" the simplest hypothesis, not that it was necessarily correct. This was actually a thinly veiled attack on the Church whose teachings were at variance with observation.

Not unlike our industry.

-Neil Raden

Posted by: Neil R | May 26, 2009 1:45 PM

Report this Comment

Add Your Comments...

Already Registered?

If you have already registered to Information Management, please use the form below to login. When completed you will immeditely be directed to post a comment.

Forgot your password?

Not Registered?

You must be registered to post a comment. Click here to register.

Blog Archive

Performance Measurement – Common Causes/Risk, Skill and Luck, Part 1

Wouldn't it be nice if business intelligence could generally distinguish common cause/market risk from skill components when assessing business performance?

The ROI of High-Quality Decision Support Infrastructure

I’m developing a return on investment calculator for data warehousing appliances, using the Forrester Total Economic Impact methodology

Who Let the Dollars Out?

2010 has already delivered a list of top acquisitions; drivers include maturity, customer acquisition and cross-selling, but the confusion now falls to buyers

IBM Initiates Deeper Healthcare Focus by Acquiring MDM from Initiate Systems

IBM has indicated their focus on using Initiate as a lever into healthcare and government industries and clearly will need to rationalize their technology portfolio

Ode to Initiate Systems

The rumors of IBM’s acquisition of MDM vendor Initiate Systems were confirmed in an IBM press release. Herewith, Jill pays homage to a class act.

800lb MDM Gorilla Obviously Not Looking for Weight-Loss Plan: IBM Acquires Initiate Systems

You wouldn’t know it following this week’s activity, but there are more than two MDM vendors in this ever-changing market. What does this massive disruption mean?

Big Brother For Hire

With CIA profilers reportedly moonlighting to monitor earnings calls for "deception detection," speech analytics vendors are likely licking their chops for what comes next

A New Decade and Technology Charter for Business and IT Starts in 2010

The year is underway and the optimism about using technology for business has a new decade mentality

Performance Measurement

Performance measurement is a key driver of business intelligence. I'd guess that most BI deployments today are at least in part supporting PM

Informatica is a New Master in MDM with Siperian Acquisition

Now, Informatica can advance the use of MDM within many areas of their business from data governance, cloud computing and business-to-business processes

The Good News about MDM Market Consolidation

I usually root for the little guy. So when I heard the rumor that there were multiple bidders for Siperian, I figured SAP would take it home and that it really was a three-vendor world

Pros and Cons of Using a Vendor Provided Analytical Data Model in Your BI Implementation

Some pros and cons of implementing a vendor provided analytical logical data model at the start of any business intelligence or data warehousing initiative

Introducing The MDM Market’s Newest 800lb Gorilla: Informatica Acquires Siperian!

Look out IBM, Oracle and SAP — you’re about to lose a bit of your dominance in the master data management market to Informatica

BI in the Cloud? Yes, and on the Ground, Too

Although I do not see a stampede to replace traditional BI applications with SaaS alternatives in the near future, BI SaaS does have a few legitimate use cases even today

Data as Commerce

Just like iTunes, granular Web data services are just beginning to sprout into a new transactional data economy

Mark Logic Makes Information Applications a Simple Reality

The market for information applications is heating up and the demand is coming from line-of-business and vertical industries

Ruby Monday – Part 3

After writing two blogs on the delights of the agile language Ruby, I decided to eat my own dog food and challenge myself to convert a significant piece of my portfolio returns simulation program from the statistical package R to Ruby

IBM Lotus – Are They Ready for Your Business Collaboration Needs?

Let’s do a deep dive and discuss the current business and market environment and impact to IBM efforts in collaboration

The Quality Gap: Why Being On-Time Isn’t Enough

Jill advocates behavior changes, starting at the top. But not for you. No, you’re fine. Just for other people

Ruby Monday – Part 2

Today's blog embellishes on Ruby's suitability for basic data management tasks that precede business intelligence and analytics

Oracle Finds Silver in the Creek of Information Management

This industry activity validates an automated approach to discovering and matching data as part of organization's information management processes

New Year Technology Predictions Could Ruin Your Business

We need to be more pragmatic about technology predictions and focus on what we need to do to improve in iterative time cycles

Hedge Like the Vendors Do

Deconstruct your daily tech briefing, find your self-interest and use last week’s HP/Microsoft news as a learning experience

How Much Intelligence Can You Pack Into a Tweet?

The intelligence value of any individual tweet in isolation is negligible. Then again, what’s the intrinsic value of a single attribute-value pair in a relational database?

Mining the Cloud? Are You Allowed?

I came across an article with the headline “Mining the Cloud.” The cynic in me immediately issued a silent scoff: How is that different from “crawling the Web”?

Next Best Models: The Process Agility Equation

You never know what’s coming at you next, which is why process agility is so important

Standardizing Data Migration

Maybe IT should establish a functional team that’s responsible for data packaging and distribution, just like the movie industry

Razor, Blades and 10x: Key to Value-Based Pricing in the Data Warehousing Appliance Market

Price-performance is everything in data warehousing (DW), and it’s become the leading battleground for competitive differentiation

Next-Gen Jobs

Parents want to see their kids with a happy and successful career, but these days there's not a lot of useful advice

Ruby Monday – Part 1

Dusting off a favorite open source tool, the programming language Ruby

Process Mining: Because Your Company’s Workflow Issues Aren’t Always Obvious

The more complex business processes are the more difficult it is to find the magic blend that distinguishes a well-tuned process from a miserable flop

Bedford Clicks Authoria Up a Notch of Talent with Peopleclick

Let’s see if 2010 is the year of the workforce and if Peopleclick can help organizations truly invest into the most valuable asset

A Last Look At The Naughty and the Nice

Before we shift our focus completely to 2010, it’s worth a last look back to consider all that was learned in 2009

Oracle Finally Takes Control Of Its Data Quality Destiny With Acquisition Of Silver Creek Systems For Product Lifecycle Support

Silver Creek is the only data quality vendor at this point that we feel has sufficiently addressed the unique and complex challenges associated with product data quality

Conversation as a Complex Event

Your customers are spending more time — and venting more of their true feelings — in social networks, so it only makes sense to move customer-facing operations into this brave new world

Social Network Analysis: The Fuse Igniting Enterprise Data Warehouse Growth. It’s Planet Petabyte or Bust!

Social networks have always been with us, of course, but now they’ve gained concrete reality in the online fabric of modern life

The Movable IT Budget

Not that it will change overnight, but an annual budgeting cycle for IT spending might not stand the test of time

The New Decade of Advanced Analytics: Roll Over Rocket Scientists!

The core problem with today’s CEP offerings is that many of them are power tools, not solutions suitable for the mass business market

Whither Data Warehousing in the Teens?

In data warehousing, the most likely casualty of the Teens will be the very notion of a data warehouse

Searching and Planning for the New Year

This searching-planning compromise lens now colors my assessment of business strategy and intelligence

Data Center Lessons from AVATAR

Weta Digital’s Paul Gunn runs a state of the art facility but doesn’t play by all the new rules

Starview Advances Operational Intelligence with Analytics in Distributed CEP

I expect this new category of technology to be like business intelligence in the 90s, where the range of vendors and opportunity will with significant demand

Written Review

My writing retreat is completed for 2009. I'm not sure whether I'm comforted or humbled

IBM Will Find Business Process Optimization with Acquisition of Lombardi Software

The unique element of Lombardi was their ability to help organizations gain visibility through their process models that can easily measure activities and business flows across the enterprise

BI Reports, Data Quality and the Dreaded Design Review

Everyone’s talking about single version of the truth, but how often are our reports reviewed for accuracy?

A Spot Market for Computing

Amazon’s 12/14 launch of Spot Instances for EC2 is something we might want to mark on our calendar as a minor historic date in computing

Monte Carlo, Resampling and BI – Part 2

I recently downloaded the latest stock portfolio returns from the website of Dartmouth professor Ken French

The End of a Tough Year

It is usual (and customary), during the holiday season, to give thanks for the blessing we have. I thought a short IT list would be sufficient

Actuate Brings Innovation to Performance Management

Advancements in business applications for performance management do not come very often in the industry but Actuate has one that we should all pay notice

Advanced Analytics Predictions For 2010

Here are some quick thoughts on the trends that will shape advanced analytics in the year to come

Wisdom That Isn't

If you're going to tackle a new center of excellence or competency center, at some point you're going to have to throw out the analyst playbook

Monte Carlo, Resampling and BI – Part 1

About five years ago, a Newsweek review of the then new iPod raised a concern that the shuffle feature might not be random

SAP Broadcasts New Enterprise Software and Applications Strategy

SAP made it clear that they would like to eliminate the three letter acronym soup of the past and focus on the line of business needs by vertical industry

Data Governance Moves Upstream

Don't sweat data governance, make it a measure of political maturity to challenge your peers and look for tools to help your people processes

Social Network Analysis: Going to Become Too Ubiquitous for Its Own Good

From an enterprise perspective, social networks are the “buzz” that can spell the difference between success and failure in a reputation-driven online economy

Cloud9 Analytics Make Sales Pipelines Deliver Results

Sales opportunities are continuously updated but the impact of those changes to the sales forecast and overall financial health of the organization is not well understood

A Volleyball Championship and BI

Because a truly randomized tournament is both impractical and unattractive, officials must grapple with quasi-experimental methods and designs that attempt to control bias, all within a limited budget

Master Data Management Is Foundational

Yes, Virginia, there is a Santa Claus and we are rapidly finding that this is getting the information into the hands of those who can use it

Plateau Provides a Suite for Workforces to Become Best Talent

The main challenge and opportunity for Plateau is to demonstrate the value of their integrated suite of applications and convince, not just HR and training professionals, but finance and operations management the value of adopting their approach

You Build it, You Break It, You Fix It: Why Applications Must Be Responsible for Data Quality

When it comes to bad data, a lot of the problem stems from companies letting their developers off the hook

Splunk Expands IT Search into Information Applications

A technology vendor named Splunk has come to market with a solution to index volumes of log files and provide search and display capabilities along with a new generation of interactivity and application capabilities

Not All Mobile BI Applications Are Created Equal

Beyond simple delivery of alerts, URLs or actual reports via email here are a few newer approaches to deliver business intelligence on a mobile device

Discovery-Driven Planning – A Planning-Searching Compromise?

I like the concept of discovery-driven planning a lot. It seems a reasonable compromise between pure planning and searching

More on the Statistical Revolution – a SAS Story (continued)

It's probably happenstance that statistical juggernaut SAS has been in the news several times over the last week following my recent blog. But like me, the press hasn't exactly been favorable, adding credence to the claim that SAS is now facing serious threats on several competitive fronts

Instrumenting Your Enterprise for Maximum Predictive Power

Business is all about placing bets and knowing if the odds are in your favor

Salesforce Chatter Brings Social Collaboration and Media into Business

Salesforce steps into the business and social collaboration market where most would not expect

SOA Implies Understanding Patterns

Building a service-oriented architecture implies the understanding of the business and the use of patterns - because SOA is really about architecting for reuse

MDM: Build or Buy?

The point here is whether you choose to build or buy your MDM solution, know what you need, and what you don’t need. I know that sounds obvious. But obviously, it’s not

Informatica Evangelizes its Increased Business Relevance with Latest Release

Informatica 9 continues to evolve PowerCenter and PowerExchange data integration and access technologies into a much more comprehensive data management platform going well beyond the scope of traditional, batch-oriented ETL that remains Informatica’s bread and butter

IBM Elevates MDM and PIM by Integrating Content and Analytics

Gaining better consistency and quality of information is what IT needs to be addressing more than consolidating BI as the underlying ingredient as your organization's most valuable asset

Clouds: Both Sides Now

With my inbox groaning under overload, two themes have emerged: some will watch the establishment evolve (Goldman Sachs has picked the winners already); and others will simply go for it

Highly Irritating Management Gurus

Business intelligence practitioners can analyze, critique and debunk the works of popular management gurus till the cows come home, but the demand for expert wisdom grows unabated

What Makes BI SaaS Architecture Multitenant?

A key critical point to "what differentiates one business intelligence software as a service vendor from another" discussion is what really constitutes multitenant architecture. Here are some initial thoughts to stimulate the discussion

Next Gen BI Is Here Today

I picked these 20 next generation business intelligence trends, not because the market labels them as trendy, but because I have tons of evidence that the buy-side o f the BI market can use these technologies today

How To Differentiate Advanced Data Visualization Solutions

Start looking at advanced features to select a data visualization vendor/solution

Informatica Brings Business and IT Together for Your Data in Version 9

Informatica has advanced their efforts with support for business and IT collaboration, pervasive data quality and SOA-based data services

My First Windows 7 Moment

It might have been software or an act of providence, but it appears that Windows 7 saved my job

Kronos Guides Workforce Software Across Generations

Kronos introduced the next generation of workforce management that will dramatically improve the usability of its new releases next year

Ten Strong Hints Your Enterprise May Not Have a BI Strategy

You know that you don't have an enterprise BI strategy if ...

Statistical Revolution

Statistical and predictive analytics software is fast becoming a next big competitive landscape in the business intelligence market. Heightened competition will be a boon for statistical consumers of BI

SAP Promotes Value of EPM and GRC at SAP Reporting Conference

The SAP enterprise performance management and governance, risk and compliance solution is a refreshing approach that organizations should review if they have not

A SAS Story

I downloaded a 30 day trial of the World Programming System and started constructing simple scripts. It was like seeing an ex after eight years of separation

From Search to Explore

Information on demand depends on a growing base of content assets and services and the cut-to-chase script looks increasingly agnostic

IBM Mashes Information and Analytics to Support Information Accessibility

IBM announced a new version of IBM Mashup Center that will simplify the assembly and publishing of information across a workforce and across the Internet for consumers, constituents and customers

SOA Projects Are Moving Again

SOA is not a technology but an architectural paradigm and it is on the move again

IBM Fuses New Generation of Analytics for Deeper Business Optimization

IBM largest challenge with their software investments will be to continue to simplify them and connect with those in line of business and ensure they can get smarter solutions in the right time frame at the right price

IBM Doubles Down Again on Information Agenda at Annual Conference

At IBM Information on Demand in Las Vegas, the conference started today to introduce the latest insight to the fastest growing components of their business with analytics

Process-Centric Data Quality Services and MDM Among Top Technology Trends to Watch

The business expects high quality data, but hasn’t taken much accountability in delivering it

OpenCourseWare and BI – An Update

I revisited MITOpenCourseWare a few months ago in search of a gentle computer programming course I could recommend to those interested in BI without a strong technology background

Free Ham for $6 Per Pound (and Other Oracle Marketing Ploys)

With all the press releases emitted from OpenWorld, Larry Ellison’s $10 million IBM challenge stands out

Teradata Steps Up Version 13 into Cloud Computing and New Appliance

Teradata announced their advancement into cloud computing called Teradata Enterprise Analytics Cloud, providing a range of options for organizations to use the database technology

The Demise of the 2009 Boston Red Sox: Super-Crunching Takes a Drunkard's Walk

Since 2004, when the Red Sox rallied from three games down to beat the Yankees for the American League championship and swept the Cardinals in the World Series, the Red Sox have been New England's academic darlings, “champions” of the data-driven, predictive analytics approach to running a baseball team

Larry Ellison Stumps Oracle Exadata and Fusion Applications with California Governor

In this year’s keynote Ellison briefly discussed Linux and their efforts to drive neutrality in operating systems though they will potentially own Sun Solaris and open source version of Solaris by their acquisition of Sun Microsystems

The API is the New Network

The growing glut of Web services will increasingly flow not through browsers, but through application programming interfaces

Immature Planning Processes on Display at Oracle OpenWorld

When it comes to planning and budgeting, the gap between what’s possible and what companies actually do is still wide

Oracle Complex Event Processing Advances Operational Intelligence

At Oracle OpenWorld the focus on Oracle Complex Event Processing was quietly demonstrated in educational sessions how this dedicated technology is helping organizations

Oracle OpenWorld Sunday Night Opener Highlights Technology Game

Better than the massive defeats by the San Francisco 49ers and Oakland Raider football games in the bay area on Sunday was the opening keynote at Oracle Open World. Silicon Valley legends Larry Ellison and Scott McNealy opened up the evening

Fall 2009 OpenBI College Recruiting: Where Have All the Programmers Gone?

The demand for new college grads is still soft. This cycle reminds me a lot of 2000, when the high-flying technology consulting career choice crashed and burned, no doubt mirroring the same fate of the Internet boom

Service-Oriented Analytics: Tapping into the Predictive Smarts of Your Entire Organization

To become truly future-focused, organizations must build out their predictive muscles through deepening commitment to these and other advanced analytics technologies

Metator, Librarian, Gatekeeper, Broker

The information function in the organization is a network of the people who really know what’s going on; “They’re not organized or rewarded, yet they’re running the damn place”

Liaise Cures Business Interaction Chaos with Microsoft Outlook

It is not often that there are technologies that make you rethink your current methods of how you work and the tasks that you do constantly

Improving BI Development Efficiency: Standard Data Extracts

For every operational system, a company can save hundreds or even thousands of hours every week in development and processing time

HR Tech Brings New Products but HR Budgets May Limit Potential

At this year’s HR Tech in Chicago a broad range of new software products for human resources professional to address their people-related processes across a range of activities were available for HR and associated IT professionals

The October 2009 HBR and BI – Risk Management and Performance Measurement

The October 2009 Harvard Business Review spotlights Risk Management and Performance Measurement, both significant consumers of BI

CIO’s Have a Challenging Mission

It is not uncommon to expect with the downturn in the economy, that CIOs would be challenged to do more with less. IT executives are focusing on ensuring that business is conducted efficiently to get more mileage out of their budgets

Help Wanted: Metator

As data and projects become collaborative, a metadata editor sounds like someone you’d like to have, and maybe wouldn't have to hire

IT Analyst Firms Continue Confusion on GRC

Be forewarned that as you read analyst reports on GRC that they are a small component of what you will need to address a range of needs in the enterprise and across business and IT

The Netflix Prize

The other day, I came across an article in the Wall Street Journal noting that movie rental company Netflix had announced a winner of the $1 million contest to improve the accuracy of its film recommendation engine

SuccessFactors and Business Execution Software: Confusing or Confused?

It is up to SuccessFactors to see if they can make something of this new category and focus in business execution and what other vendors might join them

The Death of Business News Reporting

The six words above are misleading, I’m really trying to get your attention to tell you about something about business intelligence, so c’mon, click it

Need For A Health Care Standard for EHR and EMR

There is a need to define a standard quickly for data interchange with regards to the electronic health record and the electronic medical record

All Business Analysts are Not Created Equal

The majority of data issues within an operational system are data entry-related. The challenge in business analysis is to establish standard business processes to automate

Smart Analytics – The Rising IBM Tide

Having IBM seriously promote an analytics product line will help all statistics/mining vendors and provide a much-desired commercial jolt for academically-focused R

BI SaaS Vendors Are Not Created Equal

I am, slowly but surely, beginning to believe there couldn’t be a better case for demand for business intelligence software as a service -- especially after findings from a project that I am currently conducting

Resolving Content and BI

Experts say search, information extraction and text analytics can close the structured/unstructured divide; can it really be that easy?

Vertica Advances Analytics through Sophistication and Simplification

The growing volumes of data from the Internet and enterprise placing pressure to gain better insights on a more frequent basis is an issue that continues to be for business and IT

Dear Steve, Dear Eric, Dear Steve – Musings on Predictive Analytics World

I received a very nice note from Eric Siegel, PhD., Conference Chair, Predictive Analytics World last week in response to my recent blog that mentioned an upcoming PAW

Process Data Management: Like Your Brain And Your Heart, BPM and MDM Can’t Survive Independently

It's an age old question: Which came first: the data or the process? Okay, not an age old question, but an interesting one to ponder nonetheless

In Sickness and in Health Care

Blue-sky challenges and benefits of health care data management will touch individuals and economies

Synygy Simplifies Mobility of Sales Performance Management

Synygy's new set of capabilities will help organization address the sales and revenue performance priorities and drive new levels of operational efficiency in sales

HP and Informatica’s Expanded Relationship: Portent of Bigger Deals to Come?

This relationship long predates this announcement, but it could be the start of a relationship of considerable strategic importance to both partners

Intelligence Interruptus

I've assembled my BI-Searchers toolkit

Head of BI Job Description

While Forrester does not have a formal description for a head of business intelligence, if I map requirements for BI best practices, here’s what I come up with

Informatica Acquires Agent Logic and Enters Operational Intelligence Market

Logically this acquisition expands Informatica beyond data integration into event integration

Good Data Warehouse DBAs are Hard to Find

As a consultant I’m often asked about how roles and responsibilities should be delegated or identified within the IT organization to support the data warehousing

myDials Optimizing Operational Performance Like No Other

It has been some time since I advanced the definition and focus of operational performance management over eight years ago

Random Business Performance

There's a randomness to company performance that, as deftly chronicled by Leonard Mlodinow in The Drunkard's Walk, is a much more powerful force than we'd like to believe

Health Care Is Going Through an Information Led Transformation

I believe the massive stimulus package signed by President Obama in early 2009 was a catalyst to a situation that was already fraught with issues – the integration of information in health care

Destroy the Web Site and Build Your Business on the Internet

Now as an entire business needs to operate on the Internet the responsibility for business and IT executive’s leadership and involvement is essential for success

Is BAM Relevant In The Age Of Lean Processes?

In any dynamic business environment, the last thing you want is to indulge in navel-gazing. If you’re not adept at responding to breaking events or anticipating the future, you will find yourself marginalized in the new economy

Perfect Data and Other Data Quality Myths

We need to understand the variance between the data as it exists and its acceptability, not its perfection

Searching and Experimenting for Business Innovation

The rapid development of technology and expansion of the Web provide the infrastructure for promoting ever-more-rapid innovation. But what's behind this technology enablement are two themes

Resuscitating Your Dying Metadata Strategy

Here’s a question to ponder: Why do so many metadata initiatives deliver well below expectations, or outright fail?

eThority Provides Intuitive Analytics and Insights for Business

The unique aspects of eThority are the usability and interactivity, which makes their analytics more intuitive and relevant than most business intelligence products

M-Factor Provides Analytics and Planning for Trade Spending and Pricing Optimization

Consumer packaged goods companies spend enormous of time and money to optimize their penetration of brands within their relevant product categories

The Drunkard's BI – Part II

The scary part for decision-making is that the errors are both deductive - from population to sample - and inductive, from sample to population

MDM Data Quality as a Process

Weyerhaeuser keeps records up to the minute with business rules to address quality at the point of data entry

MDM: Subject-Area Data Integration

It’s important to realize that mastering data isn’t really necessary if you only have a single system that contains one copy of data

SAP and TIBCO? Too Juicy Of A BI Rumor Not To Comment On

The business intelligence implications of the rumored SAP/TIBCO merger are huge

Information Applications: New Generation of Information Technologies

The technology for information applications will continue to evolve and become one of the fastest growing software categories

Does Your Supply Chain Have Location Intelligence?

Most manufacturing and services organizations have driven their supply chain processes to be lean and are now looking at the means to find new methods to gain incremental efficiency

The Drunkard's BI – Part I

I read what is certain to be my favorite BI book of 2009: 'The Drunkard's Walk, How Randomness Rules Our Lives'

From Analytical to Transactional MDM

Cisco Systems aims at operational insight that looks at the customer experience across multiple transaction platforms

A Look at the "Nexus of BPM and Business Intelligence"

One of the more common phrases in corporate life is: 'Close the loop.' This mantra refers to completing an improvement cycle by putting into play the insights gained by analytical processes

Field Experiments and BI – Part I

Much as I love the behavioral economics gospel espoused in Dan Ariely, I'm uncomfortable with the driving methodology of controlled classroom or laboratory experiments that use MIT business graduate students as subjects

PIM for Partners

Polycom is hoping to use product information management to shorten sales cycles in a complex, regulated global marketplace

IBM Goes Deeply Predictive, Announces Acquisition of SPSS

IBM’s bold move has already sent shockwaves throughout the analytics market

The Next Wave of BI Acquisitions?

Whoever says that business intelligence market is commoditizing is smoking something funny

IBM Boldly Elevates Analytics with Acquisition of SPSS

IBM now can empower new classes of analytic solutions with SPSS that go well beyond the traditional business intelligence applications that focus only on historical data

Potpourri Redux

A discussion about the direction of business intelligence in several fields, including economics, statistical learning and the R platform

Oracle Targets Real-Time Data Integration With GoldenGate Acquisition

This acquisition will focus on two specific areas: high availability to support data replication and migration efforts and noninvasive real-time data integration capabilities

Jobs, Process and Metrics

As business process and infrastructure outsourcing continues to trickle up in organizations, internal process owners could emerge to manage core business challenges

And Now For Something Completely Different

What really impacts our business/technology careers falls into a scope greater than the sum of our expertise and skills; for those not born to a single working destiny, follow the macro along with the micro

Planners, Searchers and BI

I think the planner/searcher dichotomy is quite pertinent for business intelligence

Architecture Makes Decisions Easier

When I previously referred to architecture, I implied an engineered approach to solving a complex, multifaceted issue. One issue has to do with making complex decisions about inaccessible information

Actuate Steps Forward to Power Information Applications

Actuate must take some credit for their work to date and be aggressive in what they can help organizations deploy easily with their existing development and technology skills

Repurposing Your Data Warehouse Platform—Not!

I’ve noticed lately that data warehouse vendors are dusting off the arguments and pitches of days gone by

BI, Analytics, and CEP: Some Fruitful Potential Follow-Ons from Software AG’s Acquisition of IDS Scheer

Bet you didn’t realize that IDS Scheer has ARIS solutions in the fast growing markets of BI, analytics and complex event processing

Tricks of the Data Trade – a Blueprint for Success

Many business and IT professionals will attest to the fact that project management is the key to successful data implementations, although few data analysts will agree

Economics and BI, Part 2

Just what is behavioral economics?

Job Uncertainty Times Two

A weak job market and poorly identified roles are making life dodgy for IT information workers; don’t expect human resources to help, find the competency center and align yourself to it

CIO Strategies Require Cloud Computing

IT needs to fully engage in IT performance management and document their strategies and scenarios to determine the best effective alignment for business

Master Data and the Need for Management, Part 2

Master Data Management and Data Governance have common goals

Master Data and the Need for Management

The complexity of IT systems has caused IT to become reactive rather than proactive

Is CRM Technology a Legacy or Innovative Investment?

Do your existing CRM investments work well and, if you replace them, will they just be a legacy investment?

Big Brains, Soft Skills

As information career paths disintegrate and reform, the convergence of business and technology finds people creating their own jobs and the 'smartest' among us may be the most challenged

Economics and BI – Part 1

What good are economists anyway?

Different People Need Different Info…

Different business audiences require varying intelligence needs and focus

Sun and Oracle:

The recent acquisition of Sun by Oracle has raised a lot of speculative discussion about the latter vendor’s strategic pursuits

CDI: Rubber Meets Road

At MDM Summit in Toronto, random pearls of wisdom from Ed Unrau at Canadian Tire Financial Services

Bogle for BI

I periodically revisit Bogle's wisdom not only to affirm my investment approach but also as a guide for BI

BI SaaS Vendors Struggle to Survive and Some Do Not

The market for BI might seem rosy for most in the industry. While some analyst firms predict significant growth, most of us who have been in the industry for decades know a different reality

What Really Is Business Intelligence?

A corporation loses money every time it delays getting information into the hands of decision-makers

Intelligent Education

BI can learn a lot from education and other not-for-profit programs for its mission to assess the performance of business

Are Your Customer Metrics and Business Intelligence Good Enough?

As I have dug into improving customer satisfaction, the question I ask myself is that although this seems like a laudable target, is it really?

Will Google Gobble Web Analytics?

The short answer is yes and it's surprising the titan of search hasn't done so already

Information Builders Rocks Nashville with Business Intelligence and More

At the Information Builders 2009 Summit in Nashville music and technology came together to present innovation and practical use of BI and information-enabling technologies

The Hawthorne Effect – or Not

I was pretty distraught over a column I recently read in the Economist. The article, Light Work, challenges the legacy of the Hawthorne effect

MicroStrategy’s Big Picture

With official release of free reporting suite, the BI vendor offers a clever reverse consolidation path for customers

BI Mashup Maturity Model? Oxymoron? Au Contraire Mon Frère!

Forrester has developed a maturity model for enterprise adoption of mashup-style, self-service development of business intelligence applications

No Data Warehouse Required: BI Reporting Extends Its Reach

The dirty little secret in most companies is that the BI reporting team has morphed into a de-facto enterprise reporting team

IT Should be an Enabler to the Business

Let’s start with my basic opinion: IT is not the business, it is an enabler

Analytical Designs for BI – Part 2

Quasi-experimental studies attempt to control potential confounding variables by clever design techniques and statistical adjustment

Coca-Cola Goes Freestyle

RFID-powered vending machines that let customers mix their own water, juices and sodas hoped to provide data for new product introductions

SPSS: Analytics Force in Customer Experience Management

Behind a fairly generic name is SPSS's new version of their enterprise feedback management technology, which in my opinion is one of the key supporting technologies providing customer feedback management

Pitney Bowes Introduces Portfolio of Business Insights Technology

After a day behind closed doors with executives I got a deeper appreciate for the portfolio of technologies today and in the future

Analytical Designs for BI – Part 1

I must admit I'm obsessed with BI designs for business performance measurement

When Will Eco Apps Go Mainstream?

Examples of transaction and performance management based systems to manage Eco Footprint are showing up in Europe; vendors will be ready to deliver products stateside as regulations call for

Database Religions Dissolve Into The Big Billowing Virtual Data Cloud

Virtualization is a venerable old computing concept that has achieved new life in recent years

Convergence of SOA and BI

The IT community has sensed a need for consistent terms. I propose we start a process to keep us all sane and volunteer to head the IEEE, ANSI or other committee

Social Science Statistics Blog and BI – Nonrandomized Experiments

I came across an intriguing blog entry that raised issues pertinent to business and business intelligence

Things I Need to Do My Job

The tyranny of technology moves slowly and leaves the best bits behind for a longer run than people appreciate

Information Post-Discovery - Latest BI Trend

This is yet another proof – and I’ll never get tired of saying this – that BI market is as vibrant, exciting and far from commoditization as ever

Quantitative Social Science and BI

I find the intersection of the social and quantitative sciences of particular interest – and pertinence for BI

BI Search is Back (and Better)

Molson Coors has been testing SAP’s Business Objects Explorer release and they really like it

MDM and M&A

Corporate restructuring isn’t just a financial challenge. It includes realignment of marketing activities, sales and operational issues

Bayes and BI

The current ascendance of Bayesian analysis in the statistical world is, I believe, a boon for BI

Imagine How the Business Feels

A service-based architecture requires information abstraction using metadata, whereas Web services do not use common metadata

Bundled BI Bargains

Have a closer look at how consolidation and product maturity are paying off now for businesses with stalled IT budgets; you may already be a winner

Self-Service Business Intelligence Depends On Automated Data Discovery

The ball’s in IBM’s rivals’ courts regarding whether, when and how they plan to add automated source discovery to their BI portfolios

OpenBI Travels II: R/Finance 2009, Chicago

I received an email from Mike Driscoll, co-chair of the Bay Area R Users Group, announcing the formation of three new groups – Los Angeles, New York and Ottawa

AARP Says I’m Special

Just another data quality failure from the annals of direct marketing, or am I missing the point?

Confusing Business

We in IT struggle with understanding the business – or is it the business that struggles with explaining themselves (and their needs or requirements) to IT

Missing Out due to Travel Restrictions?

Corporate travel budgets have been slashed, but what are the long-term effects of less opportunity

OpenBI Travels I: MySQL Conference and the Oracle Acquisition of Sun

There was no shortage of strong reaction at the MySQL conference to the acquisition of Sun. Open source purists were incensed

Make the Whirl Go Away

What’s really going on with advances in solid-state disk drives and how far will it go?

Oracle’s Sun Acquisition Accelerates Push Into Data Warehousing Appliances

Oracle is acquiring longtime partner Sun Microsystems, putting the software powerhouse fully into the hardware business - and hitting the DW industry like an earthquake

Your Company’s Data Supply Chain

We’ve been talking for several years about the concept of a data supply chain. But IT executives are only now starting to catch on to its importance

Free BI is Still No Free Lunch

There are few truly zero cost alternatives to BI tools, but there are some

More Statistical Learning

The Elements of Statistical Learning: Data Mining, Inference and Prediction, Second Edition, by Trevor Hastie, Robert Tibshirani and Jerome Friedman is now available

What About Architecture Anyway?

Confusion about services-based architectures has been created by a number of industry elements

Architecture as an Asset

We need to consider enterprise architecture not as a hindrance, but as an asset to the IT organization and to the business that we are enabling

The R Statistical Learning Lasso

The R user community had just been provided access to a latest learning algorithm hot off the development presses from three world-renowned practitioners – for free

Dislocation Intelligence in a Brutal Economy

We as a country should try to smooth over these economic dislocations so that they don’t completely wipe some places off the map

Health Care Industry BI Groundhog Day

On my way back home from www.himss.org show in Chicago, I have a creepy feeling of déjà vu. Even worse, it feels like the movie Groundhog Day where the main character keeps waking up on the same day, same date, never able to get to tomorrow

Performance Goals and Compensation

Why business rarely ties outcomes to job success for managers, and, if the New York Yankees are baseball’s AIG, why don’t we hate them?

Blurring the Line Between SOA and BI

I recently read an article in the Microsoft Architect Journal on so-called service-oriented business intelligence or, as the article’s authors call it, SoBI

Data Vu All Over Again

No sweat I thought. We’ve honed our expertise on open source BI in the cloud. We’ve performed this drill several times already. Alas, not so fast ...

Inmon’s Vitriolic Slap At “Virtual Data Warehousing” Does Not Withstand Scrutiny

Bill Inmon treats data federation as mutually exclusive from enterprise data warehousing, when in fact they are highly complementary approaches

Gathering of Clouds

With hype contained, developments in cloud computing are on display at conference

The Divine R

A colleague recently asked me for a good introductory text on the R statistical computing platform. Though there are a seemingly endless number of published books on R, I recommended a personal favorite

Not MDM, Not Data Governance: Data Management

Has everyone forgotten database development fundamentals?

After So Many Years Of Ballyhoo, Semantic Web Still Searching For Killer App

Cynics might call Semantic Web a technology looking for a solution. And they might have a point

Basketball Diaries

Lessons from the NCAA tournament coverage; don't drop the ball on Web content management

Hypothesize/Experiment/Learn/Nudge

Nudge is a concept derived from behavioral economics. It denotes a gentle “push” or incentive to coax decision-makers to choose a preferred option from a series of alternatives

Lean Information Management Strategies for Lean Times

When the going gets tough, the tough get lean, focused, and flexible. To help organizations survive the bad times and thrive in all climates, their information management initiatives must remain agile and adaptable

Are There BI Implications In The Rumored IBM/Sun Merger? You Betcha!

I always predicted that Open Source BI has to reach critical mass before it becomes a viable alternative for large enterprise BI platforms

Hypothesize/Experiment/Learn

I’ve subscribed to the Harvard Business Review for years. It seems there are either several articles pertinent for BI or none at all. The February 2009 edition was the former

Enjoy the Chicken (and then Get Back to Work)

Reflecting on a conference, same as it ever was and still relevant

Yawn! Business Process Visions Consistent Over The Years

A couple of nights ago, I found my handwritten notes from 1997, summarizing a literature search I was doing at that time while authoring my first book, the IDG Books title “Workflow Strategies.” Reading the prettier handwriting of a 12-years-younger Jim Kobielus, I was struck by how little has changed since then

The Flaw of the Hub-and-Spoke Architecture

I recently talked to a client who was fixated on a hub-and-spoke solution to support his company’s analytical applications. In the world of software and data, the one thing I’ve learned is that there are no absolutes. And there’s no such thing as a universal architecture

Planning for Predictive Models – Wisdom From Regression Modeling Strategies

The following are nuggets of wisdom from RMS for planning/executing modeling studies, along with a statistical blogger’s commentary

Riding the Content Technology Train

You’ll know content vendors at a glance with this clever subway map; also, Skittles turns Web site over to those crazy social networkers

BI Nirvana

I had an amazing client experience. I searched long and hard for a client with flawless, 100 percent efficient and effective BI environment and applications. Imagine my utter and absolute amazement when I stumbled on one

Rattle Redux and Predictive Analytics World Potpourri

As I mentioned in last week’s blogs, I was pleasantly surprised by version one of Predictive Analytics World, finding it quite useful on a number of levels. Today, I offer a few final observations on the conference

Retrospect or Hindsight

Why we're bad at benchmarking BI failures

Predictive Analytics World – Methodology and Business Learning

Steve Miller's thoughts his trip to Predictive Analytics World

Learning from the Black Swan

Nassim Nicholas Taleb was right. The world financial system was devastated by unpredicted catastrophes of grand proportion – a financial black swan

Whatever Happened To EII?

Some integration approaches developed into multibillion dollar standalone markets. But others, while valuable, haven't survived as well. EII is one example

What, If Anything, Is A "Niche Vendor," Where Enterprise Data Warehousing Is Concerned?

No matter how carefully one words a report, there is always the potential for misunderstanding

SOA Mandates Data Management

Everyone thinks that SOA is an integration framework. In fact it’s a means of remotely accessing other systems and their related information without having to know the details

Kickin IT Old School

Goes around, comes around; back office meets its match in tech-savvy users

Operational BI From the Trenches

As with any other current buzzword, the world seems to be piling on and the meaning of operational BI seems to be is evolving (or eroding)

Advertisement

Advertisement