Free Site RegistrationFree Site Registration

Predictive Analytics with Data Mining: How It Works

Information Management Special Reports, February 2005

Eric Siegel

Although you've probably heard many times that predictive analytics will optimize your marketing campaigns, it's hard to envision, in more concrete terms, what it will do. This makes it tough to select and direct analytics technology. How can you get a handle on its functional value for marketing, sales and product directions without necessarily becoming an expert?

The answer is, in order to know precisely how predictive analytics may benefit current marketing operations, you do need to learn a few specifics about how it works. This short article covers just enough of the inside mechanics to eliminate predictive analytics' "voodoo" status. Here you will learn what a predictive model is, and how it constitutes a key form of business intelligence by actively guiding marketing campaigns. To this end, we'll take a look inside to see how a model works and how it is created.

Predictors Rank Your Customers to Guide Marketing

Advertisement

Predictive analytics' central building block is the predictor, a single value measured for each customer. For example, recency, which is based on the number of weeks since the customer's last purchase, has higher values for more recent customers. This predictor is usually a reliable campaign response predictor: you will receive more responses from those customers more highly ranked by recency. That means that if you contact your customers in order of recency - first, call the most-recent customer; next, call the next-most-recent customer; and so on - you will improve your response rate.

For each prediction goal, there are an abundance of predictors that will help rank your customer database. For example, consider a customer's online behavior: Customers who spend less time logged on may be less likely to renew their annual subscription. In this case, retention campaigns can be cost-effectively targeted to customers with a low monthly usage predictor value.

Combined Predictors Mean Smarter Rankings

It turns out you can do even better by using more than one predictor at a time, combining them with a model. Creating this model is the very purpose of predictive analytics.

One way to combine two predictors is with a formula, such as simply adding them together. If both recency and personal income influence the chance that a customer will respond to a mailing, a good predictor may be:

Voila, a new, improved predictor. If recency is twice as important, give it twice the weight:

A scheme such as this that combines predictors is called a model - in the case of the summation above, a linear model. For this reason, predictive analytics is also called predictive modeling.

Other predictive models are business intelligence rules, such as:

If you discover that urban customers who spend more time exploring new service features are at a greater risk to cancel, expand this rule-based model with a second rule:

The right combination of predictors will perform better prediction by considering multiple aspects of the customer and her behavior. To match the complexity of customer decisions, a predictive model must usually be much richer and more complex than the above examples, combining dozens of predictors.

The Computer Makes Your Model from Your Customer Data

The real trick is to find the best predictive model. This is a difficult problem, since there are so many options. There are many kinds of models, such as linear formulas and business rules. And, for each kind of model, there are all the weights or rules or other mechanics that determine precisely how the predictors are combined. In fact, there are so many choices, it is literally impossible for a person to try them all and find the best one.

Predictive analytics is data mining technology that uses your customer data to build a predictive model specialized for your business. This process learns from your organization's collective experience by leveraging your existing logs of customer purchases, behavior and demographics. The wisdom gained is encoded as the predictive model itself. Predictive modeling software has computer science at its core, undertaking a mixture of number crunching, trial and error.

Figure 1: Wisdom Gained: A Predictive Model is Built from Customer Data

Predictive analytics can generate business rules that may make clear sense, or you could end up with a complex formula that is hard to decipher. The choice is up to you, keeping in mind that a simpler, more intuitive model may not perform prediction as well.

A Simple Curve Shows How Well Your Model Works

Either way, you need solid proof that the model is a good one. A profit curve (shown in Figure 2) estimates the profit you'll receive from a campaign guided by predictive analytics, depending on how many prospects you contact. The profit this curve predicts depends on the ranking of your customers given by a predictive model, the cost per contact (e.g., printing and mailing costs) and the average profit per respondent.

Figure 2: A Typical Profit Curve

As shown by the upper line in Figure 2, the more customers you contact, the greater your profit, up to a point. This predicted profit line rises initially, since you will contact customers more likely to respond first. After exhausting those highly ranked, though, contacting the remaining customers will only serve to decrease your profit. You'll probably want to stop your campaign at the high profit peak, although that choice may depend on your longer-term marketing strategies.

Page 1 of 2.

Advertisement

Advertisement