Module 2 Sources of Data
Module 2 Sources of Data
Table of Contents
1. Potential Data Sources
Categories of Data Sources
Uses of Databases in Predictive Analytics
More on Purchased Data
Introduction
In order to successfully market using predictive analytics, it is crucial to obtain appropriate
data sources as inputs.
In this module, students will get an introduction to data used in targeted marketing
campaigns. Predicting consumer response, return on investment from campaigns,
and appropriate product or service offerings can only be done
with solid input data because when you put “garbage in,” you get “garbage out.”
Additionally, students will learn the importance of understanding the variety of data
sources available along with each source's intent, capabilities, relevance to your business,
cost, and stability—all key parts of deciding whether to make an investment in specific
external data sources.
Learning Objectives
• Explain to leadership the types of data sources available in the marketplace
• Effectively evaluate various data sources (internal vs. external) used in prospecting,
consumer marketing, customer segmentation, and lead qualification
• Frame up a data question to clearly articulate revenue goals and measure them for
success
Internal data
These are data which are available in internal systems such as customer transactional data
(purchases), and customer contact data (email, phone).
Customer-provided data
These are data provided by the customer voluntarily, which could be at the point of sale,
online, by telephone, by mail, or by a survey.
Public data
Some data are publicly available via online sources, trade directories and resources such as
census bureaus or other government-related records.
Purchased data
There are many compilers of data who provide information for sale ranging from
demographics, to interests, to purchases by consumers.
Uses of Databases in Predictive Analytics
Customer Groups
Identify customer groups, from the most loyal, high-value customers to first-time
customers and occasional purchasers.
Customer Segments
Create detailed customer segments based on demographics, behaviors, or even
personal interests.
Lead Generation
Effective lead generation for better conversion rates.
Messages
Design personalized messages for both current and prospective customers.
Channel
Determine the best channel to reach customers or prospects.
Marketing Efficiency
Improve marketing efficiency by sending campaigns to those likely to respond.
Loyalty Programs
Build effective loyalty programs that provide the right incentives for repeat purchases.
The following video provides an overview of customer marketing databases and their importance in
organizing customer data.
Start
Finding a data source
There are also a number of demographic compliers for B2C marketing. These data are often
used for overlay on consumer databases to append information such as age, income,
housing status, household composition, interests, and purchasing power. Demographic
providers include Experian, Acxiom, Epsilon, and many others.
Linking to database
There are also several providers of identity graphs. These providers match the online ID of
a customer or visitor to the company’s website to the offline ID. The first major identity
graph provider is LiveRamp, however there is a host of companies currently jumping into
the space. These data are used to get a sense for characteristics on who’s visiting one’s site.
Operationalize
The marketer will have a service provider match their customer database to these
providers to gain additional information on their customers.
Account-based marketing
For B2B marketing, ABM without the right contact info is guesswork. This makes it
important to find a data compiler that can be used for a prospecting database. (Gartner
2015)
End
Activity
Which of the following are overall uses of databases
in predictive analytics?
Select all that apply, then click “Submit.”
Create detailed customer segments based on demographics, behaviors, or even personal
interests
Decrease marketing efficiency by sending campaigns to those not likely to respond
Build effective loyalty programs that provide the right incentives for repeat purchases
Internal data
• Rouge may have records of past consumer purchases from customers. These data
would include a customer identifier along with name, address, email, automobile
purchased with the price and category, and means of financing.
• Rouge may have records from dealers on auto servicing by customer.
• Rouge may also have records of consumer interactions from prospective customers,
or prospects. These data would include a prospect name, address, which
automobiles they considered with the price and category.
Customer-provided data
• Rouge may have captured data from consumer satisfaction surveys indicating
various points of information about the customer’s decision reasons.
• Rouge may also have a website where customers can share their experiences. For
expensive purchases such as an automobile, customers are more likely to share
their views with others.
Public data
• Most countries, states, and provinces collect data on residents. This would include
household composition, some measure of home ownership, and attributes of
socioeconomic status. These measures can be important in correlating to vehicle
purchase.
• There are many data sources that provide availability of public transportation.
These can be important in determining whether a vehicle is needed.
• Many countries, states, and provinces keep data on consumer vehicle ownership.
These may not be available at a household level, but can be available in small-area
geographies.
Purchased data
• There are many compilers of data who provide information on consumer purchases
and searches throughout an industry category, such as automobiles. The availability
varies by country, state, and province. In the U.S., the primary provider is Polk, a
division of IHS Markit. Polk’s Automotive Data helps dealers, media and agencies
understand who’s in-market, where to find them, what they are looking for, when
they will buy and what will drive their purchase decision.
• In addition, providers of data can develop strategic agreements. The Nielsen
company is a global company that measures advertising effectiveness.
They’ve partnered with Polk Automotive to deliver campaign effectiveness.
(Nielsen, 2021)
• RECOMMENDED READING
• For more information on what companies can do in database marketing, CRM, one-
to-one marketing, and advanced analytics to stay competitive, read Legacy
Companies Need to Become More Data Driven—Fast.
Internal data
• STW may have records of past consumer purchases from customers. These data
would include a customer identifier along with name, address, email, cruise
purchased along with the price, destination, and time of year.
• STW may also have records of consumer interactions from prospective customers,
or prospects. These data would include a prospect name, address, which cruises
they considered along with the price, destination and time of year.
Customer-provided data
• STW may have captured data from consumer satisfaction surveys indicating various
points of information about the customer’s decision reasons.
• STW may also have a website where customers can share their experiences. For
expensive experiences such as a cruise, customers are more likely to share their
views with others.
• Cruise line Instagram accounts can show potential travelers the views they’ll
experience on their vacations, with the marketer capturing the views.
Public data
• Most countries, states, and provinces collect data on residents. This would include
household composition, some measure of home ownership, and various attributes
of socioeconomic status. All these measures can be important in correlating to
cruise purchase.
• There are many data sources that provide availability of access to major airports or
departure points for cruises. These data give a sense for ease of getting to a cruise
departure.
• Many countries, states, and provinces keep data on travel outside the country. These
may not be available at a household level, but can be available in small-area
geographies.
Purchased data
• Many cruise lines partner with travel agents or partner brands to build robust
mailing lists that are made available for purchase. These are an efficient source of
previous cruise-goers and new travelers.
• Companies such as Deloitte have done studies on the cruise line industry along with
the passenger experience. (Deloitte, 2021)
Customer-provided data
• Some retailers have websites where people can post, leave feedback, and share their
thoughts. GFV may be able to leverage to optimize product mix.
• GFV may offer coupons or other incentives in exchange for customer demographics.
Public data
• Most countries, states, and provinces collect data on residents. This would include
household composition, some measure of home ownership, and various attributes
of socioeconomic status. All these measures can be important in determining the
volume of food a household will buy along with purchasing power.
• There are many data sources that provide availability of public transportation.
These can be important in determining whether a person would take a big shopping
trip infrequently, or small shopping trips more frequently. It would also help enable
to mix of perishable items vs packaged items.
Purchased data
• There are also compilers of panel data that capture receipts for consumers that opt-
in. These data are scanned by consumers and transmitted. An example is
Numerator. These data can give a sense for share of wallet from a retailer’s
customers. (Numerator, 2021)
RECOMMENDED VIDEO
• To learn what happens when a marketer doesn't "do it right", watch the following
short video by Merkle.
RECOMMENDED READING
For more information on the importance of making decisions connected to revenue via the
effective use of data and analytics, read Is Your Marketing Strategy Based on the Right
Data?
Activity
True or False: Many countries, states, and provinces
keep data on travel outside the country.
Choose the best answer, then click "Submit".
True
False
Case 4: Data R Us
Scenario: Suppliers of goods to retailers know what they’ve shipped to each retailer,
but they don’t know their market share nor the specific drivers impacting change in market
share. A marketing information supplier, Data R Us, has agreements with 800 different CPG
retailers throughout the U.S. Data R Us aggregates data for suppliers such as Coca-Cola,
Procter & Gamble, and Kraft-Heinz along with model-based explanations of why market
share has changed over time. Their service is sold via annual subscription.
Internal data
• Point-of-sale data for 800 retailers throughout the U.S.
Customer-provided data
• Some customers provide their shipment data; how much product has been shipped
to each retailer by time period and product type.
Public data
• Quarterly revenue by retailer.
Purchased data
• Bombora is a company that measures prospects’ digital journey across 5,000+
premium B2B websites, so you can know exactly what prospects are in the market
for, letting you guide them to your solution. (Bombora, 2024)
Internal data
• History of conversion rates by industry based on actions on website; identity graph
matching device ID to physical mailing address.
Customer-provided data
• Clients provide their first-party data from website visits including time on the
domain, which URLs were visited, number of clicks, source of entry to site (direct vs.
search).
Public data
• Total revenue for each prospective publicly traded client.
Purchased data
• Firmographic segmentation to group companies by filters; Coresignal is a good
example. (Coresignal, 2024)
RECOMMENDED READING
• Read about the 3 Reasons to Build an Effective B2B Prospecting Database to learn
how to construct an effective database.
RECOMMENDED VIDEO
• In this video, Colin Carrasquillo from Experimarketing provides insights into a
major provider of automotive ownership data. Watching this video will help you to
understand the importance of utilizing data to inform automotive advertising
decisions
Accuracy
Coverage
Next, it's important to consider coverage. Does it have high coverage? That is, does it capture a high
percentage of the target audience?
Predictive Power
Finally, it's important to assess the data's usefulness in predicting consumer behavior. Can it be used
to segment the audience appropriately and predict how much clients or customers will spend?
RECOMMENDED VIDEO
While customer demographics provide you with an idea of what products and services
your customers may be interested in, psychographics include things like personality,
values, attitudes, interests, and lifestyles. In this video, the team from Elegant Themes
explains 3 key steps for using psychographics to improve marketing efforts. Watching this
video will help you to better understand the use of motivations and desires in marketing
success.
Using Psychographics to Improve Marketing Techniques (youtube.com)
Case 4: Data R Us
Scenario: Data R Us wants to upsell their clients, CPG suppliers, from their basic offering
that solely provides data on market share to the premium package that uses predictive
modeling to identify the primary drivers impacting change in market share.
In the carousel below, we'll explore two potential questions that could help achieve this
goal. Which one would you prioritize?
REMEMBER
In each case you must evaluate each data source within the dimensions of accuracy, high
coverage, and predictive power toward the question being asked.
The effective marketer or salesperson will need to work closely with data architects to
build a data model that will house the data in such a way to allow marketers and
salespeople to meet all the goals of the customer marketing or prospecting database. Data
modeling is a process used to define and analyze data requirements needed to support the
business processes within the scope of corresponding information systems in
organizations. (IBM, 2024)
Above all, the purpose of collecting data is not only to increase sales to customers, but to
establish relevancy so that the customer feels good about the purchase and comes back.
After each effort, additional data needs to be added and sources re-evaluated.
Finally, there can always be errors or missing data within any data sources. The smart
business leader uses domain expertise to determine rules for whether data are “good” or
need to be thrown out.
RECOMMENDED READING
For more information on ways to determine the quality of your data, read How to Measure
Data Quality – 7 Metrics to Assess the Quality of Your Data.
Glossary
For your convenience, here is a list of the key terms presented in this module.
Behavioral Data
Behavioral data is based on what is happening now, or in the recent past. This data tells us
what a person is doing.
Customer-Provided Data
These are data provided by the customer voluntarily, which could be at the point of sale,
online, by telephone, by mail, or by a survey.
Data Model
Data models are visual representations of an enterprise's data elements and the
connections between them.
Data Modeling
Data modeling is a process used to define and analyze data requirements needed to
support the business processes within the scope of corresponding information systems in
organizations.
Demographic Data
This type of data includes census data, home ownership, and potentially thousands of
external sources.
Internal Data
These are data available in internal systems such as customer transactional data
(purchases) and customer contact data (email, phone number).
Psychographic Data
Psychographic data includes data based on a consumer’s values, opinions, and motivations.
This data helps uncover why they are making the choices they do.
Public Data
Some data are publicly available via online sources, trade directories, and resources such
as census bureaus or other government-related records.
Purchased Data
These are data that companies compile in order to sell the related information, which
can range from demographics to consumer interests and purchases.
Knowledge Check
Module 2
The following series of questions will test your understanding of the contents of
this module.
Please read each question carefully before selecting your answer.