Data Mining
Data Mining
Companies use data mining software to learn more about their customers. It can help
them to develop more effective marketing strategies, increase sales, and decrease
costs. Data mining relies on effective data collection, warehousing, and computer
processing.
KEY TAKEAWAYS
Data mining is the process of analyzing a large batch of information to discern
trends and patterns.
Data mining can be used by corporations for everything from learning about what
customers are interested in or want to buy to fraud detection and spam filtering.
Data mining programs break down patterns and connections in data based on what
information users request or provide.
Social media companies use data mining techniques to commodify their users in order
to generate profit.
This use of data mining has come under criticism lately as users are often unaware
of the data mining happening with their personal information, especially when it is
used to influence preferences.
Data is collected and loaded into data warehouses on-site or on a cloud service.
Business analysts, management teams, and information technology professionals
access the data and determine how they want to organize it.
Custom application software sorts and organizes the data.
The end user presents the data in an easy-to-share format, such as a graph or
table.
Data Warehousing and Mining Software
Data mining programs analyze relationships and patterns in data based on user
requests. It organizes information into classes.
For example, a restaurant may want to use data mining to determine which specials
it should offer and on what days. The data can be organized into classes based on
when customers visit and what they order.
Cloud data warehouse solutions use the space and power of a cloud provider to store
data. This allows smaller companies to leverage digital solutions for storage,
security, and analytics.
Different data mining processing models will have different steps, though the
general process is usually pretty similar. For example, the Knowledge Discovery
Databases model has nine steps, the CRISP-DM model has six steps, and the SEMMA
process model has five steps.
1
Applications of Data Mining
In today's age of information, almost any department, industry, sector, or company
can make use of data mining.
Sales
Data mining encourages smarter, more efficient use of capital to drive revenue
growth. Consider the point-of-sale register at your favorite local coffee shop. For
every sale, that coffeehouse collects the time a purchase was made and what
products were sold. Using this information, the shop can strategically craft its
product line.
Marketing
Once the coffeehouse above knows its ideal line-up, it's time to implement the
changes. However, to make its marketing efforts more effective, the store can use
data mining to understand where its clients see ads, what demographics to target,
where to place digital ads, and what marketing strategies most resonate with
customers. This includes aligning marketing campaigns, promotional offers, cross-
sell offers, and programs to the findings of data mining.
Manufacturing
For companies that produce their own goods, data mining plays an integral part in
analyzing how much each raw material costs, what materials are being used most
efficiently, how time is spent along the manufacturing process, and what
bottlenecks negatively impact the process. Data mining helps ensure the flow of
goods is uninterrupted.
Fraud Detection
The heart of data mining is finding patterns, trends, and correlations that link
data points together. Therefore, a company can use data mining to identify outliers
or correlations that should not exist. For example, a company may analyze its cash
flow and find a reoccurring transaction to an unknown account. If this is
unexpected, the company may wish to investigate whether funds are being mismanaged.
Human Resources
Human resources departments often have a wide range of data available for
processing including data on retention, promotions, salary ranges, company
benefits, use of those benefits, and employee satisfaction surveys. Data mining can
correlate this data to get a better understanding of why employees leave and what
entices new hires.
Customer Service
Customer satisfaction may be caused (or destroyed) for a variety of reasons.
Imagine a company that ships goods. A customer may be dissatisfied with shipping
times, shipping quality, or communications. The same customer may be frustrated
with long telephone wait times or slow e-mail responses. Data mining gathers
operational information about customer interactions and summarizes the findings to
pinpoint weak points and highlight what the company is doing right.
Data mining can look very different across applications, but the overall process
can be used with almost any new or legacy application. Essentially any type of data
can be gathered and analyzed, and almost every business problem that relies on
qualifiable evidence can be tackled using data mining.
The end goal of data mining is to take raw bits of information and determine if
there is cohesion or correlation among the data. This benefit of data mining allows
a company to create value with the information they have on hand that would
otherwise not be overly apparent. Though data models can be complex, they can also
yield fascinating results, unearth hidden trends, and suggest unique strategies.
Data mining doesn't always guarantee results. A company may perform statistical
analysis, make conclusions based on strong data, implement changes, and not reap
any benefits. Through inaccurate findings, market changes, model errors, or
inappropriate data populations, data mining can only guide decisions and not ensure
outcomes.
There is also a cost component to data mining. Data tools may require costly
subscriptions, and some bits of data may be expensive to obtain. Security and
privacy concerns can be pacified, though additional IT infrastructure may be costly
as well. Data mining may also be most effective when using huge data sets; however,
these data sets must be stored and require heavy computational power to analyze.
Even large companies or government agencies have challenges with data mining.
Consider the FDA's white paper on data mining that outlines the challenges of bad
information, duplicate data, underreporting, or overreporting.
2
That data can be used to make inferences about their preferences. Advertisers can
target their messages to the people who appear to be most likely to respond
positively.
Data mining on social media has become a big point of contention, with several
investigative reports and exposes showing just how intrusive mining users' data can
be. At the heart of the issue, users may agree to the terms and conditions of the
sites not realizing how their personal information is being collected or to whom
their information is being sold.
Examples of Data Mining
Data mining can be used for good, or it can be used illicitly. Here is an example
of both.
In light of this inappropriate data mining and misuse of user data, Facebook agreed
to pay $100 million for misleading investors about its uses of consumer data. The
Securities and Exchange Commission claimed Facebook discovered the misuse in 2015
but did not correct its disclosures for more than two years.
5
The ultimate goal of the data mining process is to compile data, analyze the
results, and execute operational strategies based on data mining results.