0% found this document useful (0 votes)
131 views7 pages

Data Mining and Predictive Analytics

Uploaded by

Armand Cristobal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
131 views7 pages

Data Mining and Predictive Analytics

Uploaded by

Armand Cristobal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Part I

1. For each of the following meetings, explain which phase of the CRISP-DM process is
represented:

a. Managers want to know by next week whether deployment will take place. Therefore,
analysts meet to discuss how useful and accurate their model is.

Evaluation Phase - Data mining analysts analyze whether the model and approach
employed in the first step accomplish the business goals indicated in the evaluation phase.
b. The data mining project manager meets with the data warehousing manager to discuss
how the data will be collected.

Data Understanding Phase - The data warehouse is recognized as a resource during


the Business Understanding phase, but data gathering takes place during the Data
Understanding phase. Data is collected and retrieved from the resources stated and
identified in the Business Understanding step during this phase.
c. The data mining consultant meets with the Vice President for Marketing, who says that
he would like to move forward with Customer Relationship Management (CRM).

Understanding Phase - During the Company Understanding Phase, the primary goal
of business is to evaluate. As a result of the meeting, the data mining specialist appeared
to have persuaded the vice president of marketing to allow data mining on the customer
relationship management system.
d. The data mining project manager meets with the production line supervisor to discuss
the implementation of changes and improvements.

Deployment Phase - It employs already constructed models. In another department,


it employs a concurrent data mining method.
e. The analysts meet to discuss whether the neutral network or decision tree models
should be applied.

Modeling Phase - The analysts get together to discuss, choose, and implement
appropriate modeling methodologies for use throughout the Modeling Phase.
2. Discuss the need for human direction of data mining. Describe the possible
consequences of relying on completely automated data analysis tools.

Data mining is a technique used by businesses to convert unstructured data into


useful information. Businesses can learn more about their customers by employing
algorithms to search for patterns in enormous volumes of data. This enables them to
develop more effective marketing tactics, boost sales, and save costs. We could look at
client habits and insights, as defined by the definitions and applications of data mining. As
a result, the business is thriving and data-driven. Because AI is continually evolving,
hardware and software must be changed to suit the most current requirements. This is one
of the downsides of depending too heavily on automated data analysis. Machines require
regular maintenance and repair, which is costly. Because they are incredibly intricate
equipment, their construction necessitates exorbitant pricing. As a result of its applications
automating the majority of people's professions, AI causes humans to lose their critical
thinking skills or become technology dependent. AI also causes humans to become lazy.
Humans have a tendency to become addicted to technology advancements, which could
be a concern for future generations. The rise in the unemployment rate is another factor.
As AI replaces most monotonous jobs and other duties with robots, human intervention is
reducing, posing a serious dilemma in terms of employment norms. Every business wants
to replace low-skilled labor with AI machines that can do the job better. It reduces the
ability to work in a safe atmosphere. There's no doubting that robots are more productive
than humans. They can replace the human bond that holds the team together, but they
can't replace the human bond that holds the team together. Machines are unable to
develop human-like connections, which is a crucial characteristic in team management.
Machines can only do jobs for which they were intended or programmed; otherwise, they
will fail or provide irrelevant results, which can be disastrous.

3. CRISP-DM is not the only standard process for data mining. Research an alternative
methodology (ex: Sample, Explore, Modify, Model, and Assess or SEMMA, from SAS
Institute). Discuss the similarities and differences of the researched methodology with
CRISP-DM.
For the data science life cycle, the CRISP-DM (Cross Industry Standard Process for
Data Mining) is a six-phase process model. It acts as a collection of principles for planning,
organizing, and carrying out a data science (or machine learning) project. While Fayyad,
Piatetsky-Shapiro, and Smyth proposed the foundations of structured data mining
approaches, they were originally tied to Knowledge Discovery in Databases. KDD is a
conceptual process model of computational ideas and methods for data extraction. Data
mining is a specific phase in the KDD approach to knowledge discovery. As a result, KDD
has the advantage of taking into account data storage and access, algorithm scaling,
result interpretation and visualization, and human-computer interaction through its nine
primary phases. With the introduction of KDD, the line between data mining and data
analytics became even obvious.

Part II

1. Discuss briefly the difference between Data Mining and Predictive Analytics.

Predictive analytics is the process of extracting data from large databases in order to
make predictions and projections about future events. Data mining is the process of
identifying important patterns and trends in large data collections, whereas predictive
analytics is the process of extracting data from large databases in order to make
predictions and projections about future events. On the other hand, predictive analytics
and data mining both use data models to generate predictions about upcoming events.
Although there are significant distinctions between the two, they are frequently used to
explain how data is handled.

2. Enumerate and explain the fallacies of Data Mining.

Fallacy 1. There are data mining tools that we can turn loose on our data repositories, and
find answers to our problems.

According to popular assumption, there are no automatic data mining solutions that
can answer your problems while you wait. Data mining, on the other hand, is a method.
CRISP-DM can be used to integrate data mining into a larger commercial or research
plan.

Fallacy 2. The data mining process is autonomous, requiring little or no human oversight.

The ability of the embedded control mechanism to make independent data mining
judgments. On the other hand, data mining isn't a magic formula. Without skilled human
supervision, using data mining technology blindly can only supply you with an inaccurate
answer to an incorrect query applied to the wrong sort of data.

Fallacy 3. Data mining pays for itself quite quickly.

According to the instructional material, return rates vary depending on start-up costs,
analytical manpower costs, and data warehouse preparation expenditures, among other
factors.

Fallacy 4. Data mining software packages are intuitive and easy to use.

It is necessary to understand the mathematical and statistical assumptions


underlying the program's models.

Fallacy 5. Data mining will identify the causes of our business or research problems.

While patterns can be discovered, human examination is required to determine the


software's usefulness.

Fallacy 6. Data mining will automatically clean up our messy database.

Because data is merely a collection of programs that you want the system to
recognize, it usually has to be preprocessed.

Fallacy 7. Data mining always provides positive results.

Data mining may be able to provide useful discoveries if preprocessing and an


understanding of underlying mathematical and statistical assumptions are used correctly
(with data application).
3. Is it important to form a model in predicting behaviors? Explain the good side of having
a predictive model.

Models have long played a crucial role in science, and they are still used to evaluate
hypotheses and forecast data today. Scientists are frequently inaccurate because they do
not have all of the information. Businesses can use predictive models to find, keep, and
expand their most profitable clients. It enhances the efficiency of the business. To forecast
inventory and manage resources, many businesses use predictive models. To establish
ticket prices, airlines utilize predictive analytics. It's very beneficial for demand forecasting,
labor planning, and customer attrition analysis, as well as competitor research in depth. It
anticipates external influences that may have an impact on your workflow. It's Fleet
maintenance time. Financial risks are highlighted, as are credit models.

4. Cite the possible examples or applications of Predictive Analytics on marketing mix:

4.1. Product
It boosts lead generation by providing the information needed to optimize
advertising campaigns and target the most profitable customers. More revenue and a
greater return on investment result from better leads. Marketing analytics provides insight
into customer behavior and preferences.

4.2. Price
The product's perceived value determines the pricing strategy. It's logical to
presume that the price was determined based on what the competitors was charging. It's
used to determine how pricing decisions affect the total business, examine the profitability
of various price points, and improve a company's pricing strategy to maximize revenue.

4.3. Place
Place encompasses many locations where products is made, viewed in ads,
distributed, and sold. They can better estimate where they can position their business with
the use of data.

4.4. Promotion
It boosts lead generation by providing the information needed to optimize
advertising campaigns and target the most profitable customers. More revenue and a
greater return on investment result from better leads. Marketing analytics provides insight
into customer behavior and preferences.

4.5. Physical Evidence


This tangible thing verifies the purchasing experience and gives the transaction a
sense of worth. It consists of precise data or records that can be used as reliable proof.

4.6. People
People aren't merely the people to whom you sell and advertise. Personnel,
salespeople, customer service representatives, and anybody else involved in the
marketing and sales operations are all included. For instance, it may play a role in
minimizing costly staff turnover. However, only a few businesses are capable of
developing human resource forecasting models.

4.7. Processes
The delivery of a product or service to a consumer is referred to as this P.
Function, activities, tasks, and processes must all be outlined in maps. The goal of
implementing predictive analytics is to keep processes running smoothly and efficiently.

5. Relate a company’s competitive data analytics culture and proper implementation of Big
Data strategy. Explain the possible outcome given this situation. Will this have a significant
impact to business performance?

Big data refers to data volumes that are so huge and complicated that typical data
processing technologies become ineffective. To handle such massive amounts of data,
new technology is required. Big Data, according to the internet, has recently piqued the
interest of academics and practitioners due to its potential to provide beneficial insights for
improved decision-making. Many firms are turning to Big Data Analytics to gain valuable
insights from massive amounts of data. AIG's chief science officer, the most difficult
aspect of transitioning from a knowing culture to a learning culture from a culture that
relies heavily on heuristics in decision making to a culture that is much more objective and
data driven, and embraces the power of data and technology is not the cost. At initially, it
is mostly determined by imagination and inertia. Data analysis not only improves
productivity, but it also helps uncover new business prospects that could otherwise go
unnoticed, such as underserved client categories. As a result, the potential for profit and
growth expands, and the focus shifts to intelligence. Big data can be utilized to minimize
operating costs and better manage resources. The data acquired can be utilized to
improve and change corporate processes, lowering costs and improving profits. In
addition, Big Data analysis makes the process of cutting waste and enhancing efficiency
much easier.

You might also like