Data Mining Pptxpresentation
Data Mining Pptxpresentation
BBA(IB)
23
Contents
Category 1
09/12/2022
• Data analysis
• Tools used in data mining
• Benefits of data mining
Category 3
2
What Is Data Mining
09/12/2022
• Data mining is the process of sorting through large data sets to identify
patterns and relationships that can help solve business problems through data
analysis. Data mining techniques and tools enable enterprises to predict
future trends and make more-informed business decisions.
• Data mining is a key part of data analytics overall and one of the core
disciplines in data science, which uses advanced analytics techniques to find
useful information in data sets. At a more granular level, data mining is a step
in the knowledge discovery in databases (KDD) process, a data science
methodology for gathering, processing and analyzing data. Data mining and
KDD are sometimes referred to interchangeably, but they're more commonly
seen as distinct things.
3
09/12/2022
4
09/12/2022
5
09/12/2022
Data Gathering
• Relevant data for an analytics application is
identified and assembled. The data may be
located in different source systems, a data
warehouse or a data lake, an increasingly
common repository in big data
environments that contain a mix of structured
and unstructured data. External data sources
may also be used. Wherever the data comes
from, a data scientist often moves it to a data
lake for the remaining steps in the process.
6
09/12/2022
Data Preparation
• This stage includes a set of
steps to get the data ready to
be mined. It starts with data
exploration, profiling and pre-
processing, followed by data
cleansing work to fix errors
and other data quality issues.
Data transformation is also
done to make data sets
consistent, unless a data
scientist is looking to analyze
unfiltered raw data for a
particular application. 7
Data Modeling
09/12/2022
THANK YOU
12