CPP3
CPP3
of
Computer Sc. & Engg.
Data mining is the process of extracting knowledge or insights from large amounts of data using various
statistical and computational techniques. The data can be structured, semi-structured or unstructured,
and can be stored in various forms such as databases, data warehouses, and data lakes.
The primary goal of data mining is to discover hidden patterns and relationships in the data that can be
used to make informed decisions or predictions. This involves exploring the data using various
techniques such as clustering, classification, regression analysis, association rule mining, and anomaly
detection.
Data mining has a wide range of applications across various industries, including marketing, finance,
healthcare, and telecommunications. For example, in marketing, data mining can be used to identify
customer segments and target marketing campaigns, while in healthcare, it can be used to identify risk
factors for diseases and develop personalized treatment plans.
A data warehouse is a system that aggregates data from multiple sources into a single, central and
consistent data store. Data warehouses help prepare data for data analytics, business intelligence (BI),
data mining, machine learning (ML) and artificial intelligence (AI) initiatives.