Data mining
Data mining
Mining Technologies:
1) Statistics:
Regression analysis, Clustering analysis standard deviation
lays the foundation of data mining.
2) Artificial Intelligence:
Apply the Human thought like processing.
3) Machine Learning:
Union of Statics and Ai.
About learning by the software of data
Data mining process:
The data mining process can be broken down into these four primary stages:
Data mining process:
Nothing’s perfect, including data mining. These are the major issues in data mining:
● Many data analytics tools are complex and challenging to use. Data scientists
need the right training to use the tools effectively.
● Speaking of the tools, different ones work with varying types of data mining,
depending on the algorithms they employ. Thus, data analysts must be sure to
choose the correct tools.
● Data mining techniques are not infallible, so there’s always the risk that the
information isn’t entirely accurate. This obstacle is especially relevant if there’s
a lack of diversity in the dataset.
● Companies can potentially sell the customer data they have gleaned to other
businesses and organizations, raising privacy concerns.
● Data mining requires large databases, making the process hard to manage.
● Distributed Data
● Complex Data
It takes a long time and money to process big amounts of complicated data. Data in the
real world is structured, unstructured,semi-structured, and heterogeneous forms,
including multimedia such as photos, music, video, natural language text etc
● Domain Knowledge
It is simpler to dig some information with domain expertise, without which collecting
useful information from data might be tough.
● Data Visualization
The first interaction that presents the result correctly to the client is data visualization.
The information is conveyed with unique relevance based on its intended use.
● Incomplete Data
Large data amounts might be imprecise or unreliable owing to measurement equipment
problems. Customers that refuse to disclose their personal information may result in
incomplete data, which may be updated owing to system failures, resulting in noisy
data, making the data mining procedure difficult.
● Higher Costs
The expenses linked with purchasing and maintaining strong servers, software, and
hardware for handling massive amounts of data might be too expensive.
● Performance Issues