0% found this document useful (0 votes)
99 views4 pages

Data Mining Assignment

This data mining assignment submission summarizes key concepts in data mining. It defines data mining as discovering patterns from large datasets to extract useful insights. There are two main types of data: structured data organized in tables and unstructured data like text. Attributes in data mining can be nominal without order, ordinal with order but inconsistent intervals, interval with consistent intervals but no true zero, or ratio with consistent intervals and a true zero point. Data mining functions include pattern discovery, knowledge extraction, predictive modeling, and more.

Uploaded by

Ali Hassan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
99 views4 pages

Data Mining Assignment

This data mining assignment submission summarizes key concepts in data mining. It defines data mining as discovering patterns from large datasets to extract useful insights. There are two main types of data: structured data organized in tables and unstructured data like text. Attributes in data mining can be nominal without order, ordinal with order but inconsistent intervals, interval with consistent intervals but no true zero, or ratio with consistent intervals and a true zero point. Data mining functions include pattern discovery, knowledge extraction, predictive modeling, and more.

Uploaded by

Ali Hassan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

DATA MINING ASSIGNMENT

SUBMITTED TO: MR. ZAFAR IQBAL


SUBMITTED BY: HASNAIN JAVED (BSF2006253)

Q: what is data mining?


Data mining is the process of discovering patterns, trends, correlations, or valuable information
from large sets of data. It involves using various techniques and algorithms to analyze and
interpret data, with the goal of extracting useful and meaningful insights. Data mining is often
associated with the field of knowledge discovery in databases (KDD).

Q: why we need data mining?


We need data mining because Data mining is crucial for:
Knowledge Discovery: Identifying hidden patterns and relationships in large datasets.
Decision Making: Providing insights for informed and strategic decisions.
Predictive Modeling: Forecasting trends and behaviors.
Pattern Recognition: Understanding customer behavior and market trends.
Fraud Detection and Security: Identifying anomalies for fraud prevention.
Customer Relationship Management (CRM): Enhancing customer satisfaction and loyalty.

Q: what is data? How many types of data is used


in data mining?
Data refers to raw facts, figures, or information, often in the form of numbers, text, or
multimedia. In the context of data mining, data is the foundation upon which analysis and
discovery are performed. Data can be collected from various sources and can include structured
data (organized in tables or databases) and unstructured data (not organized in a predefined
manner, such as text documents or images).
In data mining, there are two main types of data:
Structured Data: This type of data is highly organized and typically resides in relational
databases or spreadsheets. It is represented in tables with rows and columns, and each attribute
has a predefined data type. Structured data is suitable for traditional data mining techniques like
classification, regression, and clustering.

Unstructured Data: This type of data lacks a predefined data model or structure. It includes
text documents, images, videos, social media posts, and other content that doesn't fit neatly into
rows and columns. Unstructured data requires more advanced techniques, such as natural
language processing and image analysis, to extract meaningful information. Text mining, image
mining, and multimedia mining are examples of data mining approaches applied to unstructured
data.

Q: Define the types of attributes with suitable


example?
In data mining, attributes are the properties or characteristics of the data that are being analyzed.
There are different types of attributes based on their nature and the kind of values they can take.
Here are some common types:
Nominal Attributes: Nominal attributes are categorical variables with no inherent order or
ranking among categories.
Example: Colors (e.g., red, blue, green) or types of fruits (e.g., apple, banana, orange).
Ordinal Attributes: Ordinal attributes have categories with a meaningful order or rank, but the
intervals between them are not consistent.
Example: Education levels (e.g., high school, college, graduate) or customer satisfaction ratings
(e.g., low, medium, high).
Interval Attributes Interval attributes have a consistent interval between values, but there is no
true zero point.
Example: Temperature measured in Celsius or Fahrenheit (e.g., 20°C, 30°C) where zero does
not represent an absence of temperature.
Ratio Attributes: Ratio attributes have a consistent interval between values, and they have a
true zero point, indicating the absence of the attribute.
Example: Height, weight, income, where zero is meaningful (e.g., 0 meters, 0 kilograms, 0
dollars).
Binary Attributes: Binary attributes have only two possible values, often denoted as 0 and 1.
Example: Yes/No, True/False, or 1/0 for variables like gender (Male/Female) or whether a
customer made a purchase (Yes/No).
Discrete Attributes: Discrete attributes can take on a finite or countable number of distinct
values.
Example: Number of children in a family (1, 2, 3, ...) or the count of items in a shopping cart.
Continuous Attributes: Continuous attributes can take any real value within a given range.
Example: Temperature measured in degrees, height measured in centimeters, or income
measured in dollars and cents.

Q: Describe the functions of data mining?


The functions of data mining can be as follows:

Pattern Discovery: Uncovering hidden patterns, trends, and relationships within large datasets.
Knowledge Extraction: Extracting valuable insights and information from data to support
decision-making.
Predictive Modeling: Building models to predict future trends or behaviors based on historical
data.
Anomaly Detection: Identifying unusual patterns or outliers that may indicate fraud or
irregularities.
Classification: Categorizing data into predefined classes or groups based on its attributes.
Clustering: Grouping similar data points together to discover natural structures within the data.
Association Rule Mining: Finding associations or relationships between variables in large
datasets.

Q: Diff b/w nominal and ordinal value with


examples?
Nominal Values:
Definition: Nominal values are categorical variables with no inherent order or ranking among
categories.
Example: Colors (e.g., red, blue, green) or types of fruits (e.g., apple, banana, orange).
Characteristics: Categories are distinct, and there is no meaningful order or hierarchy. Each
category is treated as equal.

Ordinal Values:
Definition: Ordinal values have categories with a meaningful order or rank, but the intervals
between them are not consistent.
Example: Education levels (e.g., high school, college, graduate) or customer satisfaction ratings
(e.g., low, medium, high).
Characteristics: Categories have a relative ranking, indicating a specific order or hierarchy.
However, the differences between the ranks are not necessarily uniform or precisely measurable.

Q: Diff b/w interval and ration-scale value with


example?
Interval Values:
Definition: Interval values have a consistent interval between values, but there is no true zero
point.
Example: Temperature measured in Celsius or Fahrenheit (e.g., 20°C, 30°C) where zero does
not represent an absence of temperature.
Characteristics: Intervals between values are consistent and meaningful, but the absence of the
attribute (zero point) is arbitrary.

Ratio-Scale Values:
Definition: Ratio-scale values also have a consistent interval between values, but they possess a
true zero point, indicating the absence of the attribute.
Example: Height, weight, income, where zero is meaningful (e.g., 0 meters, 0 kilograms, 0
dollars).
Characteristics: Similar to interval values, ratio-scale values have consistent and meaningful
intervals, but the presence of a true zero point allows for ratios to be formed.

You might also like