Module 3
True Positives (TP): Count of pairs of data points that are in the same cluster in both the predicted clustering and the
ground truth clustering.
False Positives (FP): Count of pairs of data points that are in the same cluster in the predicted clustering but not in
the ground truth clustering.
False Negatives (FN): Count of pairs of data points that are in the same cluster in the ground truth clustering but not
in the predicted clustering.
The Fowlkes-Mallows index (FM index) is calculated as the geometric mean of precision (P) and recall (R):
FM = √(P × R) = TP / √((TP + FP) × (TP + FN))
Precision (P) measures the proportion of true positive pairs among all pairs that are in the same cluster in the predicted clustering, i.e., P = TP / (TP + FP). It is a measure of how accurate the predicted clustering is.
Recall (R) measures the proportion of true positive pairs among all pairs that are in the same cluster in the ground truth clustering, i.e., R = TP / (TP + FN). It is a measure of how well the predicted clustering captures the clusters in the ground truth.
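As a minimal sketch (assuming scikit-learn is available; the two label vectors are purely illustrative), the pair counts above can be accumulated directly and checked against the library's reference implementation:

```python
# Sketch: computing the Fowlkes-Mallows index from pair counts (toy labels).
from itertools import combinations
from math import sqrt

from sklearn.metrics import fowlkes_mallows_score  # reference implementation

truth = [0, 0, 0, 1, 1, 1]   # ground truth cluster labels (illustrative)
pred  = [0, 0, 1, 1, 1, 1]   # predicted cluster labels (illustrative)

tp = fp = fn = 0
for i, j in combinations(range(len(truth)), 2):
    same_pred  = pred[i] == pred[j]
    same_truth = truth[i] == truth[j]
    if same_pred and same_truth:
        tp += 1        # pair grouped together in both clusterings
    elif same_pred:
        fp += 1        # together only in the predicted clustering
    elif same_truth:
        fn += 1        # together only in the ground truth clustering

precision = tp / (tp + fp)
recall    = tp / (tp + fn)
fm = sqrt(precision * recall)                    # geometric mean of P and R
print(fm, fowlkes_mallows_score(truth, pred))    # the two values should agree
```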
Advantages of K-Means:
1. Simple and Easy to Implement: K-Means is straightforward and easy to understand, making it accessible for
beginners in clustering.
2. Efficient: It is computationally efficient and scalable to large datasets, making it suitable for datasets with many
data points.
3. Works Well with Balanced Cluster Sizes: K-Means performs well when clusters are relatively balanced in size and
have a spherical shape.
4. Iterative Refinement: The algorithm iteratively refines cluster centroids, converging to a local optimum, ensuring a
meaningful partitioning of the data.
5. Interpretability: The resulting clusters are easy to interpret and visualize, making it useful for exploratory data
analysis.
Disadvantages of K-Means:
1. Sensitive to Initial Centroid Selection: The algorithm's performance can be sensitive to the initial placement of
centroids, leading to different clusterings for different initializations.
2. Assumes Spherical Clusters: K-Means assumes that clusters are spherical and have similar sizes, which may not
hold true for all datasets. It may produce poor results for non-linear or irregularly shaped clusters.
3. Requires Predefined Number of Clusters: The user must specify the number of clusters (k) in advance, which can
be challenging when the optimal number of clusters is unknown.
4. Sensitive to Outliers: Outliers can significantly affect the cluster centroids' positions, leading to suboptimal cluster
assignments.
5. May Converge to Local Optima: K-Means converges to a local optimum, which may not be the global optimum. It
may produce different results with different initializations, affecting result consistency.
In summary, while K-Means is widely used for its simplicity and efficiency, it has limitations related to
cluster shape assumptions, sensitivity to initial conditions, and the requirement of a predefined number of clusters. It
is essential to understand these limitations and assess whether K-Means is suitable for a particular clustering task.
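To make the initialization sensitivity concrete, here is a small sketch (assuming scikit-learn; the synthetic blob data and parameters are illustrative) comparing a single random initialization with the best of ten k-means++ restarts. Keeping the run with the lowest inertia is the usual mitigation for points 1 and 5 above.

```python
# Sketch: K-Means sensitivity to initialization, using synthetic data (illustrative).
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=500, centers=4, cluster_std=1.5, random_state=42)

# Single random initialization: may settle in a poor local optimum.
single = KMeans(n_clusters=4, init="random", n_init=1, random_state=0).fit(X)

# Several k-means++ restarts: keeps the best of 10 runs by inertia.
restarted = KMeans(n_clusters=4, init="k-means++", n_init=10, random_state=0).fit(X)

print("inertia, single random init :", single.inertia_)
print("inertia, best of 10 restarts:", restarted.inertia_)  # usually <= the above
```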
Application Example:
Scenario: A retail company wants to segment its customer base to personalize marketing campaigns and improve
customer engagement.
Data: The company collects data on customer transactions, including purchase history, frequency of purchases, total
spending, and demographic information such as age, gender, and location.
Process:
1. Data Preprocessing:
- Normalize or standardize the data to ensure all features have the same scale.
2. Clustering:
- Choose the number of clusters (k) based on domain knowledge or using techniques like the elbow method or silhouette score.
- Apply the K-Means algorithm to partition the customers into k clusters based on their feature similarities.
- Iteratively update cluster centroids until convergence, minimizing the within-cluster sum of squared distances (a code sketch of steps 1-3 follows this example).
3. Interpretation:
- Analyze the characteristics of each cluster, such as average purchase amount, frequency of purchases, and
demographic composition.
- Assign meaningful labels to each cluster based on its distinctive traits (e.g., "High-Spending Customers,"
"Occasional Shoppers," "Young Urban Professionals").
4. Marketing Strategies:
- Tailor marketing campaigns and promotions to the specific needs and preferences of each customer segment.
- Develop targeted messaging and product offerings to maximize customer engagement and satisfaction.
5. Evaluation:
- Assess the effectiveness of the segmentation by measuring metrics like customer retention, conversion rates, and
revenue per segment.
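A sketch of steps 1-3 above, assuming scikit-learn and pandas; the customer table and its column names are hypothetical stand-ins for the transaction and demographic features described in the scenario:

```python
# Sketch: K-Means customer segmentation; the DataFrame and columns are hypothetical.
import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

# Hypothetical customer table: total spending, purchase count, and age per customer.
customers = pd.DataFrame({
    "total_spend": [120, 950, 40, 870, 60, 1100, 30, 640],
    "purchases":   [4, 22, 2, 18, 3, 25, 1, 15],
    "age":         [23, 41, 35, 38, 29, 45, 52, 33],
})

# 1. Preprocessing: standardize so all features contribute on the same scale.
X = StandardScaler().fit_transform(customers)

# 2. Clustering: try several k and keep the one with the best silhouette score.
scores = {k: silhouette_score(X, KMeans(n_clusters=k, n_init=10,
                                        random_state=0).fit_predict(X))
          for k in range(2, 5)}
best_k = max(scores, key=scores.get)

# 3. Interpretation: profile each segment by its average feature values.
customers["segment"] = KMeans(n_clusters=best_k, n_init=10,
                              random_state=0).fit_predict(X)
print(customers.groupby("segment").mean())
```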
Types of Regression:
1. Linear Regression:
- Description: Linear regression models the relationship between a dependent variable and one or more independent
variables by fitting a linear equation to the observed data.
- Usage: It's widely used for predicting continuous outcomes and understanding the relationship between variables.
2. Polynomial Regression:
- Description: Polynomial regression extends linear regression by fitting a polynomial equation to the data. It can
capture nonlinear relationships between variables.
- Usage: Useful when the relationship between variables is curvilinear rather than linear.
3. Ridge Regression:
- Description: Ridge regression is a regularized version of linear regression that adds a penalty term (L2 norm) to the
loss function, preventing overfitting.
- Usage: It's beneficial when dealing with multicollinearity (high correlation among predictors) and helps in stabilizing
parameter estimates.
4. Lasso Regression:
- Description: Lasso regression is another regularized regression method that adds a penalty term (L1 norm) to the
loss function. It encourages sparsity in the coefficients, effectively performing feature selection.
- Usage: Useful for feature selection when dealing with high-dimensional data with many predictors.
5. Elastic Net Regression:
- Description: Elastic Net regression combines both L1 and L2 penalties in the loss function. It provides a balance
between Ridge and Lasso regression, incorporating their strengths.
- Usage: Beneficial when dealing with datasets with multicollinearity and a large number of predictors.
6. Logistic Regression:
- Description: Despite its name, logistic regression is a classification algorithm used for binary classification tasks. It
models the probability of the binary outcome as a function of the independent variables using the logistic function.
- Usage: Widely used in various fields for binary classification tasks, such as predicting whether an email is spam or
not.
7. Poisson Regression:
- Description: Poisson regression models count data (integer-valued outcomes) by assuming that the dependent
variable follows a Poisson distribution.
- Usage: Suitable for analyzing count data, such as the number of occurrences of events in a fixed period.
Each type of regression has its unique characteristics and is suitable for different types of data and
modeling tasks. Choosing the appropriate regression technique depends on the nature of the data, the relationship
between variables, and the specific goals of the analysis.
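The following compact sketch (assuming scikit-learn; the synthetic data and hyperparameters are illustrative) fits each of the variants listed above so the API differences are visible side by side:

```python
# Sketch: fitting each regression variant on small synthetic datasets (illustrative).
import numpy as np
from sklearn.linear_model import (LinearRegression, Ridge, Lasso, ElasticNet,
                                  LogisticRegression, PoissonRegressor)
from sklearn.preprocessing import PolynomialFeatures
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))

# 1. Linear regression: continuous target with a linear relationship plus noise.
y_lin = 2.0 * X[:, 0] - 1.0 * X[:, 1] + rng.normal(scale=0.1, size=200)
print("Linear coefficients:", LinearRegression().fit(X, y_lin).coef_)

# 2. Polynomial regression: expand features to degree 2, then fit a linear model.
y_poly = X[:, 0] ** 2 + rng.normal(scale=0.1, size=200)
poly = make_pipeline(PolynomialFeatures(degree=2), LinearRegression()).fit(X, y_poly)
print("Polynomial R^2:", poly.score(X, y_poly))

# 3-5. Ridge (L2), Lasso (L1), Elastic Net (L1 + L2): regularized variants.
for model in (Ridge(alpha=1.0), Lasso(alpha=0.1), ElasticNet(alpha=0.1, l1_ratio=0.5)):
    model.fit(X, y_lin)
    print(type(model).__name__, "coefficients:", np.round(model.coef_, 3))

# 6. Logistic regression: binary target, models P(y=1 | X) with the logistic function.
y_bin = (X[:, 0] + X[:, 1] > 0).astype(int)
print("Logistic accuracy:", LogisticRegression().fit(X, y_bin).score(X, y_bin))

# 7. Poisson regression: non-negative integer counts.
y_count = rng.poisson(lam=np.exp(0.5 * X[:, 0] + 1.0))
print("Poisson D^2 score:", PoissonRegressor().fit(X, y_count).score(X, y_count))
```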
5. What is mutual information, and how is it used to evaluate clustering algorithms?
Mutual information is a measure of the amount of information that one random variable (e.g., a clustering result)
contains about another random variable (e.g., ground truth labels). It quantifies the degree of dependency between
the variables. In the context of clustering evaluation, mutual information is used to assess the similarity between a
clustering result and a ground truth partitioning (if available).
1. Entropy: Entropy measures the uncertainty or randomness in a random variable. Higher entropy indicates more
uncertainty.
2. Conditional Entropy: Conditional entropy measures the remaining uncertainty in one random variable given the
knowledge of another random variable.
3. Mutual Information: Mutual information quantifies the reduction in uncertainty of one random variable when the
other random variable is known. It's calculated as the difference between the entropy of the first variable and the
conditional entropy of the first variable given the second variable.
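A small numeric sketch of this relationship, MI(X; Y) = H(X) - H(X | Y), computed from empirical label frequencies; the two toy label vectors are illustrative:

```python
# Sketch: mutual information as entropy minus conditional entropy (toy labels).
from collections import Counter
from math import log2

pred  = [0, 0, 0, 1, 1, 1]   # clustering result (illustrative)
truth = [0, 0, 1, 1, 1, 1]   # ground truth labels (illustrative)
n = len(pred)

def entropy(labels):
    # H(X) = -sum p(x) * log2 p(x), from empirical label frequencies
    return -sum((c / len(labels)) * log2(c / len(labels))
                for c in Counter(labels).values())

# H(pred | truth): entropy of pred within each ground-truth class, weighted by class size.
h_cond = sum((sum(1 for t in truth if t == g) / n) *
             entropy([p for p, t in zip(pred, truth) if t == g])
             for g in set(truth))

mi = entropy(pred) - h_cond   # MI = H(X) - H(X | Y)
print(round(mi, 4))
```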
Use in Clustering Evaluation:
- Ground Truth Comparison: In clustering evaluation, if a ground truth partitioning of the data is available, mutual
information can be used to compare the clustering result obtained from an algorithm to the ground truth.
- Quantifying Agreement: A higher mutual information value indicates greater agreement between the clustering
result and the ground truth partitioning. It measures how much information about the ground truth labels is
captured by the clustering.
- Range of Values: Mutual information is non-negative with no fixed upper bound; for a given pair of labelings it is bounded by their entropies, which is why normalized variants such as NMI rescale it to the range [0, 1]. A value of 0 indicates no agreement between the clustering and the ground truth, while higher values indicate better agreement.
- Interpretation: A high mutual information score suggests that the clustering algorithm has successfully captured the
underlying structure of the data as represented by the ground truth labels.
In summary, mutual information is a useful metric for evaluating clustering algorithms, providing
a quantitative measure of the similarity between a clustering result and a ground truth partitioning, if available. It
helps assess the quality and accuracy of the clustering outcome.
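In practice the comparison is usually done with library implementations; the sketch below (assuming scikit-learn; the label vectors are illustrative) reports the raw, normalized, and chance-adjusted mutual information scores:

```python
# Sketch: evaluating a clustering against ground truth with MI-based scores (toy labels).
from sklearn.metrics import (mutual_info_score,
                             normalized_mutual_info_score,
                             adjusted_mutual_info_score)

truth = [0, 0, 0, 1, 1, 1, 2, 2]   # ground truth partition (illustrative)
pred  = [0, 0, 1, 1, 1, 1, 2, 2]   # clustering result (illustrative)

print("raw MI       :", mutual_info_score(truth, pred))             # >= 0, no fixed upper bound
print("normalized MI:", normalized_mutual_info_score(truth, pred))  # rescaled to [0, 1]
print("adjusted MI  :", adjusted_mutual_info_score(truth, pred))    # corrected for chance
```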
Purpose of Clustering Evaluation:
1. Assessing Algorithm Performance: Clustering evaluation helps determine how well a clustering algorithm
performs in partitioning the data into meaningful groups or clusters.
2. Comparing Algorithms: It facilitates the comparison of different clustering algorithms to identify the most suitable
one for a particular dataset or problem.
3. Validating Results: Clustering evaluation provides a means to validate the clustering results and ensure they align
with the underlying structure or patterns in the data.
4. Parameter Tuning: It aids in the selection of appropriate parameters for clustering algorithms, such as the number
of clusters (k), distance metrics, or linkage criteria.
5. Interpreting Results: Evaluation metrics help interpret the quality of the clustering outcome and provide insights
into the characteristics of the resulting clusters.
6. Informing Decision-Making: Clustering evaluation assists in making informed decisions about the usefulness and
reliability of the clustering results for downstream tasks or applications.
In summary, clustering evaluation serves the crucial purpose of assessing the performance, validity,
and reliability of clustering algorithms, enabling informed decision-making and continuous improvement in the field
of unsupervised learning.
1. What are the features of the KNN algorithm? What are its advantages and disadvantages?
Features of K-Nearest Neighbors (KNN) Algorithm:
1. Instance-Based Learning: KNN is an instance-based learning algorithm that does not involve explicit model training.
Instead, it stores all available training data points and makes predictions based on their similarity to the new data
point.
2. Non-Parametric Algorithm: KNN makes no assumptions about the underlying data distribution, making it suitable
for both linear and nonlinear relationships.
3. Simple Implementation: KNN is easy to understand and implement, making it accessible for beginners in machine
learning.
4. Flexibility in Choosing K: The choice of the number of nearest neighbors (K) allows for flexibility in balancing bias
and variance in the model.
5. Versatile: KNN can be used for both classification and regression tasks, making it applicable to a wide range of
problems.
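A minimal sketch of points 1-5, assuming scikit-learn and its bundled iris data: the same neighbor lookup drives classification (majority vote) and regression (neighbor average); k = 5 is an illustrative choice.

```python
# Sketch: KNN for classification and regression on small example data (k is illustrative).
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier, KNeighborsRegressor
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Classification: predict the majority class among the 5 nearest training points.
clf = KNeighborsClassifier(n_neighbors=5).fit(X_tr, y_tr)
print("classification accuracy:", clf.score(X_te, y_te))

# Regression: predict a continuous target (here, petal width from the other features)
# as the average of the 5 nearest neighbors' values.
reg = KNeighborsRegressor(n_neighbors=5).fit(X_tr[:, :3], X_tr[:, 3])
print("regression R^2:", reg.score(X_te[:, :3], X_te[:, 3]))
```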
Advantages of KNN:
1. No Training Phase: KNN does not require a training phase, which reduces computational overhead and makes it
efficient for online learning.
2. Non-Parametric: Its non-parametric nature allows it to handle complex data patterns without making strong
assumptions about the data distribution.
3. Interpretability: KNN's predictions are easy to interpret, as they are based on the majority vote (for classification)
or the average (for regression) of the nearest neighbors.
4. Adaptability to Local Structure: KNN adapts well to local data structures, making it robust to noisy data and
suitable for datasets with irregular boundaries.
5. Effective with Small Datasets: KNN performs well with small datasets, where other algorithms may suffer from
overfitting due to limited data.
Disadvantages of KNN:
1. Computational Complexity: KNN requires computing distances between the new data point and all training data
points, which can be computationally expensive for large datasets.
2. Sensitivity to Distance Metric: The choice of distance metric significantly affects the performance of KNN, and
selecting an appropriate metric can be challenging.
3. Imbalanced Data: KNN tends to favor majority classes in imbalanced datasets, leading to biased predictions.
4. Need for Proper Scaling: KNN is sensitive to the scale of features, so it's essential to scale the features
appropriately before applying the algorithm.
5. Memory Consumption: Storing all training data points in memory can be memory-intensive, especially for large
datasets with many dimensions.
In summary, while KNN offers simplicity, flexibility, and interpretability, its effectiveness depends on careful consideration of its limitations, such as computational complexity, sensitivity to the choice of distance metric, and handling of imbalanced data.
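As a closing sketch (assuming scikit-learn; the dataset and candidate K values are illustrative), two of the disadvantages above can be addressed directly by standardizing features inside a pipeline and selecting K by cross-validation:

```python
# Sketch: scaling features and tuning K via cross-validation (dataset is illustrative).
from sklearn.datasets import load_breast_cancer
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import GridSearchCV

X, y = load_breast_cancer(return_X_y=True)

# Scale first: raw feature ranges differ widely and would dominate the distances.
pipe = make_pipeline(StandardScaler(), KNeighborsClassifier())

# Search over K; the step name "kneighborsclassifier" is auto-generated by make_pipeline.
grid = GridSearchCV(pipe, {"kneighborsclassifier__n_neighbors": [3, 5, 7, 11, 15]}, cv=5)
grid.fit(X, y)
print("best K:", grid.best_params_, "cv accuracy:", round(grid.best_score_, 3))
```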