Unit 2 ML
Clustering in Machine Learning: Types of Clustering Method: Partitioning Clustering, Distribution Model-Based
Clustering, Hierarchical Clustering, Fuzzy Clustering. Birch Algorithm, CURE Algorithm. Gaussian Mixture Models and
Expectation Maximization. Parameter estimation – MLE, MAP. Applications of Clustering.
UNIT-III
Classification algorithms: Logistic Regression, Decision Tree Classification, Neural Network, K-Nearest Neighbors (K-
NN), Support Vector Machine, Naive Bayes (Gaussian, Multinomial, Bernoulli). Performance Measures: Confusion
Matrix, Classification Accuracy, Classification Report: Precision, Recall, F1 score and Support.
UNIT-IV
Ensemble Learning and Random Forest: Introduction to Ensemble Learning, Basic Ensemble Techniques (Max Voting,
Averaging, Weighted Average), Voting Classifiers, Bagging and Pasting, Out-of-Bag Evaluation, Random Patches and
Random Subspaces, Random Forests (Extra-Trees, Feature Importance), Boosting (AdaBoost, Gradient Boosting),
Stacking.
UNIT-V
Dimensionality Reduction: The Curse of Dimensionality, Main Approaches for Dimensionality Reduction (Projection,
Manifold Learning). PCA: Preserving the Variance, Principal Components, Projecting Down to d Dimensions, Explained
Variance Ratio, Choosing the Right Number of Dimensions, PCA for Compression, Randomized PCA, Incremental PCA.
Kernel PCA: Selecting a Kernel and Tuning Hyperparameters. Learning Theory: PAC and VC model.
# Clustering in Machine Learning
Clustering, or cluster analysis, is a machine learning technique that groups an unlabelled dataset. It can be defined as "a way of grouping data points into different clusters, each consisting of similar data points; the objects in a group share more similarities with one another than with objects in other groups."
It works by finding similar patterns in the unlabelled dataset, such as shape, size, color, or behavior, and dividing the data points according to the presence or absence of those patterns.
It is an unsupervised learning method, so no supervision is provided to the algorithm, and it deals with unlabelled data.
After applying a clustering technique, each cluster or group is assigned a cluster ID, which an ML system can use to simplify the processing of large and complex datasets.
The clustering technique is commonly used for statistical data analysis.
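As a minimal sketch of how a clustering algorithm assigns cluster IDs to unlabelled data (assuming scikit-learn is installed; the tiny dataset below is made up purely for illustration):

```python
import numpy as np
from sklearn.cluster import KMeans

# A tiny unlabelled dataset: two loose groups of 2-D points (made up for illustration).
X = np.array([[1.0, 2.0], [1.5, 1.8], [1.2, 2.2],
              [8.0, 8.0], [8.5, 7.8], [7.9, 8.3]])

# fit_predict returns one cluster ID per data point; downstream code can use these IDs.
labels = KMeans(n_clusters=2, n_init=10, random_state=42).fit_predict(X)
print(labels)  # e.g. [0 0 0 1 1 1]
```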
Note: Clustering is somewhat similar to classification, but the difference is the type of dataset being used. In classification, we work with a labeled dataset, whereas in clustering, we work with an unlabelled dataset.
Example: Let's understand the clustering technique with the real-world example of a shopping mall. When we visit a mall, we can observe that items with similar uses are grouped together: t-shirts are in one section, trousers in another, and in the fruit and vegetable section, apples, bananas, mangoes, etc. are kept in separate groups so that we can easily find what we are looking for. The clustering technique works in the same way. Another example of clustering is grouping documents by topic.
The clustering technique can be used in a wide variety of tasks. Some of the most common uses of this technique are:
o Market Segmentation
o Statistical data analysis
o Social network analysis
o Image segmentation
o Anomaly detection, etc.
Apart from these general uses, clustering is used by Amazon in its recommendation system to provide recommendations based on a user's past product searches. Netflix also uses this technique to recommend movies and web series to its users based on their watch history.
The diagram below illustrates the working of a clustering algorithm: the different fruits are divided into several groups with similar properties.
Partitioning Clustering
Partitioning clustering divides the data into a predefined number of non-overlapping groups; the K-means algorithm is the most common example.
Advantages:
Efficient for large datasets.
Easy to understand and implement.
Works well when clusters are globular and evenly sized.
Disadvantages:
The number of clusters (K) needs to be predefined.
Sensitive to initial centroids (can lead to poor local minima).
Assumes clusters are spherical and equally sized, which might not be the case in all datasets.
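To make the sensitivity to initial centroids concrete, here is a hedged sketch (assuming scikit-learn; the dataset and seed values are arbitrary) comparing a single random initialization of K-means against multiple restarts:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic data with 4 well-separated blobs (parameters chosen for illustration).
X, _ = make_blobs(n_samples=500, centers=4, cluster_std=0.8, random_state=0)

# A single random initialization may converge to a poor local minimum.
single = KMeans(n_clusters=4, init="random", n_init=1, random_state=3).fit(X)

# Ten restarts keep the run with the lowest inertia (within-cluster sum of squares).
multi = KMeans(n_clusters=4, init="random", n_init=10, random_state=3).fit(X)

print("inertia with 1 init  :", single.inertia_)
print("inertia with 10 inits:", multi.inertia_)  # usually no worse than the single run
```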
Density-Based Clustering
The density-based clustering method connects highly dense areas into clusters, so arbitrarily shaped clusters can be formed as long as the dense regions can be connected. The algorithm identifies regions of high density in the data space and joins them into clusters, with the dense areas separated from one another by sparser areas.
These algorithms can struggle to cluster the data points when the dataset has varying densities or a large number of dimensions.
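A minimal sketch of density-based clustering with DBSCAN (assuming scikit-learn; the eps and min_samples values are illustrative and would normally need tuning):

```python
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

# Two interleaved half-moons: non-spherical clusters that K-means handles poorly.
X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)

# eps is the neighbourhood radius; min_samples is the number of points needed
# in that neighbourhood to form a dense region.
labels = DBSCAN(eps=0.2, min_samples=5).fit_predict(X)

# Points labelled -1 are treated as noise; the rest form arbitrarily shaped clusters.
print("clusters found:", len(set(labels) - {-1}), "| noise points:", list(labels).count(-1))
```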
Hierarchical Clustering
Hierarchical clustering builds a tree of nested clusters (a dendrogram) by successively merging or splitting groups, so the number of clusters does not have to be fixed in advance.
Advantages:
Does not require the number of clusters to be specified in advance.
Provides a detailed view of data with a dendrogram that shows the merging/splitting process.
Can handle clusters of various shapes and sizes.
Disadvantages:
Computationally expensive, especially for large datasets.
Sensitive to noise and outliers.
May struggle with large-scale datasets as it involves repeated distance calculations.
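As a hedged sketch of agglomerative hierarchical clustering (assuming SciPy and scikit-learn are available; sizes and parameters are illustrative), the snippet below builds the linkage matrix that a dendrogram is drawn from and then cuts the tree into a flat clustering:

```python
from scipy.cluster.hierarchy import linkage, fcluster
from sklearn.datasets import make_blobs

# Small synthetic dataset (kept small because hierarchical clustering scales poorly).
X, _ = make_blobs(n_samples=60, centers=3, random_state=1)

# Bottom-up merging; 'ward' merges the pair that least increases within-cluster variance.
Z = linkage(X, method="ward")

# Cut the merge tree to obtain a flat clustering with 3 groups.
labels = fcluster(Z, t=3, criterion="maxclust")
print(labels[:10])

# scipy.cluster.hierarchy.dendrogram(Z) would plot the full merge tree (needs matplotlib).
```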
Fuzzy Clustering
Fuzzy clustering is a type of soft clustering in which a data object may belong to more than one group or cluster. Each data point has a set of membership coefficients that express its degree of membership in each cluster. The Fuzzy C-means algorithm is the classic example of this type of clustering; it is sometimes also known as the Fuzzy K-means algorithm.
Advantages:
Handles overlapping clusters and soft assignments.
Provides a more nuanced view of data by assigning data points to multiple clusters.
Disadvantages:
Computationally expensive, especially for large datasets.
Requires careful selection of the fuzziness parameter (degree of membership).
Can struggle with non-convex clusters.
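Since scikit-learn does not include fuzzy C-means, the sketch below is a minimal from-scratch NumPy implementation (the function name, the fuzziness value m = 2, and the toy data are my own illustrative choices):

```python
import numpy as np

def fuzzy_c_means(X, c=2, m=2.0, n_iter=100, seed=0):
    """Minimal fuzzy C-means: returns cluster centres and the soft membership matrix."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)           # each point's memberships sum to 1
    for _ in range(n_iter):
        W = U ** m                               # fuzzified memberships, shape (n, c)
        centres = (W.T @ X) / W.sum(axis=0)[:, None]
        # Distance of every point to every centre (small epsilon avoids division by zero).
        d = np.linalg.norm(X[:, None, :] - centres[None, :, :], axis=2) + 1e-10
        U = 1.0 / d ** (2.0 / (m - 1.0))         # closer centres get larger coefficients
        U /= U.sum(axis=1, keepdims=True)
    return centres, U

# Two overlapping toy groups of 2-D points.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 1.0, (50, 2)), rng.normal(3.0, 1.0, (50, 2))])
centres, U = fuzzy_c_means(X, c=2)
print(centres)           # approximate cluster centres
print(U[:3].round(2))    # soft memberships of the first three points (rows sum to 1)
```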
Clustering Algorithms
Clustering algorithms can be divided according to the cluster models explained above. Many different clustering algorithms have been published, but only a few are commonly used. The choice of algorithm depends on the kind of data being used: some algorithms require the number of clusters in the given dataset to be specified, whereas others work by finding the minimum distance between observations in the dataset.
Here we discuss the most popular clustering algorithms that are widely used in machine learning:
1. K-Means algorithm: The K-means algorithm is one of the most popular clustering algorithms. It partitions the dataset by dividing the samples into clusters of equal variance. The number of clusters must be specified in advance. It is fast and requires relatively little computation, with linear complexity O(n).
2. Mean-shift algorithm: The mean-shift algorithm tries to find dense areas in a smoothed density of data points. It is an example of a centroid-based model that works by updating candidate centroids to be the mean of the points within a given region.
3. DBSCAN Algorithm: DBSCAN stands for Density-Based Spatial Clustering of Applications with Noise. It is a density-based model similar to mean-shift, but with some notable advantages. In this algorithm, areas of high density are separated by areas of low density, so the clusters it finds can have any arbitrary shape.
4. Expectation-Maximization Clustering using GMM: This algorithm can be used as an alternative to K-means, or in cases where K-means fails. In a Gaussian Mixture Model (GMM), the data points are assumed to be generated from a mixture of Gaussian distributions (see the sketch after this list).
5. Agglomerative Hierarchical algorithm: The agglomerative hierarchical algorithm performs bottom-up hierarchical clustering. Each data point is treated as a single cluster at the outset, and clusters are then successively merged. The resulting cluster hierarchy can be represented as a tree structure.
6. Affinity Propagation: It differs from other clustering algorithms in that it does not require the number of clusters to be specified. Instead, pairs of data points exchange messages until convergence. Its O(N²T) time complexity (N points, T iterations) is the main drawback of this algorithm.
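Here is a hedged sketch of EM clustering with a Gaussian Mixture Model, as referenced in item 4 above (assuming scikit-learn; the number of components and the covariance type are illustrative choices):

```python
from sklearn.mixture import GaussianMixture
from sklearn.datasets import make_blobs

# Synthetic data with clusters of different spreads; in practice the number of
# components usually has to be chosen or tuned.
X, _ = make_blobs(n_samples=400, centers=3, cluster_std=[1.0, 2.0, 0.5], random_state=7)

# Fit a 3-component GMM with the EM algorithm; full covariances allow elliptical clusters.
gmm = GaussianMixture(n_components=3, covariance_type="full", random_state=7).fit(X)

hard_labels = gmm.predict(X)        # hard assignment: most likely component per point
soft_probs = gmm.predict_proba(X)   # soft assignment: per-component responsibilities
print(hard_labels[:10])
print(soft_probs[0].round(3))       # probabilities for the first point sum to 1
```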
Applications of Clustering
Below are some commonly known applications of the clustering technique in Machine Learning:
o In Identification of Cancer Cells: Clustering algorithms are widely used for identifying cancerous cells. They divide cancerous and non-cancerous data points into different groups.
o In Search Engines: Search engines also rely on the clustering technique. The search results shown are the objects closest to the search query, which is achieved by grouping similar data objects into one group kept far from dissimilar objects. The accuracy of a query's results depends on the quality of the clustering algorithm used.
o Customer Segmentation: Clustering is used in market research to segment customers based on their choices and preferences.
o In Biology: It is used in biology to classify different species of plants and animals using image recognition techniques.
o In Land Use: The clustering technique is used to identify areas of similar land use in a GIS database. This is very useful for determining the purpose for which a particular piece of land is most suitable.
# BIRCH Clustering
Clustering algorithms like K-means do not perform clustering very efficiently on large datasets when resources (such as memory or CPU time) are limited, so regular clustering algorithms do not scale well in terms of running time and quality as the size of the dataset increases. This is where BIRCH clustering comes in.
Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH) is a clustering algorithm that can cluster large datasets by first generating a small, compact summary of the dataset that retains as much information as possible. This smaller summary is then clustered instead of the full dataset. BIRCH is often used to complement other clustering algorithms by creating a summary of the dataset that the other algorithm can then work on. However, BIRCH has one major drawback: it can only process metric attributes, i.e., attributes whose values can be represented in Euclidean space, so no categorical attributes should be present.
Before we implement BIRCH, we must understand two important terms: the Clustering Feature (CF) and the CF tree.
Clustering Feature (CF): BIRCH summarizes large datasets into smaller, dense regions called Clustering Feature (CF) entries. Formally, a Clustering Feature entry is defined as an ordered triple (N, LS, SS), where N is the number of data points in the cluster, LS is the linear sum of the data points, and SS is the squared sum of the data points in the cluster. A CF entry may itself be composed of other CF entries.
CF Tree: The CF tree is the compact representation we have been speaking of so far. It is a height-balanced tree in which each leaf node contains sub-clusters. Every entry in an internal node holds a pointer to a child node and a CF entry equal to the sum of the CF entries in its child. Each sub-cluster in a leaf node must also satisfy a threshold requirement on its size (its radius), which controls how compact the summary is; this threshold reappears as a parameter below.
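To make the CF entry concrete, here is a small hedged sketch in plain NumPy (function names are my own) that builds an (N, LS, SS) triple, merges two entries by simple addition, and recovers the centroid and radius of the summarized points without the raw data:

```python
import numpy as np

def cf_entry(points):
    """Summarize a set of points as a Clustering Feature triple (N, LS, SS)."""
    points = np.asarray(points, dtype=float)
    return len(points), points.sum(axis=0), (points ** 2).sum()

def merge(cf1, cf2):
    """Two CF entries are merged by component-wise addition."""
    return cf1[0] + cf2[0], cf1[1] + cf2[1], cf1[2] + cf2[2]

def centroid_and_radius(cf):
    """Centroid and radius of a sub-cluster, computed from the summary alone."""
    n, ls, ss = cf
    centroid = ls / n
    radius = np.sqrt(max(ss / n - np.dot(centroid, centroid), 0.0))
    return centroid, radius

a = cf_entry([[1.0, 2.0], [2.0, 3.0]])
b = cf_entry([[1.5, 2.5]])
print(centroid_and_radius(merge(a, b)))  # summary of all three points, no raw data kept
```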
Parameters of the BIRCH algorithm (as exposed by scikit-learn's Birch class):
threshold : the maximum radius that a sub-cluster in a leaf node of the CF tree may have after a new sample is merged into it; if merging would exceed this radius, a new sub-cluster is started.
branching_factor : This parameter specifies the maximum number of CF sub-clusters in each node (internal
node).
n_clusters : The number of clusters to be returned after the entire BIRCH algorithm is complete i.e., number
of clusters after the final clustering step. If set to None, the final clustering step is not performed and
intermediate clusters are returned.
Implementation of BIRCH in Python: For the sake of this example, we will generate a dataset for clustering using scikit-learn's make_blobs() method. To learn more about make_blobs(), refer to: https://scikit-learn.org/stable/modules/generated/sklearn.datasets.make_blobs.html
Code: create 8 clusters from 600 randomly generated samples and then plot the results in a scatter plot.
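The code below is a hedged sketch of that example (assuming scikit-learn and matplotlib are installed; the threshold and branching_factor values are illustrative):

```python
import matplotlib.pyplot as plt
from sklearn.cluster import Birch
from sklearn.datasets import make_blobs

# Generate 600 samples spread across 8 blobs, as described above.
X, _ = make_blobs(n_samples=600, centers=8, cluster_std=0.75, random_state=42)

# Build the CF tree and run the final clustering step to return 8 clusters.
model = Birch(threshold=0.5, branching_factor=50, n_clusters=8)
labels = model.fit_predict(X)

# Scatter plot of the samples, coloured by the cluster BIRCH assigned to each.
plt.scatter(X[:, 0], X[:, 1], c=labels, cmap="rainbow", s=10)
plt.title("BIRCH clustering on make_blobs data")
plt.show()
```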