
Artificial Intelligence

Detecting Patterns with Unsupervised Learning

Unsupervised Learning

• Building machine learning models without using labeled training data


• Applications: market segmentation, stock markets, natural language processing,
and computer vision
• A large quantity of data exists without labels and needs to be categorized in some way
• This is the perfect use case for unsupervised learning
• Unsupervised learning algorithms attempt to classify data into subgroups within a given dataset using some similarity metric
• When we have a dataset without any labels, we assume that the data is generated by latent variables that govern the distribution in some way
• The process of learning can then proceed in a hierarchical manner, starting from the individual data points
• We can build deeper levels of representation of the data by finding natural clusters of similar points, and obtain signal and insight by classifying and segmenting the data
• Let's see some of the ways in which data can be classified using unsupervised
learning.
Clustering Data with the K-Means Algorithm

• One of the most popular unsupervised learning techniques; it analyzes data and finds clusters (subgroups) using a similarity measure such as the Euclidean distance
• The similarity measure can be used to estimate the tightness of a cluster
• Clustering is the process of organizing data into subgroups whose elements are similar to each other
• The goal of the algorithm is to identify the intrinsic properties of data points that make them belong to the
same subgroup
• There is no universal similarity metric that works in all cases
• For example, we might be interested in finding the representative data point for each subgroup, or we
might be interested in finding the outliers in the data
• Depending on the situation, different metrics might be more appropriate than others
• The K-Means algorithm is a well-known algorithm for clustering data.
• The data is segmented into K subgroups using various data attributes.
• The number of clusters is fixed, and the data is classified based on that number.
• The main idea here is that we need to update the locations of the centroids with each iteration.
• A centroid is the location representing the center of the cluster.
• We continue iterating until we have placed the centroids at their optimal locations.
• We can see that the initial placement of centroids plays an important role in the algorithm.
• These centroids should be placed in a clever manner, because this directly impacts the results.
• A good strategy is to place them as far away from each other as possible.
Clustering Data with the K-Means Algorithm

• The basic K-Means algorithm places these centroids randomly, whereas K-Means++ chooses these points algorithmically from the input data points.
• It tries to place the initial centroids far from each other so that they converge quickly.
• We then go through the training dataset and assign each data point to the closest centroid.
• Once we go through the entire dataset, the first iteration is over. The points have
been grouped based on the initialized centroids.
• The location of the centroids is recalculated based on the new clusters that were obtained at the end of the
first iteration.
• Once a new set of K centroids is obtained, the process is repeated.
• We iterate through the dataset and assign each point to the closest centroid.
• As the steps keep on getting repeated, the centroids keep moving to their
equilibrium position.
• After a certain number of iterations, the centroids do not change their locations anymore.
• The centroids converge to a final location.
• These K centroids are the values that will be used for inference.
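• The assignment-and-update loop described above can be sketched in a few lines of NumPy. This is an illustrative sketch only (the next slide uses scikit-learn's KMeans instead); the function name kmeans_iteration is made up for this slide, and it assumes no cluster ends up empty.

import numpy as np

def kmeans_iteration(X, centroids):
    # Assign each data point to its closest centroid (Euclidean distance)
    distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    labels = distances.argmin(axis=1)
    # Recompute each centroid as the mean of the points assigned to it
    new_centroids = np.array([X[labels == k].mean(axis=0) for k in range(len(centroids))])
    return new_centroids, labels

• Repeating this step until the centroids stop moving gives the converged centroids used for inference.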
Application of K-Means Clustering on Two-Dimensional Data

import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn import metrics
X = np.loadtxt('data_clustering.txt', delimiter=',') # Load input data
num_clusters = 5
plt.figure() # Plot input data
plt.scatter(X[:,0], X[:,1], marker='o', facecolors='none', edgecolors='black', s=80)
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
plt.title('Input data')
plt.xlim(x_min, x_max)
plt.ylim(y_min, y_max)
plt.xticks(())
plt.yticks(())
kmeans = KMeans(init='k-means++', n_clusters=num_clusters, n_init=10)
kmeans.fit(X) # Train the KMeans clustering model
step_size = 0.01 # Step size of the mesh
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
x_vals, y_vals = np.meshgrid(np.arange(x_min, x_max, step_size),np.arange(y_min, y_max, step_size))
output = kmeans.predict(np.c_[x_vals.ravel(), y_vals.ravel()])
Plot All Output Values and Color Each Region

output = output.reshape(x_vals.shape)
plt.figure()
plt.clf()
plt.imshow(output, interpolation='nearest',extent=(x_vals.min(), x_vals.max(),y_vals.min(), y_vals.max()),
cmap=plt.cm.Paired, aspect='auto', origin='lower')
plt.scatter(X[:,0], X[:,1], marker='o', facecolors='none',edgecolors='black', s=80)
cluster_centers = kmeans.cluster_centers_
plt.scatter(cluster_centers[:,0], cluster_centers[:,1],
marker='o', s=210, linewidths=4, color='black', zorder=12, facecolors='black')
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
plt.title('Boundaries of clusters')
plt.xlim(x_min, x_max)
plt.ylim(y_min, y_max)
plt.xticks(())
plt.yticks(())
plt.show()

Figures: visualization of the input data and the K-Means cluster boundaries


Estimating the Number of Clusters with the Mean Shift Algorithm

• Mean Shift is a powerful nonparametric algorithm used in unsupervised learning for clustering because
it does not make any assumptions about the underlying distributions
• Mean Shift finds a lot of applications in fields such as object tracking and real-time data analysis
• In the Mean Shift algorithm, the whole feature space is considered as a probability
density function.
• We start with the training dataset and assume that it has been sampled from a probability density
function
• In this framework, the clusters correspond to the local maxima of the underlying distribution.
• If there are K clusters, then there are K peaks in the underlying data distribution and Mean Shift will
identify those peaks
• The goal of Mean Shift is to identify the location of centroids
• For each data point in the training dataset, it defines a window around it
• It then computes the centroid for this window and updates the location to this new centroid
• It then repeats the process for this new location by defining a window around it
• As we keep doing this, we move closer to the peak of the cluster
• Each data point will move towards the cluster it belongs to
• The movement is towards a region of higher density
• The centroids (also called means) keep on getting shifted towards the peaks of each cluster
• The algorithm gets its name from the fact that the means keep getting shifted
• The shift continues to happen until the algorithm converges, at which stage the centroids don't move
anymore.
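• As a rough illustration of the window update just described, here is a minimal sketch of a single Mean Shift step with a flat (window) kernel. It is a simplified sketch under those assumptions, not the scikit-learn implementation used on the next slide, and mean_shift_step is a made-up name.

import numpy as np

def mean_shift_step(location, X, bandwidth):
    # Collect the data points that fall inside the window around the current location
    in_window = X[np.linalg.norm(X - location, axis=1) < bandwidth]
    # Shift the location to the mean (centroid) of those points
    return in_window.mean(axis=0)

• Repeating this step for each starting point moves it towards a density peak; points that converge to the same peak form one cluster.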
Estimating the Number of Clusters with the Mean Shift Algorithm

import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import MeanShift, estimate_bandwidth
from itertools import cycle
X = np.loadtxt('data_clustering.txt', delimiter=',') # Load data from input file
bandwidth_X = estimate_bandwidth(X, quantile=0.1, n_samples=len(X)) # Estimate the bandwidth of X
meanshift_model = MeanShift(bandwidth=bandwidth_X, bin_seeding=True) # Cluster data with MeanShift
meanshift_model.fit(X)
cluster_centers = meanshift_model.cluster_centers_
print('\nCenters of clusters:\n', cluster_centers)
labels = meanshift_model.labels_ # Estimate the number of clusters
num_clusters = len(np.unique(labels))
print("\nNumber of clusters in input data =", num_clusters)
plt.figure() # Plot the points and cluster centers
markers = 'o*xvs'
for i, marker in zip(range(num_clusters), markers):
    # Plot points belonging to the current cluster
    plt.scatter(X[labels==i, 0], X[labels==i, 1], marker=marker, color='black')
    # Plot the cluster center
    cluster_center = cluster_centers[i]
    plt.plot(cluster_center[0], cluster_center[1], marker='o', markerfacecolor='black',
             markeredgecolor='black', markersize=15)
plt.title('Clusters')
plt.show()
Estimating the Number of Clusters with the Mean Shift Algorithm
Estimating the Quality of Clustering with Silhouette Scores
• If data is naturally organized into several distinct clusters, then it is easy to visually examine it and draw some
inferences
• This is rarely the case in the real world, unfortunately
• Data in the real world is huge and messy. So, we need a way to quantify the quality of the clustering
• Silhouette refers to a method used to check the consistency of clusters in data
• It gives an estimate of how well each data point fits with its cluster
• The silhouette score is a metric that measures the similarity of a data point to its own cluster, as compared to other
clusters
• The silhouette score works with any similarity metric
• For each data point, the silhouette score is computed using the following formula:
silhouette score = (p – q) / max(p, q)
• Here, p is the mean distance to the points in the nearest cluster that the data point is not a part of, and q is the mean
intra-cluster distance to all the points in its own cluster
• The value of the silhouette score lies between -1 and 1
• A score closer to 1 indicates that the data point is very similar to the other data points in its cluster, whereas a score closer to -1 indicates that it is not similar to the other data points in its cluster
• One way to think about it is if there are too many points with negative silhouette scores, then there may be
too few or too many clusters in the data
• We need to run the clustering algorithm again to find the optimal number of clusters
• Ideally, we want to have a high positive value
• Depending on the business problem, we do not always need to optimize for the highest possible value, but in general, a silhouette score close to 1 indicates that the data clustered nicely
• If the scores are close to -1, it indicates that the variable that we are using to classify is noisy and does not
contain much of a signal
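• As a small worked example of the formula above (the distances here are made up purely for illustration):

p = 4.0   # mean distance to the points of the nearest cluster this point does not belong to
q = 1.5   # mean distance to the points of this point's own cluster
score = (p - q) / max(p, q)
print(score)  # 0.625 -> the point fits its own cluster reasonably well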
Estimating the Quality of Clustering with Silhouette Scores
import numpy as np
import matplotlib.pyplot as plt
from sklearn import metrics
from sklearn.cluster import KMeans
X = np.loadtxt('data_quality.txt', delimiter=',') # Load data from input file
scores = [] # Initialize variables
values = np.arange(2, 10)
for num_clusters in values: # Iterate through the defined range
    kmeans = KMeans(init='k-means++', n_clusters=num_clusters, n_init=10)
    kmeans.fit(X)
    score = metrics.silhouette_score(X, kmeans.labels_, metric='euclidean', sample_size=len(X))
    print("\nNumber of clusters =", num_clusters)
    print("Silhouette score =", score)
    scores.append(score)
plt.figure()
plt.bar(values, scores, width=0.7, color='black', align='center')
plt.title('Silhouette score vs number of clusters')
num_clusters = np.argmax(scores) + values[0] # Extract best score and optimal number of clusters
print('\nOptimal number of clusters =', num_clusters)
plt.figure()
plt.scatter(X[:,0], X[:,1], color='black', s=80, marker='o', facecolors='none')
x_min, x_max = X[:, 0].min() - 1, X[:, 0].max() + 1
y_min, y_max = X[:, 1].min() - 1, X[:, 1].max() + 1
plt.title('Input data')
plt.xlim(x_min, x_max)
plt.ylim(y_min, y_max)
plt.xticks(())
plt.yticks(())
plt.show()
Estimating the Quality of Clustering with Silhouette Scores
Gaussian Mixture Models
• A Mixture Model is a type of probability density model where it is assumed that the data is governed by several
component distributions.
• If these distributions are Gaussian, then the model becomes a Gaussian Mixture Model
• These component distributions are combined in order to provide a multi-modal density function, which becomes a
mixture model
– We want to model the shopping habits of all the people in South America.
– One way to do it would be to model the whole continent and fit everything into a single model, but
people in different countries shop differently
– We therefore need to understand how people in individual countries shop and how they behave
– To get a good representative model, we need to account for all the variations within the continent
• In this case, we can use mixture models to model the shopping habits of individual countries and then combine all of
them into a Mixture Model
• This way, nuances in the underlying behavior of individual countries are not missed; by not enforcing a single model on all of the countries, a more accurate model is created
• An interesting point to note is that mixture models are semi-parametric, which means that they are partially
dependent on a set of predefined functions.
• They can provide greater precision and flexibility in modeling the underlying distributions of the data.
• They can smooth the gaps that result from having sparse data
• Once the function is defined, the mixture model goes from being semi-parametric to parametric.
• Hence a GMM is a parametric model represented as a weighted summation of component Gaussian functions.
• We assume that the data is being generated by a set of Gaussian models that are combined in some way
• GMMs are very powerful and are used in many fields.
• The parameters of the GMM are estimated from training data using algorithms like Expectation–Maximization
(EM) or Maximum A-Posteriori (MAP) estimation
• Applications include image database retrieval, modeling stock market fluctuations, biometric verification
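• To make the "weighted summation of component Gaussians" concrete, here is a minimal one-dimensional sketch. The weights, means, and standard deviations are made-up illustrative values, not estimates from data, and SciPy is assumed to be available.

import numpy as np
from scipy.stats import norm

weights = np.array([0.3, 0.7])   # mixing weights, must sum to 1
means = np.array([-2.0, 3.0])
stds = np.array([0.8, 1.5])

def gmm_density(x):
    # p(x) = sum_k w_k * N(x | mu_k, sigma_k^2)
    return sum(w * norm.pdf(x, loc=m, scale=s) for w, m, s in zip(weights, means, stds))

print(gmm_density(0.0))

• In practice the weights, means, and covariances are not fixed by hand but estimated from training data (for example with EM), as the next slide does with scikit-learn's GaussianMixture.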
Building a Classifier Based on Gaussian Mixture Models
import numpy as np
import matplotlib.pyplot as plt
from matplotlib import patches
from sklearn import datasets
from sklearn.mixture import GaussianMixture
from sklearn.model_selection import StratifiedKFold
from sklearn.model_selection import train_test_split
iris = datasets.load_iris() # Load the iris dataset
X, y = datasets.load_iris(return_X_y=True)
skf = StratifiedKFold(n_splits=5) # Stratified splitter (set up here but not used below)
skf.get_n_splits(X, y)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.4, random_state=0) # 60/40 split
num_classes = len(np.unique(y_train))
# Build the GMM classifier; initialize each component's mean with the per-class mean
# of the training data (means_init is used so fit() takes these initial means into account)
classifier = GaussianMixture(n_components=num_classes, covariance_type='full',
    init_params='kmeans', max_iter=20,
    means_init=np.array([X_train[y_train == i].mean(axis=0) for i in range(num_classes)]))
classifier.fit(X_train)
plt.figure()
colors = 'bgr'
for i, color in enumerate(colors):
    # Eigen-decomposition of the 2x2 covariance sub-matrix of the first two features
    eigenvalues, eigenvectors = np.linalg.eigh(classifier.covariances_[i][:2, :2])
    # Compute the orientation of the ellipse
    norm_vec = eigenvectors[0] / np.linalg.norm(eigenvectors[0])
    angle = np.arctan2(norm_vec[1], norm_vec[0])
    angle = 180 * angle / np.pi
    # Scale the eigenvalues so the ellipse is visible on the plot
    scaling_factor = 8
    eigenvalues *= scaling_factor
Building a Classifier Based on Gaussian Mixture Models (continued)

    # Draw the ellipse for this component (angle is passed as a keyword argument)
    ellipse = patches.Ellipse(classifier.means_[i, :2], eigenvalues[0], eigenvalues[1],
        angle=180 + angle, color=color)
    axis_handle = plt.subplot(1, 1, 1)
    ellipse.set_clip_box(axis_handle.bbox)
    ellipse.set_alpha(0.6)
    axis_handle.add_artist(ellipse)

colors = 'bgr' # Plot the data
for i, color in enumerate(colors):
    cur_data = iris.data[iris.target == i]
    plt.scatter(cur_data[:,0], cur_data[:,1], marker='o', facecolors='none',
        edgecolors='black', s=40, label=iris.target_names[i])
    test_data = X_test[y_test == i]
    plt.scatter(test_data[:,0], test_data[:,1], marker='s',
        facecolors='black', edgecolors='black', s=40,
        label=iris.target_names[i])
y_train_pred = classifier.predict(X_train)
accuracy_training = np.mean(y_train_pred.ravel() == y_train.ravel()) *100
print('Accuracy on training data =', accuracy_training)
y_test_pred = classifier.predict(X_test)
accuracy_testing = np.mean(y_test_pred.ravel() == y_test.ravel()) *100
print('Accuracy on testing data =', accuracy_testing)
plt.title('GMM classifier')
plt.xticks(())
plt.yticks(())
plt.show()
Building a Classifier Based on Gaussian Mixture Models (Results)

• Accuracy on training data = 87.5


• Accuracy on testing data = 86.6666666667
Finding Subgroups in the Stock Market Using Affinity Propagation

• Affinity Propagation is a clustering algorithm that doesn't require the number of clusters to be specified beforehand
• Because of its generic nature and simplicity of implementation, it has found a lot of applications in many
fields.
• It finds out representative clusters, called exemplars, using message passing
• It starts by specifying the measures of similarity that need to be considered
• It simultaneously considers all training data points as potential exemplars
• It then passes messages between the data points until it finds a set of exemplars
• The message passing happens in two alternate steps, called responsibility and
availability.
• Responsibility refers to the message sent from members of the cluster to candidate exemplars, indicating
how well suited the data point would be as a member of this exemplar's cluster
• Availability refers to the message sent from candidate exemplars to potential members of the cluster,
indicating how well suited it would be as an exemplar
• It keeps doing this until the algorithm converges on an optimal set of exemplars
• There is also a parameter called preference, which controls the number of exemplars
that will be found
• If a high value is chosen, it will cause the algorithm to find too many clusters
• If a low value is chosen, it will lead to a small number of clusters
• An optimal value would be the median similarity between the points.
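• A minimal sketch of how the preference parameter can be passed to scikit-learn's AffinityPropagation, using toy random data rather than the stock example that follows (the data and the value -10 are made up for illustration):

import numpy as np
from sklearn.cluster import AffinityPropagation

X = np.random.RandomState(0).randn(60, 2)
# A more negative preference yields fewer exemplars; if left unset,
# scikit-learn uses the median of the pairwise similarities
model = AffinityPropagation(preference=-10, random_state=0).fit(X)
print('Number of clusters found:', len(model.cluster_centers_indices_))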
Finding Subgroups in the Stock Market Using Affinity Propagation
import datetime
import json
import numpy as np
import matplotlib.pyplot as plt
from sklearn import covariance, cluster
import yfinance as yf
input_file = 'company_symbol_mapping.json'
with open(input_file, 'r') as f: company_symbols_map = json.loads(f.read())
symbols, names = np.array(list(company_symbols_map.items())).T
start_date = datetime.datetime(2019, 1, 1) # Load the historical stock quotes
end_date = datetime.datetime(2019, 1, 31)
quotes = [yf.Ticker(symbol).history(start=start_date, end=end_date) for symbol in symbols]
opening_quotes = np.array([quote.Open for quote in quotes]).astype(float) # np.float is removed in recent NumPy
closing_quotes = np.array([quote.Close for quote in quotes]).astype(float)
quotes_diff = closing_quotes - opening_quotes
X = quotes_diff.copy().T
X /= X.std(axis=0)
edge_model = covariance.GraphicalLassoCV() # GraphLassoCV was renamed to GraphicalLassoCV in newer scikit-learn
with np.errstate(invalid='ignore'):
    edge_model.fit(X)
_, labels = cluster.affinity_propagation(edge_model.covariance_)
num_labels = labels.max()
print('\nClustering of stocks based on difference in opening and closing quotes:\n')
for i in range(num_labels + 1):
    print("Cluster", i+1, "==>", ', '.join(names[labels == i]))
Finding Subgroups in the Stock Market Using Affinity Propagation (Results)
Segmenting the Market Based on Shopping Patterns

import csv
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import MeanShift, estimate_bandwidth
input_file = 'sales.csv'
file_reader = csv.reader(open(input_file, 'r'), delimiter=',')
X = []
for count, row in enumerate(file_reader):
    if not count:
        names = row[1:]
        continue
    X.append([float(x) for x in row[1:]])
X = np.array(X) # Convert to numpy array
bandwidth = estimate_bandwidth(X, quantile=0.8, n_samples=len(X))
meanshift_model = MeanShift(bandwidth=bandwidth, bin_seeding=True)
meanshift_model.fit(X)
labels = meanshift_model.labels_
cluster_centers = meanshift_model.cluster_centers_
num_clusters = len(np.unique(labels))
print("\nNumber of clusters in input data =", num_clusters)
print("\nCenters of clusters:")
print('\t'.join([name[:3] for name in names]))
for cluster_center in cluster_centers:
    print('\t'.join([str(int(x)) for x in cluster_center]))
Segmenting the Market Based on Shopping Patterns

cluster_centers_2d = cluster_centers[:, 1:3] # Extract two features of the cluster centers for 2D visualization
plt.figure() # Plot the cluster centers
plt.scatter(cluster_centers_2d[:,0], cluster_centers_2d[:,1],s=120, edgecolors='black', facecolors='none')
offset = 0.25
plt.xlim(cluster_centers_2d[:,0].min() - offset * cluster_centers_2d[:,0].ptp(),
         cluster_centers_2d[:,0].max() + offset * cluster_centers_2d[:,0].ptp())
plt.ylim(cluster_centers_2d[:,1].min() - offset * cluster_centers_2d[:,1].ptp(),
         cluster_centers_2d[:,1].max() + offset * cluster_centers_2d[:,1].ptp())
plt.title('Centers of 2D clusters')
plt.show()
