CSE 3121 Information Visualization R Studio All Codes

Uploaded by

Dhaarani Pushpam

CSE 3121 Information Visualization

Name: V. Yuthish Kumar
Reg. No.: 21MIA1023

Lab Assignment 2 - Association Rule Mining in R

The Apriori algorithm is used to perform association rule mining on the "Adult" dataset.

Support measures how frequently a particular itemset occurs in the dataset. A higher support value means that the itemset occurs more often.

Confidence measures the reliability of an association rule A => B. A high confidence value indicates that the consequent item (B) is highly likely to occur given that the antecedent item (A) occurs.
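
To make the two measures concrete, here is a minimal base-R sketch that computes them by hand. The toy transactions and the helper name `contains` are made up for illustration and are not part of the Adult dataset or the arules package.

```r
# Toy transactions (hypothetical data, for illustration only)
transactions <- list(
  c("bread", "milk"),
  c("bread", "butter"),
  c("bread", "milk", "butter"),
  c("milk"),
  c("bread", "milk")
)

# TRUE for each transaction that contains every item in `items`
contains <- function(items) {
  sapply(transactions, function(tx) all(items %in% tx))
}

# support(A and B): fraction of transactions containing both items
support_ab <- mean(contains(c("bread", "milk")))

# support(A): fraction of transactions containing the antecedent
support_a <- mean(contains("bread"))

# confidence(A => B) = support(A and B) / support(A)
confidence_ab <- support_ab / support_a

support_ab     # 0.6  (3 of 5 transactions)
confidence_ab  # 0.75 (3 of the 4 transactions that contain bread)
```
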

Code:

library(arules)
# Load the Adult dataset
data("Adult")

# Explore the Adult dataset

summary(Adult)
inspect(head(Adult)) # showing few rows from the dataset

# Perform association rule mining using the Apriori algorithm.
# confidence is not set here, so apriori() uses its default minimum of 0.8.

rules <- apriori(Adult, parameter = list(support = 0.01, minlen = 2))

# Explore the generated rules

summary(rules)
inspect(head(rules))

Output:
Inference:
A support value of about 0.01 indicates that (education=Doctorate) and (race=White) appear together in about 1 percent of the rows of the dataset. The dataset contains 48842 transactions, so this combination appears 493 times (493 / 48842 is roughly 0.01). Other combinations can be interpreted in the same way.

Confidence indicates the reliability of an association rule. A high confidence value such as 0.96 indicates that when (education=5th-6th) appears, (capital-loss=None) also appears most of the time.
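
Reading rules off the first few rows can miss the strongest ones. The sketch below, which assumes the arules package is installed, mines the same rules again and then sorts them by confidence before inspecting them. The variable name `top_rules` and the choice of five rules are illustrative assumptions, not part of the original assignment.

```r
library(arules)
data("Adult")

# Same mining step as in the assignment, with progress output silenced
rules <- apriori(Adult,
                 parameter = list(support = 0.01, minlen = 2),
                 control = list(verbose = FALSE))

# Sort by confidence (highest first) and keep the top five rules
top_rules <- head(sort(rules, by = "confidence"), 5)
inspect(top_rules)

# The quality measures are also available as a plain data frame
quality(top_rules)
```
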

Lab Assignment 3 - Clustering in R

We perform K-Means clustering in R on the USArrests dataset.
Code:
# Load the necessary libraries
library(ggplot2)

# Read the data
df <- USArrests
View(df)  # opens the data viewer (interactive sessions such as RStudio)

# Perform k-means clustering on the Murder and Assault columns
set.seed(123)  # arbitrary seed so that the cluster labels are reproducible
km <- kmeans(df[, c("Murder", "Assault")], centers = 3, nstart = 25)
km

# Add the cluster assignment to the original dataset
df$cluster <- as.factor(km$cluster)

# Visualize the clusters
ggplot(df, aes(x = Murder, y = Assault, color = cluster)) +
  geom_point() +
  labs(title = "Visualization of Clusters on USArrests Dataset",
       x = "Murder Rate",
       y = "Assault Rate")

Output:
These are some of the rows of the USArrests dataset.

We perform clustering using only the Murder and Assault columns, with the number of clusters set to 3.
Here we group the US states into clusters based on the murder and assault arrest rates of each state.
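
Assault values are much larger than Murder values, so the distances used by k-means on the raw columns are driven mostly by Assault. One common option, which is not what the code above does, is to standardize both columns first. The sketch below shows that variant. The names `scaled` and `km_scaled` and the seed are assumptions made for illustration.

```r
# Standardize the two columns so each has mean 0 and standard deviation 1
scaled <- scale(USArrests[, c("Murder", "Assault")])

set.seed(42)  # arbitrary seed, chosen only for reproducibility
km_scaled <- kmeans(scaled, centers = 3, nstart = 25)

# Number of states in each cluster
table(km_scaled$cluster)

# Cluster centers, expressed in standardized units
km_scaled$centers
```

Comparing `table(km_scaled$cluster)` with the cluster sizes from the unscaled run is a quick way to see how much the scaling changes the grouping.
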

Lab Assignment 4 - Visualisation and Statistics in R

Here we apply various visualization techniques and perform some statistical calculations.

Code:

# Data definition
data<-c(19, 23, 11, 5, 16, 21, 32, 14, 19, 27, 39,32,20,21)

# Calculate median
median_value <- median(data)
median_value
# Calculate mean
mean_value <- mean(data)
mean_value
# Calculate the sample standard deviation
deviation <- sd(data)
deviation
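
As a check on what these built-in functions compute, the following sketch reproduces the three statistics by hand for the same data. The `manual_*` names are introduced here for illustration only.

```r
data <- c(19, 23, 11, 5, 16, 21, 32, 14, 19, 27, 39, 32, 20, 21)
n <- length(data)

# Mean: sum of the values divided by their count
manual_mean <- sum(data) / n

# Median: with an even count, the average of the two middle sorted values
sorted <- sort(data)
manual_median <- (sorted[n / 2] + sorted[n / 2 + 1]) / 2

# Sample standard deviation: note the n - 1 in the denominator
manual_sd <- sqrt(sum((data - manual_mean)^2) / (n - 1))

# Each comparison should be TRUE
all.equal(manual_mean, mean(data))
all.equal(manual_median, median(data))
all.equal(manual_sd, sd(data))
```
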

Histogram:
hist(data, xlab = "Ages of Students", col = "brown", border = "black")

Bar plot:
H <- c(23, 32, 28, 39, 41)
M <- c("John", "Julie", "Jackie", "Sam", "Robert")
barplot(H, names.arg = M, xlab = "Name", ylab = "Age", col = "blue",
        main = "Ages")

Pie chart:
x <- c(21, 30, 34, 15)
labels <- c("Chennai", "Hyderabad", "Kerala", "Mumbai")
# Plot the chart
pie(x, labels, main = "Percentage of Unemployment per City")
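
A pie chart is easier to read when each slice shows its share. The sketch below is one possible way to build percentage labels for the same data. The names `pct` and `pct_labels` are assumptions introduced for this example.

```r
x <- c(21, 30, 34, 15)
labels <- c("Chennai", "Hyderabad", "Kerala", "Mumbai")

# Share of each slice as a whole-number percentage
pct <- round(100 * x / sum(x))

# Build labels such as "Chennai 21%"
pct_labels <- paste0(labels, " ", pct, "%")

pie(x, labels = pct_labels, main = "Percentage of Unemployment per City")
```
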

Lab Assignment 5 - Market Basket Analysis

Market basket analysis is one of the main applications of association rule mining. Here the Apriori algorithm is used to perform market basket analysis in R on the Groceries dataset.

Market Basket Analysis (MBA) is a data mining technique used to discover associations between different items purchased together in transactions. It is widely applied in the retail and e-commerce industries to understand customer purchasing behavior and to optimize product placement.
Code:
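library(arules)

# Load the Groceries dataset
data("Groceries")

# data() creates an object named Groceries; keep the name used below
transactions <- Groceries

# Explore the transactions
summary(transactions)
inspect(head(transactions))

# Mine rules with minimum support 0.001 and minimum confidence 0.5
rules <- apriori(transactions, parameter = list(support = 0.001, confidence = 0.5))

# Show the first few rules
inspect(head(rules))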
Output:

Inference:
A support value of about 0.0037 indicates that cereals and whole milk appear together in about 0.37 percent of the rows of the dataset. The dataset contains 9835 transactions, so this combination appears 36 times.

Confidence indicates the reliability of an association rule. A confidence value such as 0.64 indicates that when cereals are bought, there is a good chance that whole milk is bought as well.

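
The relationship between count, support and confidence can be checked with a few lines of arithmetic. This sketch uses only the figures quoted in the inference (9835 transactions, 36 joint occurrences, confidence 0.64). The variable names are illustrative, and the antecedent count is an estimate because the quoted confidence is rounded.

```r
n_transactions <- 9835  # total number of transactions
rule_count <- 36        # transactions containing both cereals and whole milk
confidence <- 0.64      # confidence of the rule, rounded as quoted

# support = count / total number of transactions
support <- rule_count / n_transactions
round(support, 4)  # 0.0037

# confidence = count(A and B) / count(A),
# so count(A) is approximately count(A and B) / confidence
antecedent_count <- rule_count / confidence
round(antecedent_count)  # roughly 56 transactions contain cereals
```
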
Lab Assignment 6 - Decision Tree in R

We use the Iris dataset to build and plot a decision tree with the rpart library.

Code:
# Load the dataset
data(iris)
View(iris)

# Explore the structure of the dataset
str(iris)

# Train the decision tree model: predict Species from all other columns
library(rpart)
model <- rpart(Species ~ ., data = iris, method = "class")

# Plot the decision tree
library(rpart.plot)
rpart.plot(model)
Output:

With the help of the decision tree, we can identify the species of a flower from its petal length and petal width.
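
Beyond plotting, a fitted tree can be checked by comparing its predictions with the known species. The following is a minimal sketch, assuming a classification tree fitted with the formula `Species ~ .`. It evaluates on the training data only, so the accuracy it reports is optimistic. The names `pred`, `conf_matrix` and `accuracy` are introduced here for illustration.

```r
library(rpart)

# Fit a classification tree predicting Species from the four measurements
model <- rpart(Species ~ ., data = iris, method = "class")

# Predict the species for the same rows the tree was trained on
pred <- predict(model, newdata = iris, type = "class")

# Confusion matrix: rows are true species, columns are predictions
conf_matrix <- table(actual = iris$Species, predicted = pred)
conf_matrix

# Training accuracy (optimistic, since no data was held out)
accuracy <- sum(diag(conf_matrix)) / sum(conf_matrix)
accuracy
```
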
