0% found this document useful (0 votes)

6 views7 pages

Bi 5

Uploaded by

sifovec135

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views7 pages

Bi 5

Uploaded by

sifovec135

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Intelligenceand Data Analytics

Rusiness L13
Lab Manual
Business Intelligence and Data Analvtics
Practical No. 8 Lab Manual

Porformn the data clustering using clusterra alçortr m

srequire(graphics)
Al

#a2.-dimensional
lexample
R Console rbind(matrix(rnorm(100, sd =0.3), ncol 2)
. matrix(rnorm( 100, mean =1,3d a
0.3). ncol 2)
"y")
olnames(x)<- c("x"
sfd<-kmeans(x.2))
625. 2:4, clustering with 2 clusters of sizes51.49
K-means
Clustermeans,:

t0.02623258 -0.05595237
996460484 L.00834326

Clustering vector:
12

22222 22.
2222122
|lII11111L|1IIill2222:2222222
22.
(82] 22 2
squares by cluster :
within cluster sum of
1110.683124 9.464926
Y%)
(between_SS / total SS = 714

Avalable components:

[1]"cluster" "centers" "totss" "withinss" "tot.withinss" "betweenss* iter

50. 50
>K-means clustering with 2 clusters of slzes

» plot(x, col = clScluster)

>points(clScenters, col = 1:2, pch 8, cex =2)

> # sum of squares

>SS<- function(x) sum(scale(x, scale a FALSE)^2)

> ## cluster centers "fiitted" to each obs.:

>fited.x <- fltted(c): head(fitted.x)

10.02623258 -0 05595237
10.02623258 -0.05595237
10.02623258 -0.05595237
10.02623258 -0.05595237
10.02623258 -0.05595237 TachLaouledys
10.02623258 -0.0s595237
Tech Kaouledge
PuDiCatI0
Business.
Intelligenceand| Data Analytics
L-75
L-74
Lab Manual custer means
Lab Manual
Business Intelligence and Data Analytics y
L0040085 1.3382030
-015007776-0.4181972
134308070.850146s
0.70216480.8706605
-025769550.1991452
&
0.3498496 0.0499787

Clusteringvector:

6252265622

10
1]56
2344134444311131113
14
(82]443144344 4 4133311641

Wthín cluster
sum of
25 2 526525626 2 6 5 6 6 65262626 5 563434141 4 34
squares by cluster:
00 (1J0.9875594 0.8958093 0.93791644 1.8520779 1.0357106 1.6801028
89.S %)
(betweenSS/total_SS =

Avallable components:
## Equalities: --
"totss")]), # the sanme two columns 11 "cluster" "centers!" "totss" "withínss" "totwithinss" "betweenss 'slze'
cbind (cl[c("betweenss", "tot. withinss" "iter "laul
c(ss(fittedx), ss(resid.x), ss(x)))
cl$cluster)
>plot(x, col =
pch =8)
stopifnot(allequal(cls totss, ss(x)). >points(cl$centers, col =1:5,
+ all.equal(clS tot.withinss, ss(resid.x)),
## these three are the same:

+ all.equal (clS betweenss, ss (fitted.x),

+ all.equal(cl$ betweenss, cl$totss - cIStot.
withinss).
+ ## and hence also
+ all.equal(ss(x), ss(itted.x) +ss(resid.x))
+)

kmeans(x,1)$withinss # trivial one-cluster, (its W.SS == ss(x)

##random starts do help here with too many clusters

## (and are often recommended anyway!): 15

(cl <- kmeans(x, 6, nstart = 29))

K-means clustering with 6 clusters of sizes 15, 16, 13, 21, 14. 21
PODications

Tech Knouled
PuDICatio0S
Intelligence allU
Buslnes
c L-77
Lab Manual Lab Manual
L-76 Coefficients:
Business Intelligence and Data Analytics
EstímateStd. Error t value Pr(>\)
Practical No. 9
8.1473 27.0454 0.301 0.7709
Antercept)
0.3827 0.1693 2.261 0.0536,
on the given datawarehoUse vala (power) of
Alm : Perform the Linear rearession where exponent iboth these
through an equation,
variables are related plotted as agraph. A
In Linear Regression thesetwO
variables is 1. Mathematically alinear relationship represents a
straight line when
non-linear lenif
codes: 0
'*** 0.001 **0.01 "0.050.,1"1
to 1 creates a curve
variable is not equal
relationship where the exponent of any standarderror: 13.8 on 8degrees of freedom
y= ax + b is an equation for linear regression. Restdual
constantswhich are callaa. R-squared: 0.3899, Adjusted R-squared: 0.3136
variable and a and b are
response variable, x is the predictor height is known. To do this we
Multiple
on 1l and
Where, y is the when his 8 DF, p-value; 0.05363
predicting weight of a person F-statlstic: 5.113
coefficients. Asimple example of regression is persons
and weight of person. Predlctthe
welghttoof new
need to have the relationship between height
The steps to create the relationship is - predictorvector.
height and corresponding weight. # The ,175, 139, 186, 125, 146, 199, 183, 162, 121)
>x<(141,
gathering a sample of observed values of
Carry out the experiment of
functions in R.
Create a relationship model using the lm) resposne vector.
using these ,# The
coefficients from the model created and create the mathematical equation -c(93, 84, 56, 81,
57. 47, 86,71, 61, 49)
Find the
prediction. Also called residuals.
model to know the average error in I function. relation <-Im(yx)
Get a summary of the relationship theeIm)
predlct) function in R.
>#Apply
To predict the weight of new persons, use the
person with height170.
,#Find weight of a
Create Relationship Model & get the Coefficients
170)
>X<- c(141, 175, 139, 186, 125, 146, 199, 183, 162, 121) >a<- data.frame(x=
>y<- c(93, 84, 56, 81, 57, 47, 86, 71, 61, 49) predict(relation,a)
> relation <- Im(y~x) >result <-
>print(result)
>print(relation) 1
73.20728
Call:
Visualize the Regression Graphically
Im(formula =y ~ x) variable.
># Create the predictor and response
186, 125, 146, 199, 183, 162, 121)
Coefficients:
>x< c(141, 175, 139,

(Intercept) X >y<- c(93, 84, 56, 81, 57, 47, 86, 71, 61, 49)
8,1473 0.3827 >relation <- Im(y~x)
Get the Summary of the Relationship
="linearregression.png]
>#Give the chart file a name. png(ile
>print(summary(relation))
> #Plot the chart.
Call: ="Weight in
abline(Im(x~y),cex= 1.3,pch 16,xlab
Im(formula =y~x) Weight Regression",
plot(y,x,col ="blue",main ="Height &
Residuals:
Kg"ylab ="Height in cm")
Min 1Q Median 3Q Max TechKnouledge
PUDICations
-17.022 -6.750 -2.164 1.688 30,891
TechKnouledge
PubICatlons
telligenceand Data Analytics
I
wsiness L79

L-78
Lab Manual Lab Manual
Business Intelligence and Data Analytics

Height &Welght Regresslon

TechKnouledge
PUDIC atlons

Tech Knouledge
PubIlcationS
!
IntelligenceandData Analytics
o.diness
Business Intelligence and Data Lab Manual
Analytics L-80

Lab Manual

.
0.92
092

085

)97

Trchliewlde

TerhKnowledge
Duhc 3tlon:
lntelligenceand| Data Analytics
Business Intelligence and Data Lab Manual
Business
L-83
Analytics L-82

Lab Manual

0.92 0.97

092 097

0.85
0.85

-5.09
-5.09

asts

0.92 0.97
0.92 0.97

0.85
0.85

-509
-5.09

).92 0.97

085

-5.09

TechKouled
PuDICations

Tech Knouledge
PUblC a tions
Business Intelligence and Data Analytics L-84
Lab Manual
BusinessIntelligenceandI Data Analytics
L-85

7
Ne
Practical No. 10 Lab Manual

VSUALIZATOMS Performthe logistic regression on the given

Alm:
The in-built data set
"mtcars" datawarehouse data.
describes different
"mtcars'" data set, the transmission mode (automatic or models of a car with thelr various
or 1). We can create a logistic regression model manual)theis described by the columnengine
value(0
cyl.
between columns "am" 3 am speci whichfliscataiobinary
ns. In
and other
RRGui (32-bit) -[R Conzole] columns - hp, wt and
R File Edit
View Mise Psckages Wndows Help

Bezde RXA é 110 2.629

FIL TERS
Mazde AX4 #ac 1 6 110 2.875
4 93 2.320
HOrREt 4 Dyíve 6 116 3.215
Hornet Sportabout e 175 3.440
Valiant 6 105 3.969

t Create Regression Model

We use the glm) function to create the regression model and get its summary for analysis.
RGui (32-bi0 -(R Conzole]
R Fle Edn Vie Misc Packeges Windors Help

inpu c- CArs l, c("arc*, "cyi", "tp", ");

E1nERT(3.dat&) )

Ceance ResL4ais:
Hedian
-2.17272 -9.14907 -0.01444 0.14116 1.27641

Cerficent:
Zstizate 3td.
.1l632 2.429 0.c252
(1ntercer:) 239. 1.07282 0.455 0.8491
0.4878g
o.03259 0.01926 1,729 o.0240.
-9.14947 4.153)2 -2.203 o,Q216"

31çn2r. ts: 0 1444 0.001 . o.02 * 0.05 ." 0 . 2 :

a 32 degrees of tret1a
Sul! 1evLance: 43.229?
9.5415 cn 29 degrees or treesom
Res1duel deviance:
AIC: 17.641

1teretic23:
Nater ot T19ber 5coziag

"cyl" and "hp", we consider

more than 0.05 for the variables
n the summary as the
p-value in the last column is
variable "am". Only welght (wt) impacts the "am" value in
value of the
to beinsignificant in contributing to the DO0
this regression model.

Techknouledge
PuDcatlons

Rays of Truth Crystals of Light Information and Guidance For The Golden Age by Fred Bell
100% (2)
Rays of Truth Crystals of Light Information and Guidance For The Golden Age by Fred Bell
868 pages
Keith McNulty - Handbook of Regression Modeling in People Analytics-Routledge (2021)
100% (1)
Keith McNulty - Handbook of Regression Modeling in People Analytics-Routledge (2021)
272 pages
Listening Acept
100% (1)
Listening Acept
29 pages
Data Analytics Unit 3
No ratings yet
Data Analytics Unit 3
104 pages
310 Drum Lifting Jacks, Shafts, Loading Traverses
100% (1)
310 Drum Lifting Jacks, Shafts, Loading Traverses
14 pages
Full Download A Walk Through Combinatorics An Introduction To Enumeration and Graph Theory 4th Edition Miklós Bóna PDF
No ratings yet
Full Download A Walk Through Combinatorics An Introduction To Enumeration and Graph Theory 4th Edition Miklós Bóna PDF
34 pages
Ba All Notes Merge - Merged
No ratings yet
Ba All Notes Merge - Merged
385 pages
ASME B31J B31J Essentials Why These Are Useful in Piping Stress Analysis
No ratings yet
ASME B31J B31J Essentials Why These Are Useful in Piping Stress Analysis
4 pages
Hirac (Manhole Installation)
No ratings yet
Hirac (Manhole Installation)
7 pages
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
No ratings yet
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
89 pages
Book CHPT 9 PPT - SLR
No ratings yet
Book CHPT 9 PPT - SLR
87 pages
Summative Test Reading and Writing
No ratings yet
Summative Test Reading and Writing
3 pages
Lecture Notes - Logistic Regression
100% (1)
Lecture Notes - Logistic Regression
11 pages
DFPC Fire Instructor I NFPA 1041 2007
No ratings yet
DFPC Fire Instructor I NFPA 1041 2007
10 pages
IDEA TRIBE - 2025 - Broucher
No ratings yet
IDEA TRIBE - 2025 - Broucher
4 pages
IAWA J. Suppl.5. Wood Anatomy Mimosoideae
No ratings yet
IAWA J. Suppl.5. Wood Anatomy Mimosoideae
119 pages
Inter-Personal Communication-Listening, Feedback Collaborative Processes in Work Groups
No ratings yet
Inter-Personal Communication-Listening, Feedback Collaborative Processes in Work Groups
18 pages
Linear Regression With Python
No ratings yet
Linear Regression With Python
140 pages
The Tech Interview Playbook: From DSA to System Design
From Everand
The Tech Interview Playbook: From DSA to System Design
Chinmoy Mukherjee
No ratings yet
Second Stats Packet 24
No ratings yet
Second Stats Packet 24
100 pages
Oliver Twist Essay Questions
100% (2)
Oliver Twist Essay Questions
4 pages
Oracle Fusion HRMS UAE HR Data Rel13 1
No ratings yet
Oracle Fusion HRMS UAE HR Data Rel13 1
138 pages
Business Analytics Unit - V Notes - 60637708 - 2025 - 05 - 15 - 02 - 16
No ratings yet
Business Analytics Unit - V Notes - 60637708 - 2025 - 05 - 15 - 02 - 16
37 pages
Model Evaluation
No ratings yet
Model Evaluation
80 pages
Practical - Regression
No ratings yet
Practical - Regression
114 pages
Unit 2
No ratings yet
Unit 2
80 pages
LinearRegressionUsing R
No ratings yet
LinearRegressionUsing R
91 pages
R Module 11 - Statistics
No ratings yet
R Module 11 - Statistics
35 pages
Regression Models Course Notes
No ratings yet
Regression Models Course Notes
102 pages
Vibration Meter Circuit Using LED Driver IC LM3915 - Gadgetronicx
No ratings yet
Vibration Meter Circuit Using LED Driver IC LM3915 - Gadgetronicx
4 pages
Statlearn PDF
No ratings yet
Statlearn PDF
123 pages
Rules and Procedures of Solving Mathematical Problems
No ratings yet
Rules and Procedures of Solving Mathematical Problems
17 pages
Linear Regression With R
No ratings yet
Linear Regression With R
45 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
27 pages
BDA Lab Manual (12 Weeks)
No ratings yet
BDA Lab Manual (12 Weeks)
22 pages
Frequency Distribution Table
No ratings yet
Frequency Distribution Table
2 pages
BDA MSC It
No ratings yet
BDA MSC It
35 pages
Lesson 1
No ratings yet
Lesson 1
14 pages
Introduction To Correlation and Regression
No ratings yet
Introduction To Correlation and Regression
53 pages
Unit 3
No ratings yet
Unit 3
30 pages
Preview-9781000427899 A41277316
No ratings yet
Preview-9781000427899 A41277316
28 pages
Statistical Regression
No ratings yet
Statistical Regression
32 pages
Recipes For Data Processing
No ratings yet
Recipes For Data Processing
51 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
48 pages
Reliability Analysis
No ratings yet
Reliability Analysis
22 pages
MCQS Unit IV Jacobian2
No ratings yet
MCQS Unit IV Jacobian2
6 pages
Tema-3-Econometria-Tema-3 en
No ratings yet
Tema-3-Econometria-Tema-3 en
21 pages
Ch9 - Correlation Regression
No ratings yet
Ch9 - Correlation Regression
23 pages
Experiment No.8 - Fit Simple Linear Regression Models Using Built-In Functions.
No ratings yet
Experiment No.8 - Fit Simple Linear Regression Models Using Built-In Functions.
8 pages
Lecture 4.3 Regression-1
No ratings yet
Lecture 4.3 Regression-1
30 pages
Linearregression
No ratings yet
Linearregression
18 pages
Project 5 Surabhi Sood - Report
No ratings yet
Project 5 Surabhi Sood - Report
34 pages
2SC5200/FJL4315 NPN Epitaxial Silicon Transistor: Applications
No ratings yet
2SC5200/FJL4315 NPN Epitaxial Silicon Transistor: Applications
7 pages
Session 6-15 - Unit II & III: Probability and Distribution, Classical Tests
No ratings yet
Session 6-15 - Unit II & III: Probability and Distribution, Classical Tests
34 pages
WEEK
No ratings yet
WEEK
17 pages
Final Cost Practical
No ratings yet
Final Cost Practical
29 pages
Samai (Hod) Applied Accounting Year 1 Business Economices Sec Sem
No ratings yet
Samai (Hod) Applied Accounting Year 1 Business Economices Sec Sem
19 pages
Linear Regression in Scikit-Learn (Sklearn) - An Introduction - Datagy
No ratings yet
Linear Regression in Scikit-Learn (Sklearn) - An Introduction - Datagy
22 pages
Ramesh Babu Pushpanathan Consultant-SAP
No ratings yet
Ramesh Babu Pushpanathan Consultant-SAP
15 pages
Non-Linear Data Models: Anol Bhattacherjee, Ph.D. University of South Florida
No ratings yet
Non-Linear Data Models: Anol Bhattacherjee, Ph.D. University of South Florida
28 pages
Islp 1
No ratings yet
Islp 1
15 pages
cs447 - Tool Making Predictions With Simple Linear Regression
No ratings yet
cs447 - Tool Making Predictions With Simple Linear Regression
5 pages
Aimil Ist Lot Delivery
No ratings yet
Aimil Ist Lot Delivery
2 pages
Linear Model
No ratings yet
Linear Model
10 pages
R Unit 4th and 5th
No ratings yet
R Unit 4th and 5th
17 pages
Presentation Business Applications
No ratings yet
Presentation Business Applications
18 pages
Predictive Modeling-Handouts
No ratings yet
Predictive Modeling-Handouts
11 pages
Approvals - Listofproducts - Siemens 2019 PDF
No ratings yet
Approvals - Listofproducts - Siemens 2019 PDF
3 pages
Machine Learning
No ratings yet
Machine Learning
5 pages
Data Science Lab 5
No ratings yet
Data Science Lab 5
8 pages
Determinants of Work-Readiness: Siti Nurlaela Kurjono Rasto
No ratings yet
Determinants of Work-Readiness: Siti Nurlaela Kurjono Rasto
7 pages
20BCE1205 Lab3
No ratings yet
20BCE1205 Lab3
9 pages
Why's and Wherefore's
No ratings yet
Why's and Wherefore's
15 pages
BIP5,8,9,10 Mahesh
No ratings yet
BIP5,8,9,10 Mahesh
7 pages
R Tutorial Slides
No ratings yet
R Tutorial Slides
13 pages
What Is Empirical - Models
No ratings yet
What Is Empirical - Models
14 pages
Marking scheme-PT-1-XII Physics
No ratings yet
Marking scheme-PT-1-XII Physics
3 pages
Mindanao State University General Santos City: Simple Linear Regression
No ratings yet
Mindanao State University General Santos City: Simple Linear Regression
12 pages
7708 - MBA PredAnanBigDataNov21
No ratings yet
7708 - MBA PredAnanBigDataNov21
11 pages
Merge DPP-10 - 21 - Isomerism Chemistry - Dropper NEET 13
No ratings yet
Merge DPP-10 - 21 - Isomerism Chemistry - Dropper NEET 13
1 page
Problem Sheet
No ratings yet
Problem Sheet
2 pages
Day 6
No ratings yet
Day 6
3 pages
Regressi On
No ratings yet
Regressi On
16 pages
R Data Analysis
No ratings yet
R Data Analysis
10 pages
Rstudio Study Notes For PA 20181126
No ratings yet
Rstudio Study Notes For PA 20181126
6 pages
Untitled Document
No ratings yet
Untitled Document
6 pages
ESDL Lab Manual
No ratings yet
ESDL Lab Manual
7 pages
Predictive Analytics Tool Simple Linear Regression
No ratings yet
Predictive Analytics Tool Simple Linear Regression
2 pages
Correlation and Regression
No ratings yet
Correlation and Regression
5 pages
Hidden Line Removal: Unveiling the Invisible: Secrets of Computer Vision
From Everand
Hidden Line Removal: Unveiling the Invisible: Secrets of Computer Vision
Fouad Sabry
No ratings yet