Baylas - Linear Regression Analysis

The dataset contains two columns: number of hours students studied and the marks they got. There is a strong positive correlation of 0.976 between hours studied and scores. A linear regression model found that hours studied significantly predicts scores, with an R-squared value of 0.95. Plotting hours against scores and adding a linear regression line shows their strong linear relationship.

Uploaded by

Fatima Baylas

Student Study Hours Dataset Insights
Baylas, Fatima Joan B.
BSCS-2
CSPE 3100 : Data Science
Dataset
The dataset contains two columns: the number of hours each student studied and the marks they received.

Source (Kaggle): himanshunakrani/student-study-hours

kaggle datasets download -d himanshunakrani/student-study-hours

# Install ggplot2 once if it is not already available
# install.packages("ggplot2")
library(ggplot2)

# Load the dataset (read.csv is base R, so readr is not required)
studstudyhour = read.csv("score.csv", sep = ",")

summary(studstudyhour)
head(studstudyhour)

hours = studstudyhour[, "Hours"]
scores = studstudyhour[, "Scores"]

# Scatter plot: plot(x, y)
plot(hours, scores, pch = 16, col = "blue")

# Correlation between x and y
cor(hours, scores)

# Linear regression model
model = lm(scores ~ hours, data = studstudyhour)
summary(model)
abline(model)  # add the fitted line to the base scatter plot

# Using ggplot
ggplot(data = studstudyhour, aes(x = hours, y = scores)) +
  geom_point(colour = "black", size = 1.5) +
  geom_smooth(method = "lm", se = FALSE, colour = "red", size = 0.8)
Insights

> cor(hours, scores)
[1] 0.9761907
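The value returned by cor() is the Pearson correlation coefficient, which can be written as the covariance of the two variables divided by the product of their standard deviations. The sketch below checks that identity on a small set of made-up numbers; toy_hours and toy_scores are hypothetical values chosen for illustration and are not taken from the Kaggle dataset.

```r
# Hypothetical toy values for illustration only; NOT the Kaggle data.
toy_hours  <- c(1, 2, 3, 4, 5)
toy_scores <- c(12, 25, 29, 41, 52)

# Pearson correlation from its definition: cov(x, y) / (sd(x) * sd(y))
r_manual <- cov(toy_hours, toy_scores) / (sd(toy_hours) * sd(toy_scores))

# Built-in equivalent, as used in the slides
r_builtin <- cor(toy_hours, toy_scores)

# The two results should agree up to floating-point tolerance
print(all.equal(r_manual, r_builtin))
```

A value close to 1, such as the 0.976 reported here, indicates that the points lie close to an upward-sloping straight line.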
#plot(x,y)
plot(hours, scores, pch = 16, col = "blue")
> model = lm(scores~hours, data=studstudyhour)
> summary(model)

Call:
lm(formula = scores ~ hours, data = studstudyhour)

Residuals:
    Min      1Q  Median      3Q     Max
-10.578  -5.340   1.839   4.593   7.265

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept)   2.4837     2.5317   0.981    0.337
hours         9.7758     0.4529  21.583   <2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 5.603 on 23 degrees of freedom
Multiple R-squared: 0.9529, Adjusted R-squared: 0.9509
F-statistic: 465.8 on 1 and 23 DF, p-value: < 2.2e-16

R-squared value: 0.95
P-value: < 2.2e-16
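As a rough reading of this output, the fitted line is score = 2.4837 + 9.7758 × hours, so each additional hour of study is associated with about 9.8 more marks. The sketch below plugs the reported coefficients into that equation and also checks that, for a regression with a single predictor, R-squared equals the square of the correlation coefficient. The helper predict_score is a hypothetical name introduced here, not part of the original slides.

```r
# Coefficients copied from the summary(model) output
intercept <- 2.4837
slope     <- 9.7758

# Hypothetical helper: predicted score for a given number of study hours
predict_score <- function(h) intercept + slope * h

# Predicted score for 5 hours of study: 2.4837 + 9.7758 * 5
print(predict_score(5))

# With one predictor, R-squared is the squared Pearson correlation
r <- 0.9761907          # value reported by cor(hours, scores)
r_squared <- r^2
print(round(r_squared, 4))
```

The squared correlation rounds to 0.9529, which matches the Multiple R-squared line in the summary.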
abline(model)
ggplot(data = studstudyhour,aes(x = hours,y = scores)) +
geom_point(colour = "black",size = 1.5) +
geom_smooth(method = "lm",se = FALSE,colour = "red",size = 0.8)
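The numbers quoted in these slides can also be read directly from a fitted model object instead of being copied from the printed summary. The following is a minimal sketch under stated assumptions: the data frame toy and its values are hypothetical and stand in for score.csv, and the column names Hours and Scores mirror those used in the slides.

```r
# Hypothetical toy data for illustration only; NOT the Kaggle data.
toy <- data.frame(
  Hours  = c(1, 2, 3, 4, 5),
  Scores = c(12, 25, 29, 41, 52)
)

# Fit the same kind of model, referring to columns by name
fit <- lm(Scores ~ Hours, data = toy)

# Intercept and slope as a named numeric vector
print(coef(fit))

# R-squared taken from the summary object
print(summary(fit)$r.squared)

# Predicted score for a new, unseen number of study hours
new_obs <- data.frame(Hours = 6)
print(predict(fit, newdata = new_obs))
```

Referring to columns by name in the formula keeps the model tied to the data frame, which makes predict() with newdata straightforward to use.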
