R Programming Student Lab Manual

The R function pgeom(q, prob, lower.tail) gives the cumulative probability
(lower.tail = TRUE for the left tail, lower.tail = FALSE for the right tail) of
observing q or fewer failures prior to the first success.
Example: A sports marketer randomly selects persons on the street until he
encounters someone who attended a game last season. What is the
probability that the marketer encounters at most x <= 5 people who did not
attend a game before finding someone who did, when the population
probability is p = 0.20?
Sol: p = 0.20, n = 5
p <- 0.20
n <- 5
# exact cumulative probability of 5 or fewer failures before the first success
pgeom(q = n, prob = p, lower.tail = TRUE)
[1] 0.737856
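As a quick sketch of the lower.tail argument, the complementary right-tail
probability (more than 5 failures before the first success) is obtained by
flipping the flag:

# Right tail: probability of more than 5 failures before the first success
pgeom(q = 5, prob = 0.20, lower.tail = FALSE)
[1] 0.262144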

Experiment 10. Implement built-in R functions for sample statistics and
statistical tests
10(a) Calculate Confidence Interval in R for Normal Distribution
# Assume mean = 12
# Standard deviation = 3
# Sample size n = 30
# 95 percent confidence interval, so each tail holds .025 (use the 0.975 quantile)

Solution
> center <- 12
> sd <- 3
> n <- 30
> E <- qnorm(0.975)*sd/sqrt(n)
> E
[1] 1.073516
> lower_bound <- center - E
> lower_bound
[1] 10.92648
> upper_bound <- center + E
> upper_bound
[1] 13.07352
Therefore lower_bound is 10.92648 and upper_bound is 13.07352.
Thus the range in this case is between 10.9 and 13.1 (rounding outwards).
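When the standard deviation is estimated from the sample itself rather than
assumed known, the same interval would normally be built with the t
distribution. A minimal sketch, reusing the numbers above:

# 95% CI using the t distribution (sd estimated from the sample)
center <- 12
s <- 3
n <- 30
E_t <- qt(0.975, df = n - 1) * s / sqrt(n)   # t quantile with n - 1 degrees of freedom
c(center - E_t, center + E_t)                # slightly wider than the normal-based interval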

Experiment 10 (b) Hypothesis Testing


Hypothesis testing is mathematically related to the problem of finding
confidence intervals, but the approach is different. For confidence intervals,
you use the data to tell you where the unknown parameter should lie; for
hypothesis testing, you make a hypothesis about the value of the unknown
parameter and then calculate how likely it is that you would observe data as
extreme as, or more extreme than, what you saw. With R you will not notice
much difference, as the same functions are used for both; the way you use
them is slightly different, though.
Question: Consider a simple survey. You ask 100 people (randomly chosen)
and 42 say "yes" to your question. Does this support the hypothesis that the
true proportion is 50%? To answer this, we set up a test of hypothesis. The null
hypothesis, denoted H0, is that p = 0.5; the alternative hypothesis, denoted HA,
in this example is p ≠ 0.5. This is a so-called "two-sided" alternative. To test
the hypothesis, we use the function prop.test, as with the confidence
interval calculation.
Solution:
> prop.test(42,100,p=.5)

Note the p-value of 0.1336. The p-value reports how likely we are to see this
data, or worse, assuming the null hypothesis. The notion of "worse" is implied
by the alternative hypothesis. In this example, the alternative is two-sided, as
either too small or too large a value of the test statistic is consistent with HA.
In particular, the p-value is the probability of 42 or fewer, or 58 or more,
answering "yes" when the chance a person will answer "yes" is fifty-fifty.

Now, the p-value is not so small as to make an observation of 42 out of 100
seem unreasonable under the null hypothesis. Thus, one would "accept" the
null hypothesis. Next, we repeat the survey, but this time suppose we ask 1000
people and 420 say yes. Does this still support the null hypothesis that p = 0.5?
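Running the same test on the larger sample is a one-line call of the same form:

> prop.test(420, 1000, p = .5)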

Now the p-value is tiny (that's 0.0000004956!) and the null hypothesis is not
supported. That is, we "reject" the null hypothesis. This illustrates that the
p-value depends not just on the ratio, but also on n. In particular, this is because
the standard error of the sample proportion gets smaller as n gets larger.
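A quick numerical sketch of that shrinking standard error, assuming p = 0.5:

# Standard error of a sample proportion, sqrt(p*(1-p)/n), at p = 0.5
sqrt(0.5 * 0.5 / 100)     # 0.05 for n = 100
sqrt(0.5 * 0.5 / 1000)    # about 0.0158 for n = 1000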

Experiment 11. Implement R Program to predict data using Linear Regression model.

Linear regression is one of the simplest and most common supervised machine
learning algorithms that data scientists use for predictive modeling. In this
experiment, we use linear regression to build a model that predicts cherry tree
volume from metrics that are much easier for people who study trees to measure.

 Collect some data relevant to the problem (more is almost always better).
 Clean, augment, and preprocess the data into a convenient form, if
needed.
 Conduct an exploratory analysis of the data to get a better sense of it.
 Using what you find as a guide, construct a model of some aspect of the
data.
 Use the model to answer the question you started with, and validate your
results.

 Simple Linear Regression is handled by the built-in function 'lm' in R, as the
sketch below illustrates.
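As an illustrative sketch of this workflow, assuming R's built-in trees dataset
(girth, height and volume of 31 felled cherry trees):

# A minimal sketch of the workflow using the built-in 'trees' dataset
data(trees)                                        # Girth, Height, Volume of cherry trees
summary(trees)                                     # quick exploratory look at the data
fit <- lm(Volume ~ Girth + Height, data = trees)   # construct the model
summary(fit)                                       # inspect the coefficients and fit
predict(fit, newdata = data.frame(Girth = 12, Height = 75))   # use the model to answer a question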

Creating the Linear Regression Model and fitting it with the training set
regressor = lm(formula = Y ~ X, data = training_set)
This line creates a regressor and fits it to the training data set. Here Y is the
response column and X the predictor column of training_set.
Multiple Linear Regression is also handled by the function lm.

Creating the Multiple Linear Regressor and fitting it with the training set
regressor = lm(Y ~ ., data = training_set)
The formula 'Y ~ .' takes all variables in training_set except Y as
independent variables.
lm() Function
This function creates the relationship model between the predictor and the
response variable.
Create Relationship Model & get the coefficients
x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131)
y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)

# Apply the lm() function.
relation <- lm(y ~ x)

print(relation)

Call:
lm(formula = y ~ x)

Coefficients:
(Intercept)            x
   -38.4551       0.6746

Get the Summary of the Relationship
x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131)
y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)

# Apply the lm() function and print the model summary.
relation <- lm(y ~ x)
print(summary(relation))

predict() Function
Syntax
The basic syntax for predict() in linear regression is:
predict(object, newdata)
Following is the description of the parameters used:
 object is the model object already created using the lm() function.
 newdata is a data frame containing the new values for the predictor variable.

Predict the weight of new persons

# The predictor vector (height in cm).
x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131)

# The response vector (weight in kg).
y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)

# Apply the lm() function.
relation <- lm(y ~ x)

# Find the weight of a person with height 170.
a <- data.frame(x = 170)
result <- predict(relation, a)
print(result)
       1
76.22869
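predict() also accepts several new values at once; a small sketch with
illustrative heights:

# Predict weights for several heights in one call (illustrative values)
new_heights <- data.frame(x = c(160, 170, 180))
predict(relation, new_heights)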
Visualize the Regression Graphically

# Create the predictor and response variable.
x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131)
y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)
relation <- lm(y ~ x)

# Give the chart file a name.
png(file = "linearregression.png")

# Plot the chart and add the fitted regression line.
plot(y, x, col = "blue", main = "Height & Weight Regression",
     cex = 1.3, pch = 16, xlab = "Weight in Kg", ylab = "Height in cm")
abline(lm(x ~ y))

# Save the file.
dev.off()

Experiment 12. Plotting in R
The most used plotting function in R programming is the plot() function. It
is a generic function, meaning, it has many methods which are called according
to the type of object passed to plot().
In the simplest case, we can pass in a vector and we will get a scatter plot of
magnitude vs index. But generally, we pass in two vectors and a scatter plot of
these points is produced.
For example, the command plot(c(1,2),c(3,5)) would plot the points (1,3) and
(2,5).
Here is a more concrete example where we plot the sine function over the range
-pi to pi.

x <- seq(-pi,pi,0.1)
plot(x, sin(x))

Adding Titles and Labelling Axes
We can add a title to our plot with the parameter main. Similarly, xlab and ylab
can be used to label the x-axis and y-axis respectively.

plot(x, sin(x),
main="The Sine Function", ylab="sin(x)")

Changing Color and Plot Type


We can see above that the plot uses circular points that are black in color.
These are the default plotting character and color.
We can change the plot type with the argument type. It accepts the following
strings and has the given effect.

"p" - points "l" - lines


"b" - both points and lines
"c" - empty points joined by lines "o" - overplotted points and lines "s" and "S" -
stair steps
"h" - histogram-like vertical lines
"n" - does not produce any points or line
Similarly, we can define the color using col.

plot(x, sin(x),
main="The Sine Function", ylab="sin(x)",
type="l", col="blue")

R 3D PLOTS
There are many functions in R programming for creating 3D plots. In this
section, we will discuss the persp() function, which can be used to create 3D
surfaces in perspective view.
This function mainly takes three variables, x, y and z, where x and y are
vectors defining locations along the x- and y-axes. The height of the surface
(z-axis) is given in the matrix z.
As an example, let's plot a cone. A simple right circular cone can be obtained
with the following function.

cone <- function(x, y) {
  sqrt(x^2 + y^2)
}
Now let’s prepare our variables.

x <- y <- seq(-1, 1, length = 20)
z <- outer(x, y, cone)
We used the function seq() to generate a vector of equally spaced numbers.
Then, we used the outer() function to apply the function cone at every
combination of x and y.
Finally, plot the 3D surface as follows.
persp(x, y, z)

Adding Titles and Labelling Axes to Plot
We can add a title to our plot with the parameter main. Similarly, xlab, ylab and
zlab can be used to label the three axes.
Rotational angles
We can define the viewing direction using the parameters theta and phi.
By default, theta (the azimuthal direction) is 0 and phi (the colatitude) is 15.
Colouring and Shading Plot
Colouring of the plot is done with parameter col. Similarly, we can add shading
with the parameter shade.
persp(x, y, z,
main="Perspective Plot of a Cone", zlab = "Height",
theta = 30, phi = 15,
col = "springgreen", shade = 0.5)
