
304 - BA-SC-BA-03 : ADVANCED STATISTICAL METHODS USING R

– By Pratik Patil
(2019 CBCS Pattern) (Semester - III)

Time : 2½ hours] [Max. Marks : 50
Instructions to the candidates:
1) All questions are compulsory.
2) Make appropriate assumptions wherever required.

Q1) Answer the following questions (Any Five)


a) Enlist basic statistical functions in R.
b) What is the difference between parametric and non-parametric tests?
c) Define predictive analytics.
d) Explain the pbinom() function in R.
e) How do you interpret the p-value in hypothesis testing?
f) Write a function to get a list of all the packages installed in R.
g) Write a function to obtain the transpose of a matrix in R.
h) What is the purpose of regression analysis in R?

a) Basic Statistical Functions in R:


• Mean: mean(x)
• Median: median(x)
• Mode: no built-in statistical mode function (mode(x) returns the storage type); typically computed from table(x)
• Standard Deviation: sd(x)
• Variance: var(x)
• Range: range(x)
• Minimum: min(x)
• Maximum: max(x)
• Quantiles: quantile(x, probs = c(0.25, 0.5, 0.75))
• Correlation: cor(x, y)
• Summary Statistics: summary(x)
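A quick demonstration of these functions on hypothetical vectors (all base R):
Code snippet
x <- c(2, 4, 4, 6, 8, 10)   # hypothetical data
y <- c(1, 3, 5, 7, 9, 11)
mean(x); median(x); sd(x); var(x)
range(x); quantile(x, probs = c(0.25, 0.5, 0.75))
cor(x, y); summary(x)
as.numeric(names(which.max(table(x))))   # statistical mode via a frequency table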
b) Parametric vs. Non-Parametric Tests:
Parametric Tests:
• Assume data follows a specific distribution (usually normal).
• More powerful when assumptions are met.
• Examples: t-test, ANOVA, linear regression.
Non-Parametric Tests:
• Make fewer assumptions about data distribution.
• More robust when assumptions are violated or for ordinal data.
• Examples: Wilcoxon rank-sum test, Kruskal-Wallis test, Spearman
correlation.
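As a brief sketch with hypothetical measurements, the parametric test and its non-parametric counterpart for the same comparison:
Code snippet
g1 <- c(5.1, 4.9, 6.2, 5.8, 5.5)   # hypothetical group 1
g2 <- c(6.8, 7.1, 6.5, 7.4, 6.9)   # hypothetical group 2
t.test(g1, g2)                     # parametric: assumes approximate normality
wilcox.test(g1, g2)                # non-parametric rank-sum alternative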
c) Predictive Analytics:
• Using statistical techniques and machine learning algorithms to make
predictions about future or unknown events.
• Involves building models based on historical data to uncover patterns and
relationships.
• Applications: forecasting sales, predicting customer behavior, detecting
fraud, risk assessment.
d) pbinom() Function in R:
• Calculates the cumulative probability of a binomial distribution.
• Syntax: pbinom(q, size, prob, lower.tail = TRUE)
o q: number of successes
o size: number of trials
o prob: probability of success
o lower.tail: TRUE for P(X ≤ q), FALSE for P(X > q)
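For instance, with 10 fair coin flips (a hypothetical example):
Code snippet
pbinom(3, size = 10, prob = 0.5)                      # P(X <= 3) ≈ 0.172
pbinom(3, size = 10, prob = 0.5, lower.tail = FALSE)  # P(X > 3) ≈ 0.828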
e) Interpreting p-Value:
• The probability of observing a test statistic as extreme or more extreme than
the one calculated, assuming the null hypothesis is true.
• Common threshold: 0.05
• If p-value ≤ 0.05, reject the null hypothesis (suggesting significant evidence
against it).
• If p-value > 0.05, fail to reject the null hypothesis (not enough evidence to
disprove it).
f) Getting List of Installed Packages:
Code snippet
installed.packages()               # matrix with one row per installed package
rownames(installed.packages())    # just the package names
g) Transpose of a Matrix:
Code snippet
m <- matrix(1:6, nrow = 2)   # 2 x 3 example matrix
t(m)                         # transpose: 3 x 2
h) Purpose of Regression Analysis in R:
• Modeling the relationship between a dependent variable (outcome) and one
or more independent variables (predictors).
• Used for:
o Predicting future values of the dependent variable.
o Understanding how independent variables influence the dependent
variable.
o Assessing the strength of relationships between variables.

Q2) Answer the following questions (Any Two)


a) Explain T-test of hypothesis testing in R. Write syntax and explain in
detail.
Here's an explanation of T-tests for hypothesis testing in R, including syntax and
details:
T-Test:
• A statistical test used to determine whether the means of two groups
significantly differ.
• Assumes the data is normally distributed and the variances of the two
groups are equal (or approximately equal).
Types of T-Tests in R:
1. One-Sample T-Test:
o Compares the mean of a sample to a known population mean.
o Syntax: t.test(x, mu = hypothesized_mean, alternative =
c("two.sided", "less", "greater"))
2. Two-Sample T-Test:
o Compares the means of two independent samples.
o Syntax: t.test(x, y, paired = FALSE, var.equal = TRUE)
▪ paired = TRUE for paired samples (e.g., pre-test and post-test scores).
▪ var.equal = FALSE for unequal variances (Welch's t-test).
Interpreting Results:
• P-value: The probability of observing a difference as extreme or more
extreme, assuming the null hypothesis is true (no difference).
o Common threshold: p-value ≤ 0.05 suggests rejecting the null
hypothesis (significant difference).
• Confidence interval: A range of values that likely contains the true mean
difference.
Example:
Code snippet
# One-sample t-test:
test_scores <- c(85, 92, 78, 89, 95)
t.test(test_scores, mu = 80)

# Two-sample t-test (independent samples):
group1 <- c(10, 12, 14, 16)
group2 <- c(15, 18, 20, 22)
t.test(group1, group2)
Key Considerations:
• Check assumptions of normality and equal variances using visualizations
and tests (e.g., Q-Q plots, Shapiro-Wilk test, F-test).
• Use non-parametric alternatives (e.g., Wilcoxon rank-sum test) if
assumptions are not met.
• Consider sample size and effect size (practical significance) alongside
statistical significance.

b) Define probability. Explain any two functions of probability distribution.


Probability Defined:
Probability measures the likelihood or chance of an event occurring. It is a
numerical value between 0 and 1, where:
• 0: The event is impossible.
• 1: The event is certain.
• Values between 0 and 1: Represent the degree of uncertainty, with higher
values indicating greater likelihood.

Probability plays a crucial role in various fields, including statistics, mathematics, decision-making, and everyday life. It helps us quantify uncertainty, predict outcomes, and make informed choices under uncertain conditions.

Functions of Probability Distribution:

Probability distributions provide a mathematical framework for describing the probabilities of different possible outcomes of a random event. Here are two important functions associated with them:

1. Cumulative Distribution Function (CDF):


• The CDF of a probability distribution (F(x)) gives the probability that a
random variable will be less than or equal to a specific value (x).
• It is a non-decreasing function, always starting at 0 and approaching 1 as x
approaches infinity.
• Useful for calculating the probability of an event falling within a certain
range.
2. Probability Density Function (PDF):
• The PDF of a probability distribution (f(x)) describes the probability density
of a random variable taking on a specific value (x).
• It represents the "instantaneous rate" of change of the CDF at x.
• Useful for visualizing the "spread" of the distribution and understanding the
relative likelihood of different outcomes.

By understanding these functions, we can gain valuable insights into the behavior
of random phenomena and make informed decisions based on predicted
probabilities.
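As a minimal illustration of these two functions, consider the standard normal distribution in R (pnorm is the CDF, dnorm the PDF):
Code snippet
pnorm(1.96)                          # CDF: P(X <= 1.96) ≈ 0.975
dnorm(0)                             # PDF: density at x = 0 ≈ 0.399
curve(dnorm(x), from = -4, to = 4)   # visualize the bell-shaped PDF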
c) What is linear regression? What do you mean by dependent and
independent variables? What is difference between linear & multiple
regression?
Linear Regression Explained:
Linear regression is a statistical technique used to model the relationship between
a dependent variable (the outcome you want to predict) and one or more
independent variables (the factors you believe influence the outcome). This
relationship is modeled by a straight line, hence the "linear" part of the name.
Key Components:
• Dependent Variable: The variable you're trying to predict or explain. It's
often denoted as "y".
• Independent Variables: The variables you believe influence the dependent
variable. They're often denoted as "x1", "x2", etc.
• Linear Model: The equation representing the relationship between the
dependent and independent variables. It typically takes the form y = β0 +
β1x1 + β2x2 + ε, where:
o β0 is the intercept (the y-value when all independent variables are
zero).
o β1, β2, etc. are the regression coefficients, indicating the contribution
of each independent variable to the dependent variable.
o ε is the error term, accounting for unexplained variance.
Types of Linear Regression:
• Simple Linear Regression: Only one independent variable is used to predict
the dependent variable.
• Multiple Linear Regression: Two or more independent variables are used to
predict the dependent variable.
Difference between Linear and Multiple Regression:
• Complexity: Simple regression is less complex and easier to interpret, while
multiple regression allows for more nuanced analysis by considering the
influence of multiple factors.
• Interpretation: Simple regression coefficients directly quantify the effect of
the single independent variable on the dependent variable. In multiple
regression, interpretations require considering the interplay and potential
interactions between multiple variables.
• Application: Simple regression is suitable for basic analysis with one
dominant factor, while multiple regression is beneficial for exploring
complex relationships involving multiple contributing factors.
Overall, linear regression is a powerful statistical tool for understanding and
predicting linear relationships between variables. Choosing between simple and
multiple regression depends on the specific context and research question.
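A minimal sketch of both forms in R, using the built-in mtcars dataset (variable choices are illustrative):
Code snippet
simple_model <- lm(mpg ~ wt, data = mtcars)               # simple linear regression
multiple_model <- lm(mpg ~ wt + hp + disp, data = mtcars) # multiple linear regression
summary(multiple_model)   # coefficients, R-squared, p-values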

Q3) Answer the following question (Any one).


a) Examine ANOVA in R. State the assumptions and explain one-way ANOVA in detail. Also state the benefits of ANOVA.
Examining ANOVA in R:
Analysis of Variance (ANOVA) is a statistical technique used to compare the
means of three or more groups. It analyzes whether the differences between group
means are statistically significant or simply due to chance.
Assumptions of ANOVA:
1. Normality: Each group's data should be normally distributed.
2. Homoscedasticity: Variances of all groups should be equal.
3. Independence: Observations within each group should be independent of
each other.
4. Equal interval data: The data should be measured on an interval or ratio
scale.
One-Way ANOVA in R:
Here's how to perform a one-way ANOVA in R:
1. Load Data:
Code snippet
data <- read.csv("your_data.csv")                   # hypothetical file
data$group_variable <- factor(data$group_variable)  # grouping variable must be a factor
2. Perform ANOVA:
Code snippet
anova_model <- aov(outcome_variable ~ group_variable, data = data)
summary(anova_model)
3. Interpret Results:
• The summary will show the F-statistic and p-value.
• If the p-value is less than your significance level (e.g., 0.05), you reject the
null hypothesis that the group means are equal.
• You can further investigate pairwise comparisons between groups using
post-hoc tests like Tukey's HSD.
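As a sketch, pairwise post-hoc comparisons on the fitted model (TukeyHSD is in base R's stats package):
Code snippet
TukeyHSD(anova_model)   # pairwise differences in group means with adjusted p-values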
Benefits of ANOVA:
• Compare multiple groups: Allows for simultaneous comparison of several
group means, reducing the need for multiple t-tests.
• Control for other variables: Can include additional factors in the model to
control for their influence on the outcome variable.
• Robustness: Relatively robust to departures from normality compared to
other methods.
• Interpretability: Provides easily interpretable estimates of group means and
effect sizes.
Remember to check the assumptions before using ANOVA and consider
alternative methods if they are not met.

b) What do you mean by dimension reduction? Explain linear discriminant analysis (LDA) with syntax. Also explain the application of LDA in the marketing domain.
Here's an explanation of dimension reduction, LDA, and its application in
marketing:
Dimension Reduction:
• It's a technique that involves reducing the number of variables (dimensions)
in a dataset while retaining most of the important information.
• It's beneficial for:
o Handling high-dimensional data that can be computationally
expensive or difficult to visualize.
o Mitigating the curse of dimensionality, where performance of some
algorithms degrades with too many dimensions.
o Identifying the most important features for prediction or analysis.
Linear Discriminant Analysis (LDA):
• A supervised dimension reduction technique for classification.

• Aims to find a linear combination of features that best separates two or more
classes in the data.
• Projects data onto a lower-dimensional space where classes are maximally
separated, aiding classification.
R Syntax for LDA:
Code snippet
library(MASS)   # lda() is provided by the MASS package

lda_model <- lda(class ~ ., data = your_data)  # class is the categorical variable

# Predict classes for new data:
predictions <- predict(lda_model, newdata = new_data)
predictions$class   # predicted class labels
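A runnable illustration on the built-in iris dataset (three species, four numeric predictors):
Code snippet
iris_lda <- lda(Species ~ ., data = iris)
head(predict(iris_lda)$class)   # in-sample predicted species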
Application of LDA in Marketing:
• Customer Segmentation: Identifying distinct customer groups based on
demographics, purchase behavior, preferences, etc.
• Market Targeting: Determining which segments to focus marketing efforts
on for maximum impact.
• Campaign Response Prediction: Predicting which customers are most likely
to respond to specific marketing campaigns.
• Customer Churn Prediction: Identifying customers at risk of leaving and
taking proactive measures to retain them.
• Product Recommendation: Recommending products or services that align
with customer preferences and interests.
Key Considerations:
• LDA assumes normally distributed data within each class.
• It's sensitive to outliers, so consider pre-processing steps.
• It works best when classes are well-separated in the original space.
• For non-linear class boundaries, consider alternatives such as quadratic discriminant analysis (QDA) or kernel-based discriminant methods.

Q4) Answer the following question (Any One)

a) Describe descriptive analytics in R. Explain any three functions of descriptive analytics in R.
Descriptive analytics in R involves summarizing and describing the key
characteristics of a dataset. It helps us understand the basic features, patterns, and
distributions of the data.
Here are three commonly used functions for descriptive analytics in R:
1. summary():
o Provides a concise overview of a dataset or variable.
o For numeric variables, reports the minimum, first quartile, median, mean, third quartile, and maximum, plus a count of missing values if present.
o Example: summary(data) or summary(data$variable)
2. str():
o Displays the internal structure of a dataset or object.
o Reveals data types, variable names, and dimensions.
o Example: str(data)
3. table():
o Produces frequency tables for categorical variables.
o Shows the count of occurrences for each unique value.
o Example: table(data$category)
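A minimal, runnable sketch of these three functions on the built-in mtcars dataset:
Code snippet
summary(mtcars$mpg)   # min, quartiles, median, mean, max
str(mtcars)           # data types, variable names, dimensions
table(mtcars$cyl)     # frequency of each cylinder count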
Additional functions for descriptive analytics in R:
• mean(): Calculates the arithmetic mean (average).
• median(): Calculates the median (middle value).
• sd(): Calculates the standard deviation (measure of spread).
• var(): Calculates the variance.
• range(): Returns the minimum and maximum values.
• quantile(): Computes specified quantiles (e.g., quartiles, percentiles).
• hist(): Creates histograms to visualize distributions.
• boxplot(): Creates box plots to visualize distributions and outliers.
Benefits of Descriptive Analytics:
• Data Understanding: Gain insights into the nature of your data.
• Pattern Identification: Discover trends, patterns, and relationships.
• Outlier Detection: Identify unusual or extreme values that might warrant
further investigation.
• Data Cleaning: Detect errors or inconsistencies that need correction.
• Visualization Guidance: Inform the choice of appropriate visualizations.
Descriptive analytics is often the first step in data analysis, laying a foundation for
further exploration and modeling.

b) What is logistic regression in R? Assume suitable data and explain how you interpret regression coefficients in R.
Here's an explanation of logistic regression in R and how to interpret its
coefficients:
Logistic Regression:
• A statistical method used to model the probability of a binary outcome (e.g.,
yes/no, success/failure) based on a set of independent variables.
• Employs a logistic function (S-shaped curve) to transform predicted values
into probabilities between 0 and 1.
• Assumes a linear relationship between the predictors and the log-odds of the outcome, rather than with the outcome itself.
Interpreting Regression Coefficients in R:
1. Fit the Model:
Code snippet
model <- glm(y ~ x1 + x2, data = your_data, family = "binomial")  # add further predictors as needed
summary(model)
2. Examine Coefficients:
• The summary table displays coefficients for each independent variable.
• Positive coefficients indicate a positive association with the probability of
the outcome (increase in x increases probability).
• Negative coefficients indicate a negative association (increase in x decreases
probability).
3. Interpret Odds Ratios:
• Exponentiate coefficients to obtain odds ratios: exp(coefficient).
• Odds ratio represents the change in odds of the outcome occurring per unit
increase in the independent variable, holding other variables constant.
• Example: An odds ratio of 1.5 for x1 means a unit increase in x1 is
associated with 1.5 times higher odds of the outcome.
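As a sketch, odds ratios and Wald confidence intervals can be obtained from the fitted model above:
Code snippet
exp(coef(model))                                      # odds ratios
exp(cbind(OR = coef(model), confint.default(model)))  # with Wald confidence intervals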
4. Significance:
• P-values assess the statistical significance of each coefficient.
• Small p-values (e.g., less than 0.05) suggest a significant relationship
between the variable and the outcome.
Example:
Code snippet
model <- glm(purchased ~ age + income, data = customer_data, family = "binomial")
summary(model)
• Positive coefficient for age might suggest older customers are more likely to
purchase.
• Negative coefficient for income might suggest lower-income customers are
more likely to purchase.
Key Considerations:
• Check for multicollinearity (high correlation between independent
variables).
• Assess model fit using measures like AIC and residual diagnostics.
• Consider interactions between variables if relevant.
Logistic regression is a powerful tool for predicting binary outcomes and
understanding the factors that influence them.

Q5) Answer the following questions (Any One)

a) Revise the concept of Time series analysis. Explain how time series
analysis is used for business forecasting?
Time Series Analysis: Unveiling Patterns for Better Business Predictions
Revised Concept:
Time series analysis is a statistical modeling approach used to understand and forecast how a variable changes over time. It examines historical data points to reveal underlying components such as trend, seasonality, and noise, allowing future values to be predicted with greater confidence. It does not guarantee the future, but it provides the clearest data-driven glimpse available.
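As a minimal illustration in base R, using the built-in AirPassengers series (Holt-Winters smoothing is one of several possible models):
Code snippet
fit <- HoltWinters(AirPassengers)   # exponential smoothing with trend and seasonality
predict(fit, n.ahead = 12)          # forecast the next 12 months
plot(fit)                           # fitted values against the observed series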
Business Forecasting with Time Series Analysis:
Companies leverage time series analysis to anticipate trends in various aspects,
enhancing decision-making and gaining a competitive edge. Here's how it plays
out in different scenarios:
1. Sales Forecasting:
• Predicting future sales volume allows for efficient inventory management,
resource allocation, and targeted marketing campaigns.
• Time series analysis identifies seasonal patterns, promotional effects, and
economic influences on sales, leading to more accurate forecasts.
2. Demand Forecasting:
• Understanding future demand for products or services helps optimize
production planning, logistics, and personnel needs.
• This analysis accounts for external factors like weather patterns, competitor
strategies, and market fluctuations, ensuring resources are available when
needed.
3. Customer Churn Prediction:
• Identifying customers at risk of leaving helps implement retention strategies
before they say goodbye.
• Analyzing past customer behavior, engagement, and service interactions
reveals patterns that predict potential churn, allowing targeted interventions.
4. Financial Market Predictions:
• Time series analysis can be applied to historical stock prices, economic
indicators, and news sentiment to forecast future market trends.
• While not crystal balls, these models provide valuable insights for
investment decisions and portfolio management.
Benefits of Time Series Analysis for Business:
• Improved decision-making: Data-driven forecasts lead to more informed
strategic and operational decisions.
• Reduced uncertainty: Understanding future trends mitigates risks and allows
for proactive planning.
• Enhanced resource allocation: Efficient optimization of inventory,
personnel, and marketing budgets.
• Competitive advantage: Early identification of opportunities and threats
keeps businesses ahead of the curve.
In conclusion, time series analysis is a powerful tool for navigating the future with data. By harnessing historical patterns, companies can make informed decisions, optimize resources, and gain a competitive edge in today's dynamic market.

b) Write short Notes (Any one)

i) F Test in R

ii) Bayes Theorem


iii) Correlation analysis

Short Notes:


i) F Test in R
Purpose: Compare the variances of two populations.
How to perform:
• var.test(): takes two numeric vectors (or a formula) and returns the F statistic, degrees of freedom, p-value, and a confidence interval for the ratio of variances.
• anova(): used with linear models to compare nested models via the change in explained variance.
Interpretation:
• High F-statistic and low p-value: Variances likely different.
• Small F-statistic and high p-value: Variances likely equal.
Assumptions: Normality of data.
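A minimal sketch with simulated data (vectors a and b are hypothetical):
Code snippet
set.seed(1)
a <- rnorm(30, sd = 1)   # hypothetical sample 1
b <- rnorm(30, sd = 2)   # hypothetical sample 2
var.test(a, b)           # F test of the ratio of variances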
ii) Bayes Theorem
Formula: P(H|D) = P(D|H) P(H) / Σ_i P(D|H_i) P(H_i)
• Updates belief in a hypothesis (H) based on new evidence (D).
• P(H|D): Posterior probability of H given D.
• P(D|H): Likelihood of D happening given H is true.
• P(H): Prior probability of H.
Application: Updating predictions, diagnosing diseases, analyzing evidence.
Limitations: Relies on accurate priors and likelihoods.
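A small numeric sketch in R, using hypothetical disease-testing values:
Code snippet
prior <- 0.01   # P(H): prevalence of the condition
sens  <- 0.95   # P(D|H): probability of a positive test given the condition
fpr   <- 0.05   # P(D|not H): false positive rate
posterior <- (sens * prior) / (sens * prior + fpr * (1 - prior))
posterior       # P(H|D) ≈ 0.161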
iii) Correlation analysis
Purpose: Assess the strength and direction of linear relationships between two
variables.
Types:
• Pearson correlation (r): Measures linear association, ranges from -1 to 1.
• Spearman rank correlation: Non-parametric, focuses on monotonic
relationships.
• Kendall tau correlation: Non-parametric, measures concordance between
ranked data.
Interpretation:
• Positive correlation: Variables move in the same direction.
• Negative correlation: Variables move in opposite directions.
• 0 correlation: No linear relationship.
Limitations: Assumes linearity, may not capture complex relationships.
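A brief sketch on the built-in mtcars dataset:
Code snippet
cor(mtcars$mpg, mtcars$wt)                       # Pearson r (strong negative)
cor.test(mtcars$mpg, mtcars$wt)                  # adds a significance test and CI
cor(mtcars$mpg, mtcars$wt, method = "spearman")  # rank-based alternative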
