Unit 4 Notes

The document provides an overview of univariate, bivariate, and multivariate analysis, detailing their definitions, objectives, techniques, and applications. It also discusses parametric and non-parametric tests, highlighting their characteristics, examples, advantages, and limitations. Additionally, the document covers cluster analysis as a research methodology tool for identifying patterns in data.

Uploaded by

Nandhini Dhevi

Univariate Analysis

Definition
• Univariate analysis is the examination of a single variable in a dataset.
• It aims to summarize the variable's properties and extract meaningful insights about its distribution, central tendency, and variability.
Objectives
• To describe the data using summary statistics.
• To identify patterns or anomalies within the data.
• To understand the spread and central location of the data values.
Techniques
1. Measures of Central Tendency
o Mean: The average value of the data.
o Median: The middle value when data is sorted.
o Mode: The most frequently occurring value.
2. Measures of Dispersion
o Range: Difference between the highest and lowest values.
o Variance: The average squared deviation of the data points from the mean.
o Standard Deviation: The square root of the variance; it expresses the typical distance of values from the mean in the original units of the data.
o Interquartile Range (IQR): Difference between the third and first quartiles.
3. Visualization Tools
o Histograms: Show frequency distributions.
o Bar Charts: Represent categorical data.
o Pie Charts: Show proportions of categories.
o Frequency Tables: List data values along with their frequencies.
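
The measures listed above can be computed with Python's standard library. The sketch below is only an illustration: the `scores` list is invented, and the population formulas (`pvariance`, `pstdev`) are an assumed choice, since the notes do not say whether population or sample formulas are intended.

```python
import statistics
from collections import Counter

# Hypothetical exam scores, invented for illustration only.
scores = [55, 62, 62, 70, 74, 78, 81, 85, 90, 93]

# Measures of central tendency
mean = statistics.mean(scores)
median = statistics.median(scores)
mode = statistics.mode(scores)

# Measures of dispersion (population formulas; use variance/stdev for samples)
data_range = max(scores) - min(scores)
variance = statistics.pvariance(scores)
std_dev = statistics.pstdev(scores)

# quantiles(n=4) returns the three quartile cut points Q1, Q2, Q3.
q1, _, q3 = statistics.quantiles(scores, n=4)
iqr = q3 - q1

# Frequency table: each distinct value mapped to its count
frequency_table = Counter(scores)

print(mean, median, mode, data_range, variance, round(std_dev, 2), iqr)
```

Quartile conventions differ between textbooks and software, so the IQR reported by another tool may differ slightly from the value computed here.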
Applications
• Analyzing the distribution of student exam scores in a class.
• Summarizing the ages of employees in an organization.
• Understanding the revenue generated by a specific product.
Bivariate Analysis
Definition
• Bivariate analysis examines the relationship between two variables.
• It helps to identify correlations, associations, and dependencies between variables.
Objectives
• To determine if there is a relationship between the variables.
• To quantify the strength and direction of the relationship (positive or negative).
• To predict the value of one variable based on another.
Techniques
1. Correlation Analysis
o Pearson’s Correlation Coefficient: Measures the linear relationship between
two continuous variables (ranges from -1 to +1).
o Spearman’s Rank Correlation: Used for ordinal data or for monotonic relationships that are not necessarily linear.
2. Regression Analysis
o Simple Linear Regression: Predicts the value of a dependent variable (Y)
based on an independent variable (X) using the equation: Y = a + bX.
3. Cross-Tabulation
o Summarizes categorical data to show the frequency distribution of
combinations of variables.
4. Visualization Tools
o Scatter Plots: Show relationships between two continuous variables.
o Line Graphs: Illustrate trends over time.
o Boxplots: Compare data distributions between groups.
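
As a minimal sketch of the correlation and regression techniques above, the following computes Pearson's coefficient, Spearman's rank correlation (Pearson's coefficient applied to ranks), and the least-squares line Y = a + bX. The function names and the study-hours data are hypothetical, chosen only for illustration.

```python
import math

def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Spearman's rank correlation: Pearson's r computed on the ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))

def fit_line(x, y):
    """Least-squares intercept a and slope b for Y = a + bX."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    b = sum((p - mx) * (q - my) for p, q in zip(x, y)) / sum((p - mx) ** 2 for p in x)
    a = my - b * mx
    return a, b

# Hypothetical data: study hours and exam scores for five students.
hours = [1, 2, 3, 4, 5]
exam_scores = [52, 58, 61, 70, 74]

r = pearson_r(hours, exam_scores)
rho = spearman_rho(hours, exam_scores)
a, b = fit_line(hours, exam_scores)
print(round(r, 3), rho, round(a, 2), round(b, 2))
```

Because the scores rise with every additional hour, the rank correlation is exactly 1 even though the linear correlation is slightly below 1.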
Applications
• Analyzing the relationship between study hours and exam scores.
• Examining the impact of advertising expenditure on sales.
• Identifying correlations between employee experience and performance.

Multivariate Analysis
Definition
• Multivariate analysis involves studying three or more variables simultaneously.
• It helps uncover complex relationships and interactions among variables.
Objectives
• To understand the combined effect of multiple variables on an outcome.
• To identify patterns, clusters, and latent structures in data.
• To build predictive models that include multiple predictors.
Techniques
1. Multiple Regression
o Extends simple regression to include multiple independent variables: Y = a +
b1X1 + b2X2 + ... + bnXn.
2. Factor Analysis
o Identifies underlying factors or constructs that explain the correlations among
variables.
3. Cluster Analysis
o Groups similar data points based on shared characteristics (e.g., customer
segmentation).
4. Principal Component Analysis (PCA)
o Reduces the dimensionality of data while retaining as much variability as
possible.
5. Discriminant Analysis
o Classifies data into predefined categories based on predictor variables.
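
To make the multiple regression equation Y = a + b1X1 + b2X2 + ... + bnXn concrete, here is a small sketch that estimates the coefficients by solving the normal equations with Gaussian elimination. The helper names are hypothetical, the data are synthetic (generated from Y = 1 + 2X1 + 3X2 so the answer is known), and the code assumes the predictors are not perfectly collinear. In practice a statistics package would normally be used instead.

```python
def solve_linear_system(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        # Swap in the row with the largest absolute value in this column.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    # Back substitution
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def fit_multiple_regression(X, y):
    """Return [a, b1, ..., bn] minimizing squared error for Y = a + sum(bi * Xi)."""
    rows = [[1.0] + list(r) for r in X]  # leading 1.0 carries the intercept a
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(p)]
    return solve_linear_system(xtx, xty)

# Synthetic data generated from Y = 1 + 2*X1 + 3*X2 (no noise).
X = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 5)]
y = [1 + 2 * x1 + 3 * x2 for x1, x2 in X]

coeffs = fit_multiple_regression(X, y)
print([round(c, 6) for c in coeffs])
```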
Visualization Tools
• 3D Scatter Plots: Show relationships among three continuous variables.
• Heat Maps: Visualize correlations or frequency distributions.
• Parallel Coordinate Plots: Compare multiple variables simultaneously.
Applications
• Predicting customer purchase behavior using demographic and psychographic data.
• Analyzing economic trends influenced by inflation, unemployment, and GDP.
• Studying patient outcomes based on multiple health indicators.

Key Differences

Aspect        | Univariate                   | Bivariate                    | Multivariate
Definition    | Analysis of one variable     | Analysis of two variables    | Analysis of three or more variables
Focus         | Distribution and summary     | Relationship and correlation | Interactions and patterns
Techniques    | Central tendency, dispersion | Correlation, regression      | PCA, factor analysis, clustering
Visualization | Histograms, bar charts       | Scatter plots, line graphs   | 3D plots, heat maps
Applications  | Exam scores, demographics    | Study hours vs. grades       | Predictive modeling

1. Introduction

• Statistical tests are used to analyze data and draw conclusions about populations.
• Based on the assumptions they make about the population distribution, these tests are divided into parametric and non-parametric tests.

2. Parametric Tests

Definition

• Parametric tests are statistical tests that assume the data follow a specific distribution (commonly the normal distribution).

Key Characteristics

• Rely on assumptions about population parameters (e.g., mean, variance).
• Assume normality in the data.
• Require data on an interval or ratio scale.
• Are typically more powerful when their assumptions are met.

Examples of Parametric Tests

1. t-Test
o Compares means between two groups.
o Types:
   • Independent t-test (two unrelated groups).
   • Paired t-test (same group measured twice).
2. ANOVA (Analysis of Variance)
o Compares means among three or more groups to determine if at least one group differs significantly.
o Types of ANOVA:
   • One-Way ANOVA: Tests the effect of a single factor on a dependent variable. Example: Comparing average test scores among students from three different schools.
   • Two-Way ANOVA: Examines the effect of two independent variables and their interaction. Example: Analyzing the effect of gender and teaching method on test scores.
3. z-Test
o Used when the sample size is large (n > 30) and the population variance is known.
o Purpose: Compares a sample mean to a population mean, or two sample means, when the population variance is known.
o Types of z-Test:
   • One-Sample z-Test: Compares a sample mean to a known population mean. Example: Checking if the average exam score of a class differs from the national average.
   • Two-Sample z-Test: Compares the means of two independent groups. Example: Comparing male and female heights in a population.
   • Proportion z-Test: Compares proportions between two groups. Example: Testing the effectiveness of two marketing strategies based on customer conversion rates.
4. Pearson’s Correlation Coefficient
o Measures the strength of linear association between two variables.
o Key Features:
   • Range: -1 to +1.
   • +1: Perfect positive correlation.
   • -1: Perfect negative correlation.
   • 0: No linear relationship.
   • Assumes both variables are normally distributed and measured on an interval/ratio scale.
5. Regression Analysis
o Evaluates the relationship between dependent and independent variables.
o Simple Linear Regression:
   • One independent variable.
   • Example: Predicting salary based on years of experience.
   • Formula: Y = a + bX + ε, where:
      Y: Dependent variable.
      X: Independent variable.
      a: Intercept.
      b: Slope.
      ε: Error term.
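
The test statistics behind the z-test, the independent t-test, and one-way ANOVA can be sketched in a few lines. This is an illustration under stated assumptions rather than a full implementation: the function names and all data are invented, the t-test shown is the pooled-variance (equal variances) version, and only the z-test returns a p-value because the standard library provides the normal distribution but not the t or F distributions.

```python
import math
from statistics import NormalDist, mean, variance

def one_sample_z(sample, pop_mean, pop_sd):
    """z statistic and two-sided p-value, with the population SD assumed known."""
    z = (mean(sample) - pop_mean) / (pop_sd / math.sqrt(len(sample)))
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

def pooled_t(group1, group2):
    """Independent-samples t statistic (equal variances assumed) and its df."""
    n1, n2 = len(group1), len(group2)
    pooled_var = ((n1 - 1) * variance(group1) + (n2 - 1) * variance(group2)) / (n1 + n2 - 2)
    t = (mean(group1) - mean(group2)) / math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return t, n1 + n2 - 2

def one_way_anova_f(groups):
    """F statistic: between-group mean square divided by within-group mean square."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

# Hypothetical class of 36 scores tested against an assumed national mean of 70, SD 12.
class_scores = [72] * 18 + [78] * 18
z, p_value = one_sample_z(class_scores, pop_mean=70, pop_sd=12)

# Hypothetical scores for two unrelated groups, then for three schools.
t, df = pooled_t([1, 2, 3, 4, 5], [3, 4, 5, 6, 7])
f_stat = one_way_anova_f([[1, 2, 3], [2, 3, 4], [5, 6, 7]])

print(z, round(p_value, 4), t, df, f_stat)
```

To turn the t and F statistics into p-values, they would be compared against the t distribution with `df` degrees of freedom and the F distribution with (k - 1, n - k) degrees of freedom, typically via a statistics package or printed tables.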

Advantages

• Higher statistical power when assumptions are met.
• Provide precise estimates of parameters.

Limitations

• Sensitive to deviations from assumptions (e.g., normality, homoscedasticity).
• Not appropriate for ordinal or nominal data.

3. Non-Parametric Tests

Definition

• Non-parametric tests do not assume any specific population distribution.

Key Characteristics

• Do not require the data to follow a normal distribution.
• Can be used with ordinal, nominal, or non-metric data.
• Often called "distribution-free" tests.

Examples of Non-Parametric Tests

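1. Chi-Square Test
o Tests the association between categorical variables.
o Used as a goodness-of-fit test (one categorical variable against expected frequencies) or as a test of independence (association between two categorical variables).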

2. Mann-Whitney U Test
o Alternative to the independent t-test.
o Compares ranks between two groups.
3. Wilcoxon Signed-Rank Test
o Alternative to the paired t-test.
o Compares two related samples.
4. Kruskal-Wallis Test
o Alternative to one-way ANOVA.
o Compares ranks among three or more groups.
5. Spearman’s Rank Correlation
o Measures the strength of the monotonic relationship between two variables.
6. Friedman Test
o Alternative to repeated-measures ANOVA.
o Compares three or more related groups.
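
Two of the tests above are simple enough to sketch by hand: the Mann-Whitney U statistic, which works on ranks, and the chi-square statistic for a test of independence. The function names and data are hypothetical, and the sketch stops at the test statistics; p-values would come from tables or a statistics package.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def mann_whitney_u(group1, group2):
    """Mann-Whitney U: rank the pooled data, then compare rank sums."""
    n1, n2 = len(group1), len(group2)
    ranks = average_ranks(list(group1) + list(group2))
    rank_sum1 = sum(ranks[:n1])
    u1 = rank_sum1 - n1 * (n1 + 1) / 2
    u2 = n1 * n2 - u1
    return min(u1, u2)

def chi_square_independence(table):
    """Chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df

# Hypothetical data: two small groups with no overlap, and a 2x2 table of counts.
u = mann_whitney_u([1, 2, 3], [4, 5, 6])
chi2, df = chi_square_independence([[10, 20], [20, 10]])
print(u, round(chi2, 3), df)
```

A U of 0 is the most extreme value possible: every observation in one group ranks below every observation in the other.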

Advantages

• Flexible: Can be used with small sample sizes and non-normal data.
• Simple: Requires fewer assumptions.
• Suitable for ordinal and nominal data.

Limitations

• Lower statistical power than parametric tests when the parametric assumptions hold.
• Results may be less precise or harder to interpret.

4. Differences Between Parametric and Non-Parametric Tests

Aspect            | Parametric Tests                         | Non-Parametric Tests
Assumptions       | Assumes normal distribution.             | No assumption about data distribution.
Data Type         | Interval or ratio.                       | Ordinal, nominal, interval, or ratio.
Sample Size       | Requires larger sample sizes.            | Works with small sample sizes.
Statistical Power | Higher when assumptions are met.         | Lower statistical power.
Complexity        | More complex; requires more computation. | Simpler and easier to apply.

5. When to Use

Parametric Tests

• Use when:
o Data is continuous and normally distributed.
o Sample size is large enough to justify normality.

Non-Parametric Tests

• Use when:
o Data is not normally distributed.
o Sample size is small.
o Data is ordinal, nominal, or rank-based.
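
The guidance above can be written as a small decision rule. This is one possible reading, not a fixed standard: the function name, the argument names, and the cutoff of 30 (borrowed from the n > 30 rule quoted for the z-test) are all assumptions, and whether data "look normal" is a judgment the analyst still has to make.

```python
def suggest_test_family(scale, looks_normal, n, large_sample=30):
    """Suggest 'parametric' or 'non-parametric' from the rules of thumb above.

    scale: 'nominal', 'ordinal', 'interval', or 'ratio'.
    looks_normal: the analyst's judgment that the data are roughly normal.
    n: sample size; large_sample is an assumed cutoff, not a universal rule.
    """
    if scale in ("nominal", "ordinal"):
        return "non-parametric"
    if looks_normal or n > large_sample:
        return "parametric"
    return "non-parametric"

print(suggest_test_family("ratio", looks_normal=True, n=50))
print(suggest_test_family("ordinal", looks_normal=True, n=200))
print(suggest_test_family("interval", looks_normal=False, n=12))
```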

Cluster Analysis

Cluster Analysis in Research Methodology is a vital tool for identifying patterns or groups
in data. It helps researchers classify objects, variables, or cases into clusters based on their
similarities, enabling better understanding and decision-making. Cluster analysis is
particularly important in exploratory research where the primary aim is to discover hidden
patterns without predefined hypotheses.

Importance in Research Methodology

1. Exploratory Tool: Helps researchers identify natural groupings in data without prior
knowledge of group labels.
2. Data Reduction: Reduces a large dataset into manageable clusters for further
analysis.
3. Hypothesis Generation: Forms the basis for developing new hypotheses by
identifying patterns.
4. Multi-Disciplinary Application: Used across disciplines like social sciences,
biology, marketing, psychology, and healthcare.

Process of Cluster Analysis in Research

1. Define the Objective: Identify the purpose of clustering (e.g., segmenting populations, classifying variables).
2. Prepare the Data:
o Normalize/standardize data to ensure fair comparison.
o Address missing values and outliers.
3. Select Clustering Technique: Choose a method based on research objectives and
data type (e.g., hierarchical clustering for small datasets, K-Means for large datasets).
4. Choose Similarity/Dissimilarity Measure:
o Use appropriate metrics such as Euclidean distance, Manhattan distance, or
Cosine similarity.
5. Run the Clustering Algorithm: Apply the selected algorithm to the dataset.
6. Evaluate Cluster Validity:
o Assess the quality of clusters using metrics like the Silhouette Coefficient or the Dunn Index; the Elbow Method is a related heuristic for choosing the number of clusters.
7.
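
The data preparation, distance measure, and algorithm steps described above can be sketched with a bare-bones K-Means. Everything here is illustrative: the function names and customer data are invented, Euclidean distance is one of several possible choices, and the centroids start from the first k points so the run is reproducible (real implementations usually use random or smarter initialization).

```python
import math

def standardize(points):
    """Rescale each variable to mean 0 and standard deviation 1."""
    columns = list(zip(*points))
    means = [sum(c) / len(c) for c in columns]
    # "or 1.0" avoids division by zero for a constant column.
    sds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
           for c, m in zip(columns, means)]
    return [tuple((v - m) / s for v, m, s in zip(p, means, sds)) for p in points]

def euclidean(p, q):
    """Euclidean distance between two points."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def k_means(points, k, max_iter=100):
    """Assign each point to the nearest centroid, update centroids, repeat."""
    centroids = [points[i] for i in range(k)]  # deterministic start: first k points
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda c: euclidean(p, centroids[c]))
                      for p in points]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return labels, centroids

# Hypothetical customers: (age, monthly spend).
customers = [(25, 200), (27, 220), (26, 210), (55, 900), (58, 950), (60, 920)]
labels, centroids = k_means(standardize(customers), k=2)
print(labels)
```

Standardizing first matters here because spend is measured on a much larger scale than age and would otherwise dominate the distance.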
