Unit 3

The document provides an overview of data types, including qualitative and quantitative data, and further divides quantitative data into discrete and continuous categories. It explains the construction and purpose of frequency curves, standard deviation, variance, covariance, quartiles, and percentiles, detailing their calculations and significance in statistical analysis. Additionally, it highlights the differences between frequency curves, polygons, and histograms.

Uploaded by

anjalitak906

DATA AND ITS TYPES

Data: A set of values recorded on one or more observational units; equivalently, factual
information collected during research studies.

Qualitative data: Variables that yield observations by which individuals can be categorized
according to certain characteristics or qualities are referred to as qualitative variables or data, e.g.
gender, occupation, marital status, and educational level.

Quantitative data: Variables that yield observations that can be measured are considered
quantitative data, e.g. height, weight, blood pressure, serum cholesterol, body temperature.
Quantitative data are further divided into discrete and continuous data.

Discrete data: Data recorded as whole numbers are called discrete data, such as the number of
children in a family, pulse rate, ESR, blood sugar, blood pressure, etc. This can be understood with
the following example. The pulse rates recorded for ten people are as follows:
80, 72, 75, 82, 77, 83, 86, 74, 78, 88

Continuous data: Data that can be measured in fractional values, such as height, weight, body
temperature, chest circumference, etc., are called continuous data. This can be understood with the
following example:

The weights of ten students of the 10th class, recorded in kilograms, are as follows:

45.7, 50.2, 48.9, 48.4, 56.5, 44.5, 47.8, 47.8, 45.5, 46.3

Frequency Curve:
A frequency curve is a smooth, graphical representation of a frequency distribution, which shows
how often different values occur in a dataset. The curve is formed by connecting the midpoints of
the top edges of a histogram's bars with a freehand, smooth curve.

Construction of Frequency Curve/How to draw Frequency Curve:

X-axis:
It represents the variable being measured (e.g., height, weight, scores).
Y-axis:
It represents the frequency, or how many times each value (or range of values) occurs.
Shape:
The curve's shape can reveal information about the data distribution.
Common shapes include:

1) Normal distribution (bell curve): Symmetrical, with the highest frequency in the center and
tails tapering off on both sides.
2) Skewed distributions: Asymmetrical, with a longer tail on one side, indicating a higher
concentration of values towards one end.
3) U-shaped curve: Has a low frequency in the center and higher frequencies at the extremes.
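4) J-shaped curve: Has its peak at one extreme of the range and slopes steadily away from it
(rising to a peak at the upper end, or, for a reverse J, starting with a high peak and sloping downward).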
5) Mixed curve: A combination of different shapes.

[Figure: Normal (bell-shaped), skewed, U-shaped, and J-shaped frequency curves]

Purpose of a frequency curve:

Visual representation: It provides a clear visual representation of how the data is distributed.
Comparison: It allows for easy comparison of different datasets or distributions.
Understanding the underlying distribution: It helps to understand the shape, symmetry, and
spread of the data.

Difference between frequency curve, polygon, and histogram:

Histogram: Uses bars to represent the frequency of each class interval.
Frequency polygon: Joins the midpoints of the top edges of the histogram bars with straight lines.
Frequency curve: Connects the midpoints of the histogram bars with a smooth, freehand curve,
rather than straight lines.

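The relationship between the three graphs can be sketched in a few lines of Python. This is only an illustration: it reuses the ten student weights from the continuous-data example, and the class intervals (3 kg wide, starting at 44 kg) are an assumption chosen for the example, not something the notes prescribe. The frequencies give the histogram bar heights, and the (midpoint, frequency) pairs are the points that a frequency polygon joins with straight lines and a frequency curve joins with a smooth line.

```python
from collections import Counter

# Weights (kg) of ten students, from the continuous-data example.
weights = [45.7, 50.2, 48.9, 48.4, 56.5, 44.5, 47.8, 47.8, 45.5, 46.3]

# Assumed class intervals: 3 kg wide, starting at 44 kg (illustrative choice).
start, width = 44.0, 3.0


def class_index(value):
    """Return the index of the class interval that contains value."""
    return int((value - start) // width)


counts = Counter(class_index(w) for w in weights)
n_classes = max(counts) + 1

# Histogram: one bar per class interval, bar height = frequency.
frequencies = [counts.get(i, 0) for i in range(n_classes)]

# Frequency polygon / curve: one point per class at (midpoint, frequency).
midpoints = [start + width * (i + 0.5) for i in range(n_classes)]

for mid, freq in zip(midpoints, frequencies):
    lower, upper = mid - width / 2, mid + width / 2
    print(f"{lower:.0f}-{upper:.0f} kg  midpoint {mid:.1f}  frequency {freq}  " + "#" * freq)
```

Plotting the midpoints against the frequencies with any charting tool would give the polygon; smoothing that line by eye gives the frequency curve.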
STANDARD DEVIATION:
Standard deviation is a measure of the dispersion of statistical data. It shows how much the values
vary from the mean, i.e. it indicates a “typical” deviation from the mean. Because every value enters
the calculation, a change in even one value affects the standard deviation.

Standard Deviation is denoted by (σ)

Calculation of Standard Deviation:

$$\sigma = \sqrt{\text{Variance}}$$

VARIANCE:
Variance is a statistical measure that shows how much the values in a dataset deviate from the mean
(average). It gives a sense of how spread out or concentrated the data is.

• If variance is low, data points are close to the mean.

• If variance is high, data points are spread out over a wider range.

Calculation of variance:

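$$\text{Variance}(\sigma^2) = \frac{\sum (x - \bar{x})^2}{N}$$

where x̄ is the mean and N is the number of data values, so that

$$\text{Variance} = \sigma^2$$
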
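As a worked illustration, the calculation can be sketched in Python using the ten pulse rates from the discrete-data example. The sketch assumes the population form of the variance (dividing the sum of squared deviations by the number of values); a sample estimate would divide by one less than that.

```python
import math

# Pulse rates of ten people, from the discrete-data example.
pulse = [80, 72, 75, 82, 77, 83, 86, 74, 78, 88]

n = len(pulse)
mean = sum(pulse) / n

# Squared deviation of each value from the mean.
squared_deviations = [(x - mean) ** 2 for x in pulse]

# Population variance: average of the squared deviations (assumed form).
variance = sum(squared_deviations) / n

# Standard deviation is the square root of the variance.
std_dev = math.sqrt(variance)

print(f"mean = {mean}")
print(f"variance = {variance:.2f}")
print(f"standard deviation = {std_dev:.2f}")
```

For these values the mean is 79.5 and the variance is about 24.85, so a typical pulse rate lies roughly 5 beats from the mean.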
Covariance of Data:
Covariance is a measure of the relationship between two random variables: it describes the extent to
which they change together, i.e. whether a change in one variable tends to be accompanied by a change
in the other. Covariance is expressed in units obtained by multiplying the units of the two variables.

Types of Covariance
Covariance can have both positive and negative values. Based on this, it has two types:

• Positive Covariance
• Negative Covariance

Positive Covariance
If the covariance of two variables is positive, the variables move in the same direction and show
similar behaviour: greater values of one variable correspond to greater values of the other, and
lesser values to lesser values.

Negative Covariance
If the covariance of two variables is negative, the variables move in opposite directions. It is
the opposite case of positive covariance: greater values of one variable correspond to lesser
values of the other, and vice versa.

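Covariance formula (population form; a sample estimate divides by N − 1 instead):

$$\text{Cov}(X, Y) = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{N}$$

Where,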

xi = data value of x

yi = data value of y

x̄ = mean of x

ȳ = mean of y
N = number of data values.

If cov(X, Y) is greater than zero, the covariance is positive and the two variables move in the
same direction.

If cov(X, Y) is less than zero, the covariance is negative and the two variables move in opposite
directions.

If cov(X, Y) is zero, there is no linear relation between the two variables.
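The three cases can be checked with a short Python sketch. It assumes the population form of covariance (products of deviations from the means, summed and divided by N), and the three small data sets are made up purely for illustration.

```python
def covariance(xs, ys):
    """Population covariance: mean of the products of paired deviations."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("xs and ys must be non-empty and of equal length")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n


# Hypothetical data, chosen only to illustrate the three cases.
x = [1, 2, 3, 4, 5]
rises_with_x = [2, 4, 6, 8, 10]   # moves in the same direction as x
falls_with_x = [10, 8, 6, 4, 2]   # moves in the opposite direction
u_shaped = [4, 1, 0, 1, 4]        # equals (x - 3)**2: related, but not linearly

print(covariance(x, rises_with_x))   # positive
print(covariance(x, falls_with_x))   # negative
print(covariance(x, u_shaped))       # zero
```

The last case is worth noting: the U-shaped values depend entirely on x, yet the covariance is zero, because covariance only picks up a straight-line tendency.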

Quartile:
Quartiles are a set of three values that divide an ordered data set into four equal parts.
The middle quartile marks the central point of the distribution and shows where the data near the
centre lie. The lower quartile summarises the half of the data that falls below the median, and the
upper quartile summarises the remaining half, which falls above the median.

Because quartiles divide the entire set into four equal parts, there are three of them: the first,
second and third, represented by Q1, Q2 and Q3, respectively. Q2 is the median; since it is defined by
the position of an item in the ordered list, it is a positional average. To find the quartiles of a
group of data, first arrange the data in ascending order.

Quartiles (Q1, Q2, Q3)

Quartiles Formula
Q3 is the upper quartile, the median of the upper half of the data set, whereas Q1 is the lower
quartile, the median of the lower half of the data set. Q2 is the median. If the data set has n items,
the quartiles are given by:

Q1 = [(n+1)/4]th item

Q2 = [(n+1)/2]th item

Q3 = [3(n+1)/4]th item
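A minimal Python sketch of these position formulas follows, applied to the ten pulse rates from the discrete-data example. The notes do not say what to do when a position is fractional; the sketch assumes linear interpolation between the two neighbouring items, which is one common convention among several.

```python
def value_at_position(ordered, position):
    """Value at a 1-based position in sorted data.

    Fractional positions are resolved by linear interpolation between
    the neighbouring items (an assumed convention).
    """
    n = len(ordered)
    position = min(max(position, 1), n)   # keep the position inside the data
    lower = int(position)                 # whole-number part of the position
    fraction = position - lower
    if lower == n:
        return float(ordered[-1])
    return ordered[lower - 1] + fraction * (ordered[lower] - ordered[lower - 1])


def quartiles(data):
    """Return (Q1, Q2, Q3) using the k(n+1)/4-th item rule."""
    ordered = sorted(data)                # quartiles need ascending order
    n = len(ordered)
    return tuple(value_at_position(ordered, k * (n + 1) / 4) for k in (1, 2, 3))


pulse = [80, 72, 75, 82, 77, 83, 86, 74, 78, 88]
q1, q2, q3 = quartiles(pulse)
print(q1, q2, q3)
```

With n = 10 the positions are 2.75, 5.5 and 8.25, so each quartile falls between two recorded values; Q2 lands midway between the 5th and 6th items, which is the usual median of an even-sized data set.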
Percentile:
A percentile is a statistical measure that indicates the relative standing of a value within a dataset.
For example, if a student scores in the 90th percentile on a test, they have scored better than 90% of
the other students who took the test. A percentile is a measure used to indicate the value below which
a given percentage of observations in a group of observations fall.

Formula of Percentile

For calculating the percentile of 'x' in the data,

Percentile = (Number of values below 'x'/Total number of values) × 100
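The formula translates directly into Python. The sketch below uses the data set 10, 20, ..., 100 as example input; the function name `percentile_rank` is just a label chosen here.

```python
def percentile_rank(data, x):
    """Percentage of values in data that lie strictly below x."""
    if not data:
        raise ValueError("data must not be empty")
    below = sum(1 for value in data if value < x)
    return 100 * below / len(data)


scores = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]

# Seven of the ten values are below 80, so 80 sits at the 70th percentile.
print(percentile_rank(scores, 80))
```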

Steps for calculating percentile:

Step 1: Arrange Data

Sort the data set in ascending order.

Step 2: Calculate Rank

After arranging the data in order, we need to calculate the rank. The formula for rank is given as

Rank = (Desired Percentile/100) × (n+1)

Where n is the number of observations.

Step 3: Find the Value

Case 1: If the rank is a whole number, the value at that position in the ordered dataset is the desired
percentile.

Case 2: If the rank is a decimal, interpolate between the values at the two neighbouring whole-number
positions to find the percentile value.

The general formula for the rank R of the Pth percentile is:

$$R = \frac{P}{100} \times (n + 1)$$

where P is the desired percentile and n is the number of observations.

Let's assume we have the following data set: 10, 20, 30, 40, 50, 60, 70, 80, 90, 100. To find the 70th
percentile:

$$R = \frac{70}{100} \times (10 + 1) = 7.7$$

The 70th percentile lies between the 7th and 8th values. Thus, the 70th percentile is a value between
70 and 80; with linear interpolation it is 70 + 0.7 × (80 − 70) = 77.
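The three steps can be put together in a short Python sketch. It assumes linear interpolation for decimal ranks and clamps ranks that would fall outside the data; both are choices made for this illustration, and other percentile conventions exist.

```python
def percentile_value(data, p):
    """Value at the p-th percentile using rank = p/100 * (n + 1)."""
    ordered = sorted(data)                # Step 1: arrange the data
    n = len(ordered)
    rank = p * (n + 1) / 100              # Step 2: calculate the rank
    rank = min(max(rank, 1), n)           # keep the rank inside the data (assumption)
    lower = int(rank)                     # position just below the rank
    fraction = rank - lower
    if lower == n:
        return float(ordered[-1])
    # Step 3: whole-number rank -> that value; decimal rank -> interpolate.
    return ordered[lower - 1] + fraction * (ordered[lower] - ordered[lower - 1])


values = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]

# Rank 7.7 lies between the 7th value (70) and the 8th value (80).
print(round(percentile_value(values, 70), 2))
```

For the example data the 70th percentile comes out at about 77, inside the 70 to 80 range identified above.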
