( ) – optional: include it or not, as you prefer; it can also serve as a script
Highlights – examples to help the reporter understand, but it is fine not to include them in the
ppt
STATISTICAL TREATMENT
The term “statistical treatment” is a catch-all term which means to apply
any statistical method to your data.
This process is important for businesses because it allows them to take
customer feedback and turn it into actionable insights.
( Treatments are divided into two groups: descriptive statistics, which summarize your data as a
graph or summary statistic, and inferential statistics, which make predictions and test
hypotheses about your data. )
Descriptive Statistics
Descriptive statistics are used to describe the overall characteristics of a dataset.
The term ‘descriptive statistics’ can be used to describe both individual
quantitative observations (also known as ‘summary statistics’) as well as the
overall process of obtaining insights from these data.
Describe the features of populations and/or samples.
Organize and present data in a purely factual way.
Present final results visually, using tables, charts, or graphs.
Draw conclusions based on known data.
Use measures like central tendency, distribution, and variance.
Types of Descriptive Statistics
Distribution
shows us the frequency of different outcomes (or data points) in a
population or sample. (We can show it as numbers in a list or table, or we
can represent it graphically)
Examples
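A frequency distribution can be sketched in a few lines of Python; the survey responses below are made-up values for illustration only:

```python
from collections import Counter

# Made-up survey responses, for illustration only
observations = ["brown", "black", "blonde", "brown", "red",
                "black", "brown", "blonde", "brown", "black"]

# Count how often each outcome appears in the sample
distribution = Counter(observations)
for value, count in distribution.most_common():
    print(f"{value}: {count}")
```

Printed as a list, this is the same information a bar chart of the distribution would show graphically.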
Central Tendency
measurements that look at the typical central values within a dataset.
a general term used to describe a variety of central measurements.
might include central measurements from different quartiles of a larger
dataset.
Common measures of central tendency include:
The mean: The average value of all the data points.
The median: The central or middle value in the dataset.
The mode: The value that appears most often in the dataset.
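The three measures above can be computed directly with Python's standard statistics module; the exam scores below are made-up values for illustration:

```python
import statistics

# Made-up exam scores, for illustration only
scores = [70, 85, 85, 90, 75, 85, 60, 80]

print("Mean:", statistics.mean(scores))      # average of all data points
print("Median:", statistics.median(scores))  # middle value when sorted
print("Mode:", statistics.mode(scores))      # most frequent value
```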
Variability
Also known as dispersion.
describes how values are distributed or spread out.
Identifying variability relies on understanding the central tendency
measurements of a dataset. (However, like central tendency, variability is
not just one measure. It is a term used to describe a range of
measurements.)
Common measures of variability include:
Standard deviation: This shows us the amount of variation or dispersion. Low
standard deviation implies that most values are close to the mean. High standard
deviation suggests that the values are more broadly spread out.
Minimum and maximum values: These are the highest and lowest values in a
dataset or quartile. In an example dataset whose values run from 13 to 130, the
minimum and maximum values are 13 and 130 respectively.
Range: This measures the size of the distribution of values. It can be easily
determined by subtracting the smallest value from the largest. So, in the same
example dataset, the range is 117 (130 minus 13).
Kurtosis: This measures whether or not the tails of a given distribution contain
extreme values (also known as outliers). If a tail lacks outliers, we can say that it
has low kurtosis. If a dataset has a lot of outliers, we can say it has high kurtosis.
Skewness: This is a measure of a dataset’s symmetry. If you were to plot a bell-
curve and the right-hand tail was longer and fatter, we would call this positive
skewness. If the left-hand tail is longer and fatter, we call this negative skewness.
This is visible in the following image.
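The simpler measures of variability can also be checked numerically; a minimal sketch using made-up values that run from 13 to 130:

```python
import statistics

# Made-up values, for illustration only
values = [13, 45, 60, 72, 88, 95, 110, 130]

std_dev = statistics.stdev(values)           # sample standard deviation
minimum, maximum = min(values), max(values)  # lowest and highest values
value_range = maximum - minimum              # range = largest minus smallest

print(f"Std dev: {std_dev:.2f}")
print(f"Min: {minimum}, Max: {maximum}, Range: {value_range}")
```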
Inferential Statistics
Inferential statistics focus on making generalizations about a larger population
based on a representative sample of that population.
Because inferential statistics focus on making predictions (rather than stating facts),
their results are usually expressed in the form of a probability.
Use samples to make generalizations about larger populations.
Help us to make estimates and predict future outcomes.
Present final results in the form of probabilities.
Draw conclusions that go beyond the available data.
Use techniques like hypothesis testing, confidence intervals, and regression and
correlation analysis.
(Random sampling is very important for carrying out inferential techniques, but it is not
always straightforward)
Random sample
1. Define a population
This simply means determining the pool from which you will draw your sample. As we
explained earlier, a population can be anything—it isn’t limited to people. So it could be
a population of objects, cities, cats, pugs, or anything else from which we can derive
measurements!
2. Decide your sample size
The bigger your sample size, the more representative it will be of the overall population.
However, drawing large samples can be time-consuming, difficult, and expensive. Indeed, this is
why we draw samples in the first place: it is rarely feasible to draw data from an entire
population. Your sample size should therefore be large enough to give you confidence
in your results, but not so large that gathering the data becomes impractical; a sample
that is too small risks being unrepresentative (which is just shorthand for inaccurate).
This is where using descriptive statistics can help, as they allow us to strike a balance
between size and accuracy.
3. Randomly select a sample
Once you’ve determined the sample size, you can draw a random selection. You might
do this using a random number generator, assigning each value a number and selecting
the numbers at random. Or you could do it using a range of similar techniques or
algorithms (we won’t go into detail here, as this is a topic in its own right, but you get the
idea).
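Step 3 can be sketched with Python's random module; the population of 1,000 numbered members below is hypothetical:

```python
import random

# Hypothetical population: 1,000 members, each assigned a number
population = list(range(1, 1001))

random.seed(42)  # fixed seed so the example is reproducible
sample = random.sample(population, k=50)  # 50 members, drawn without replacement

print(f"Sample size: {len(sample)}")
print(f"All members unique: {len(set(sample)) == len(sample)}")
```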
4. Analyze the data sample
Once you have a random sample, you can use it to infer information about the larger
population. It’s important to note that while a random sample is representative of a
population, it will never be 100% accurate. For instance, the mean (or average) of a
sample will rarely match the mean of the full population, but it will give you a good idea
of it. For this reason, it’s important to incorporate your error margin in any analysis
(which we cover in a moment). This is why, as explained earlier, any result from
inferential techniques is in the form of a probability.
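The point that a sample mean approximates, but rarely equals, the population mean can be demonstrated with simulated data (all numbers below are made up):

```python
import random
import statistics

random.seed(0)
# Simulated population of 10,000 values (normal, mean 100, sd 15)
population = [random.gauss(100, 15) for _ in range(10_000)]
population_mean = statistics.mean(population)

# A random sample's mean lands close to, but not exactly on, the true mean
sample = random.sample(population, k=200)
sample_mean = statistics.mean(sample)

print(f"Population mean: {population_mean:.2f}")
print(f"Sample mean:     {sample_mean:.2f}")
```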
However, presuming we’ve obtained a random sample, there are many inferential
techniques for analyzing and obtaining insights from those data. The list is long, but
some techniques worthy of note include:
Hypothesis testing
Confidence intervals
Regression and correlation analysis
Hypothesis testing
involves checking that your samples repeat the results of your hypothesis (or
proposed explanation).
aim is to rule out the possibility that a given result has occurred by chance.
A topical example of this is the clinical trials for the COVID-19 vaccines. Since it’s
impossible to carry out trials on an entire population, we carry out numerous trials on
several random, representative samples instead.
The hypothesis test, in this case, might ask something like: ‘Does the vaccine reduce
severe illness caused by COVID-19?’ By collecting data from different sample groups, we
can infer whether the vaccine will be effective. If all samples show similar results, and we
know that they are representative and random, we can generalize that the vaccine will have
the same effect on the population at large. On the flip side, if one sample shows higher
or lower efficacy than the others, we must investigate why this might be. For instance,
maybe there was a mistake in the sampling process, or perhaps the vaccine was
delivered differently to that group. In fact, in one COVID-19 vaccine trial, a dosing error
meant that one group actually showed higher efficacy than the other groups, which shows
how important hypothesis testing can be. If that outlier group had simply been written off,
the more effective dosing regimen would have been missed!
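As a rough sketch of the idea (not the actual trial methodology), a two-proportion z-test checks whether a difference between two sample groups is larger than chance alone would explain; all trial numbers below are invented:

```python
import math

# Invented trial numbers, for illustration only
vaccine_cases, vaccine_n = 10, 10_000  # severe cases in the vaccine group
placebo_cases, placebo_n = 60, 10_000  # severe cases in the placebo group

p1 = vaccine_cases / vaccine_n
p2 = placebo_cases / placebo_n
p_pooled = (vaccine_cases + placebo_cases) / (vaccine_n + placebo_n)

# z-statistic for the difference between the two proportions
se = math.sqrt(p_pooled * (1 - p_pooled) * (1 / vaccine_n + 1 / placebo_n))
z = (p1 - p2) / se

print(f"z = {z:.2f}")  # |z| > 1.96 rejects "no effect" at the 5% level
```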
Confidence interval
used to estimate certain parameters for a measurement of a population (such as
the mean) based on sample data.
Rather than providing a single mean value, the confidence interval provides a
range of values.
This is often given as a percentage.
For example, let’s say you’ve measured the tails of 40 randomly selected cats.
You get a mean length of 17.5cm. You also know the standard deviation of tail
lengths is 2cm. Using a standard formula, we can then state a range of values within
which the mean tail length of the full population of cats is likely to fall, at a 95%
confidence level.
Essentially, this tells us that we are 95% certain that the population mean
(which we cannot know without measuring the full population) falls within the
given range. This technique is very helpful for measuring the degree of
accuracy within a sampling method.
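The cat-tail example can be worked through with the usual normal-approximation formula (CI = mean ± 1.96 × sd / √n); this is a sketch, not the only way to compute an interval:

```python
import math

# Numbers from the cat-tail example above
sample_mean = 17.5  # cm
sample_sd = 2.0     # cm
n = 40
z = 1.96            # z-score for a 95% confidence level

# 95% CI = mean +/- z * (sd / sqrt(n))
margin = z * sample_sd / math.sqrt(n)
lower, upper = sample_mean - margin, sample_mean + margin

print(f"95% CI: {lower:.2f}cm to {upper:.2f}cm")
```

So we would report that the population mean tail length falls between roughly 16.88cm and 18.12cm, with 95% confidence.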
Regression Analysis
aims to determine how one dependent (or output) variable is impacted by one or
more independent (or input) variables.
It’s often used for hypothesis testing and predictive analytics.
For example, to predict future sales of sunscreen (an output variable), you might
compare last year’s sales against weather data (both input variables) to see
how much sales increased on sunny days.
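A minimal version of this idea is ordinary least squares with one input variable; the sunshine and sales figures below are invented:

```python
# Invented data: hours of sunshine (input) vs sunscreen units sold (output)
sun_hours = [2, 4, 6, 8, 10]
sales = [20, 40, 60, 80, 100]

n = len(sun_hours)
mean_x = sum(sun_hours) / n
mean_y = sum(sales) / n

# Ordinary least squares: slope = sum((x - mean_x)(y - mean_y)) / sum((x - mean_x)^2)
slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(sun_hours, sales)) \
    / sum((x - mean_x) ** 2 for x in sun_hours)
intercept = mean_y - slope * mean_x

# Use the fitted line to predict sales for a 12-hour sunny day
predicted = slope * 12 + intercept
print(f"sales = {slope:.1f} * hours + {intercept:.1f}; 12 hours -> {predicted:.0f}")
```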
Correlation Analysis
measures the degree of association between two or more datasets.
correlation does not imply cause and effect.
For instance, ice cream sales and sunburn are both likely to be higher on sunny days—
we can say that they are correlated. But it would be incorrect to say that ice cream
causes sunburn!
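The degree of association can be quantified with the Pearson correlation coefficient r (a value between -1 and 1); the daily counts below are invented:

```python
import math

# Invented daily counts: ice cream sales vs sunburn cases
ice_cream = [10, 20, 30, 40, 50]
sunburns = [1, 3, 4, 6, 9]

n = len(ice_cream)
mean_x = sum(ice_cream) / n
mean_y = sum(sunburns) / n

# Pearson's r: covariance divided by the product of the spreads
cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(ice_cream, sunburns))
r = cov / math.sqrt(sum((x - mean_x) ** 2 for x in ice_cream)
                    * sum((y - mean_y) ** 2 for y in sunburns))

print(f"r = {r:.2f}")  # strongly correlated, but that does not prove causation
```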