0% found this document useful (0 votes)
7 views17 pages

Stat Intro 01 June 2020

Statistics is the science of learning from data, providing a framework for scientific research and decision-making across various sectors, including technology and pharmaceuticals. It involves data collection, presentation, analysis, and interpretation, with applications in understanding consumer behavior and improving product quality. Different sampling techniques, such as simple random, stratified, systematic, and cluster sampling, are used to gather insights from populations and make inferences about larger groups.

Uploaded by

codeworks.edu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views17 pages

Stat Intro 01 June 2020

Statistics is the science of learning from data, providing a framework for scientific research and decision-making across various sectors, including technology and pharmaceuticals. It involves data collection, presentation, analysis, and interpretation, with applications in understanding consumer behavior and improving product quality. Different sampling techniques, such as simple random, stratified, systematic, and cluster sampling, are used to gather insights from populations and make inferences about larger groups.

Uploaded by

codeworks.edu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

STATISTICS

FOR DATA SCIENCE


2

Statistics—the science of learning from data—provides a framework for


scientific research of all kinds.

Statistics widely used to understand the complex problems of real world to


simplify them to take well informed decisions.
3
Which sector implements statistics?

Companies make thousands of


From Apple to Oracle and products every day and each
Statistics is critical to helping
Google to Microsoft, company must make sure that a
companies understand
statisticians work at a variety of good quality item is sold. But a
consumer tastes and
technology companies to company can't test each and every
preferences so products can
advance the computing item that they ship to you, the
be more effectively marketed.
industry, from ensuring consumer. So the company uses
They gather and analyze data
reliability of hardware statistics to test just a few,
that helps guide the
components to improving the called a sample, of what they
marketing strategies of global
quality of speech recognition make. If the sample passes quality
brands
and image analysis software tests, then the company assumes
applications that all the items made in the
group, called a batch, are good.
4

Pharmaceutical companies Why are some plants more Wal-Mart, a worldwide leading
employ statistics to help them resistant to disease than retailer, keeps track of
develop new drugs that are others? What is the impact everything they sell and use
safe and effective. They often of pesticides on crop statistics to calculate what to
work on teams with doctors and production and the ship to each store and when.
research scientists to design environment? From analysing their vast store
and execute experiments and Statisticians work with of information.
clinical trials. teams of experts to for example, Wal-Mart decided
answer questions that people buy strawberry Pop
Tarts when a hurricane is
predicted in___! So they ship
this product to stores based
upon the weather forecast.
5
Statistics

 Statistics is defined as “ the science which deals


• Data collection
• Presentation of Data
• Analyzing the Data
• Interpretation the Data
• and draw conclusions”.
Statistical methods allow us to accumulate knowledge.
6
Data and Variables

 Data are the facts and figures collected, analyzed, and summarized for
presentation.

 All the data collected in a particular study are referred to as the data set for the
study.

 A variable is a characteristic of interest for the elements.


7

Qualitative variable takes values that are names or labels.


Variable
Ex :
• Region – South, North, East, West
• Gender – Male, Female
• Nationality – Indian, American, African
Qualitative Quantitative
Quantitative variable takes numeric values. They represent
Categorical Numerical
a measurable quantity.
Ex:
• Height Discrete Continuous
• Weight
• Salary
• temperature etc.
8

Discrete variable arise from a counting process. Answers


the question How Many?
Ex :
• Number of students
• Number of defect pieces Quantitative
• Number of workers
• Gender of the customer

Continuous variable arise from a measuring process.


Answers the question How Much?
Ex :

Discrete continuous
Height
• Weight
• Amount of rainfall
• temperature
9

Population and Sample

 Population may refer to an entire group


of people, objects, events, hospital visits,
or measurements. A population can thus
be said to be an aggregate observation of
subjects grouped together by a common
feature.

 Sample is the subset of the population.


The process of selecting a sample is
known as sampling. Number of elements
in the sample is the sample size.
10
Sampling Techniques

 Simple random sampling


 Stratified sampling
 Systematic sampling
 Cluster sampling
Simple 11
random
Sampling

Every element has an


equal chance of getting
selected to be the part
sample. It is used when
we don’t have any kind of
prior information about
the target population.
12

Stratified Sampling

This technique divides the elements of the


population into small subgroups (strata)
based on the similarity in such a way that the
elements within the group are homogeneous
and heterogeneous among the other
subgroups formed. And then the elements
are randomly selected from each of these
strata. We need to have prior information
about the population to create subgroups.
13
Systematic
Sampling

Selection of elements is
systematic and not
random except the first
element. Elements of a
sample are chosen at
regular intervals of
population. All the
elements are put together
in a sequence first where
each element has the
equal chance of being
selected.
14
Cluster
Sampling

Cluster sampling is used


in statistics when natural
groups are present in a
population. The whole
population is subdivided
into clusters, or groups,
and random samples are
then collected from each
group.
15
Stratified vs Cluster Sampling

 In stratified sampling, a sample is drawn


from each strata (using a random sampling
method like simple random sampling
or systematic sampling). let's say you need a
sample size of 6. Two members from each
group (yellow, red, and blue) are selected
randomly.
 In cluster sampling, the sampling unit is the
whole cluster; Instead of sampling individuals
from within each group, a researcher will study
whole clusters.
16
Types of Statistics

Descriptive Statistics:
 Descriptive statistics helps to organize and
Statistics
focus on the main characteristics of the data.
 Summaries of data, which may be tabular,
graphical, or numerical.

Inferential Statistics :
making inferences about the larger Population on
Descriptive Inferential
the basis of data collected from a sample or
samples.
Thank you

You might also like